HuaiYu 发表于 2006-6-23 23:12:00

[LISP]怎么样判断汉字?

<P>如题:例 (setq strA "汉字") (setq strB "HanZi")</P>
<P>请问怎么判断 strA 为汉字 strB不为汉字??</P>
<P>谢谢.....</P>

hhc 发表于 2006-6-24 11:00:00

<P>ASCII&nbsp;码&nbsp;&nbsp;<BR>27&nbsp;ESC&nbsp;32&nbsp;SPACE&nbsp;33&nbsp;!&nbsp;34&nbsp;"&nbsp;<BR>35&nbsp;#&nbsp;36&nbsp;$&nbsp;37&nbsp;%&nbsp;38&nbsp;&amp;&nbsp;<BR>39&nbsp;'&nbsp;40&nbsp;(&nbsp;41&nbsp;)&nbsp;42&nbsp;*&nbsp;<BR>43&nbsp;+&nbsp;44&nbsp;'&nbsp;45&nbsp;-&nbsp;46&nbsp;.&nbsp;<BR>47&nbsp;/&nbsp;48&nbsp;0&nbsp;49&nbsp;1&nbsp;50&nbsp;2&nbsp;<BR>51&nbsp;3&nbsp;52&nbsp;4&nbsp;53&nbsp;5&nbsp;54&nbsp;6&nbsp;<BR>55&nbsp;7&nbsp;56&nbsp;8&nbsp;57&nbsp;9&nbsp;58&nbsp;:&nbsp;<BR>59&nbsp;;&nbsp;60&nbsp;&lt;&nbsp;61&nbsp;=&nbsp;62&nbsp;&gt;&nbsp;<BR>63&nbsp;?&nbsp;64&nbsp;@&nbsp;65&nbsp;A&nbsp;66&nbsp;B&nbsp;<BR>67&nbsp;C&nbsp;68&nbsp;D&nbsp;69&nbsp;E&nbsp;70&nbsp;F&nbsp;<BR>71&nbsp;G&nbsp;72&nbsp;H&nbsp;73&nbsp;I&nbsp;74&nbsp;J&nbsp;<BR>75&nbsp;K&nbsp;76&nbsp;L&nbsp;77&nbsp;M&nbsp;78&nbsp;N&nbsp;<BR>79&nbsp;O&nbsp;80&nbsp;P&nbsp;81&nbsp;Q&nbsp;82&nbsp;R&nbsp;<BR>83&nbsp;S&nbsp;84&nbsp;T&nbsp;85&nbsp;U&nbsp;86&nbsp;V&nbsp;<BR>87&nbsp;W&nbsp;88&nbsp;X&nbsp;89&nbsp;Y&nbsp;90&nbsp;Z&nbsp;<BR>91&nbsp;[&nbsp;92&nbsp;\&nbsp;93&nbsp;]&nbsp;94&nbsp;^&nbsp;<BR>95&nbsp;_&nbsp;96&nbsp;`&nbsp;97&nbsp;a&nbsp;98&nbsp;b&nbsp;<BR>99&nbsp;c&nbsp;100&nbsp;d&nbsp;101&nbsp;e&nbsp;102&nbsp;f&nbsp;<BR>103&nbsp;g&nbsp;104&nbsp;h&nbsp;105&nbsp;i&nbsp;106&nbsp;j&nbsp;<BR>107&nbsp;k&nbsp;108&nbsp;l&nbsp;109&nbsp;m&nbsp;110&nbsp;n&nbsp;<BR>111&nbsp;o&nbsp;112&nbsp;p&nbsp;113&nbsp;q&nbsp;114&nbsp;r&nbsp;<BR>115&nbsp;s&nbsp;116&nbsp;t&nbsp;117&nbsp;u&nbsp;118&nbsp;v&nbsp;<BR>119&nbsp;w&nbsp;120&nbsp;x&nbsp;121&nbsp;y&nbsp;122&nbsp;z&nbsp;<BR>123&nbsp;{&nbsp;124&nbsp;|&nbsp;125&nbsp;}&nbsp;126&nbsp;~</P>
<P>以上为各种字符的ASCII码,用ASCII函数遍历字符串中的每个字符,如果其值超出这个范围的即为汉字,当然还有可能为其他文字,如日文等,汉字的ASCII码好象有一个专门的范围,分什么区位码,没研究过.</P>

wzz1968 发表于 2006-6-24 11:53:00

<P>(if (&gt; &nbsp;(ascii str) 160)</P>
<P>&nbsp;&nbsp;&nbsp;&nbsp;汉字</P>
<P>&nbsp;&nbsp; </P>

hhc 发表于 2006-6-24 16:00:00

(ascii "δ") →166    (ascii "★") →161 (ascii "≈") →161   (ascii "→") →161 (ascii "э") →167显然这5个不是汉字

HuaiYu 发表于 2006-6-24 19:11:00

为什么不是128呢???我怀疑是128呢,但没有证据............

NetBee 发表于 2006-6-24 21:23:00

就是128了!

wzz1968 发表于 2006-6-25 14:59:00

<P>这个问题说起来比较复杂,在计算中,一个字符用一个八为二进制表示,称为一个字节.共有256个状态.前128个已经定义为标准的ASCII码.</P>
<P>后128后用语非拉丁系双字节模式.如中文\日文\韩文等,一般用两个字节表示一个字.</P>
<P>在汉字系统中,汉字及其各种双字节符号由两个大于160的扩充ASCII码表示,为了表示方便.用第一个字节的ASCII码减去160称为区码,用第二个字节的ASCII码减去160称为位码,</P>
<P>区码和位码各有94个.</P>
<P>汉字系统的分区中,第一区表示标准的标点符号,第二区是各种序号,第三区是标准ASCII玛,第六区是西腊字母,第九区是制表符.</P>
<P>大于16区以上为汉字区.</P>
<P>故判断方法是:</P>
<P>(if (&gt; (asscii str) 175)</P>
<P>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;汉字</P>
<P>&nbsp;</P>

HuaiYu 发表于 2006-6-25 18:44:00

<P>(Defun C:test ()<BR>&nbsp; (setq str "在汉字系统中汉字及其各种双字节符号由两个大于的扩充码表示为了表示方便用第一个字节的码减去称为区码用第二个字节的码减去称为位码")<BR>&nbsp; (setq i 1)<BR>&nbsp; (repeat (fix (* (strlen str) 0.5))<BR>&nbsp;&nbsp;&nbsp; (setq code (substr str i 2)<BR>&nbsp;&nbsp; i&nbsp;&nbsp;&nbsp; (+ 2 i)<BR>&nbsp;&nbsp;&nbsp; )<BR>&nbsp;&nbsp;&nbsp; (princ "\n")<BR>&nbsp;&nbsp;&nbsp; (princ code)<BR>&nbsp;&nbsp;&nbsp; (princ "&nbsp; ----------&nbsp;")<BR>&nbsp;&nbsp;&nbsp; (princ (ascii code))<BR>&nbsp; )<BR>&nbsp; (prin1)<BR>)</P>
<P>确实没有小于 175 的,</P>
<P>谢谢 <A name=20674><FONT color=#000066><B>wzz1968</B></FONT></A></P>
<P>请问 <A name=20674><FONT color=#000066><B>wzz1968</B></FONT></A> 大侠 你是怎么知道这个东西的啊,哪里的资料,</P>
<P><A name=20674><FONT color=#000066><B></B></FONT></A>&nbsp;</P>

wzz1968 发表于 2006-6-27 07:56:00

<P>回答楼上的:</P>
<P>这是非常基础的计算知识.现在估计找不到这样的资料了,能说清楚这个人也不多.</P>
<P>&nbsp;</P>

kexiya123 发表于 2011-5-26 09:09:02

(Defun C:test ()
(setqstr (getstring " 输入需要判断ASCII码的字符:"))
(setq i 1)
(repeat (fix (* (strlen str) 0.5))
    (setq code (substr str i 2)
   i    (+ 2 i)
    )
    (princ "\n")
    (princ code)
    (princ "---------- ")
    (princ (ascii code))
)
(prin1)
)

略作修改,这样更方便查看
页: [1] 2
查看完整版本: [LISP]怎么样判断汉字?