Greetings,
Recently, while working on some code that does lexical analysis on Japanese
text, I came across the following sequence in some of my test data (culled
from various sources on the WWW):
U+4E5D U+3007 U+5E74
(CJK ideograph nine; ideographic number zero; the ideograph for "year", on reading 'nen')
I was interested to see that U+3007 is not classified as a Decimal Digit, but
only as Numeric, while the ideographic numbers, such as U+4E5D, are not
classified as Numeric at all.
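For anyone who wants to check these properties directly, here is a minimal sketch using Python's standard `unicodedata` module. The choice of tool and the helper name `describe` are assumptions for illustration only, not part of the original observation. The values reported depend on the Unicode database version bundled with the interpreter, so a newer database may well report a numeric value for U+4E5D.

```python
import unicodedata


def describe(ch):
    """Return (code point, general category, decimal value, numeric value).

    `describe` is a hypothetical helper for illustration. The decimal and
    numeric lookups return None when the character lacks that property.
    """
    return (
        f"U+{ord(ch):04X}",
        unicodedata.category(ch),       # e.g. 'Nd', 'Nl', 'Lo'
        unicodedata.decimal(ch, None),  # None if not a decimal digit
        unicodedata.numeric(ch, None),  # None if no numeric value is recorded
    )


# The sequence from the test data: U+4E5D U+3007 U+5E74
SEQUENCE = "\u4e5d\u3007\u5e74"
rows = [describe(ch) for ch in SEQUENCE]

for row in rows:
    print(row)
```

In the output, U+3007 should show general category Nl (letter number) with a numeric value of zero but no decimal value, which matches the classification described above.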
Thanks in advance,
-tre
-- 
Tom Emerson                                  Basis Technology Corp.
Language Hacker                              http://www.basistech.com
"Beware the lollipop of mediocrity: lick it once and you suck forever"
This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:48 EDT