From: Mark Davis (mark.davis@icu-project.org)
Date: Fri Jan 20 2006 - 10:05:10 CST
A couple of things. First, look at the extracted files for the numeric
information, since they also cover the CJK ideographs.
http://www.unicode.org/Public/UNIDATA/extracted/DerivedNumericType.txt
http://www.unicode.org/Public/UNIDATA/extracted/DerivedNumericValues.txt
Secondly, where letters are given numeric values as part of a traditional
numbering system (such as the Greek or Hebrew systems), those values are
not marked in these files. If you are interested in non-decimal systems,
I'd suggest consulting Georges Ifrah's book for background information.
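
As a rough illustration of both points, here is a minimal sketch in Python.
It assumes the standard unicodedata module, which ships its own (newer)
copy of the Unicode data rather than the 4.0 files discussed in this thread.
The helper name numeric_or_none is made up for the example.

```python
import unicodedata


def numeric_or_none(ch):
    """Return the Numeric_Value of a single character as a float,
    or None when the character has no numeric value assigned."""
    # The second argument is the default returned instead of raising ValueError.
    return unicodedata.numeric(ch, None)


# CJK ideographs for ten, six, eight and hundred (juu, roku, hachi, hyaku).
for ch in "\u5341\u516d\u516b\u767e":
    print(f"U+{ord(ch):04X}", unicodedata.name(ch), numeric_or_none(ch))

# Greek alpha and Hebrew alef: used as numerals in traditional systems,
# but they carry no numeric value in the character data.
for ch in "\u03b1\u05d0":
    print(f"U+{ord(ch):04X}", unicodedata.name(ch), numeric_or_none(ch))
```

With the data bundled in current Python releases, the four ideographs
report 10, 6, 8 and 100, while the two letters report None.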
Mark
Tom Emerson wrote:
>Kit Peters writes:
>
>
>>As I mentioned in an earlier post, I am investigating the parsing of
>>non-Western numerals. An example of non-Western numerals would certainly be
>>kanji, but in looking through the 4.0 UnicodeData.txt, I see no entries for
>>the kanji (Juu, Roku, Hachi, Hyaku). Why is this?
>>
>>
>
>The Unified Ideographs are documented in Unihan.txt, not UnicodeData.txt.
>
>Ideographs with numeric uses will have kPrimaryNumeric,
>kAccountingNumeric, or kOtherNumeric values.
>
> -tree