Re: Kanji and other non-Western numerals

From: Mark Davis (mark.davis@icu-project.org)
Date: Fri Jan 20 2006 - 10:05:10 CST


    A couple of things. First, look at the extracted files for the numeric
    information, since they also cover the CJK ideographs.

    http://www.unicode.org/Public/UNIDATA/extracted/DerivedNumericType.txt
    http://www.unicode.org/Public/UNIDATA/extracted/DerivedNumericValues.txt
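
    As a rough illustration of reading these files, here is a minimal Python
    sketch. It assumes each data line has the form "code point or range ;
    value ; ... # comment", with the value in the second field. The exact
    field layout has varied between Unicode versions, so check the header of
    the file you download. The sample lines are hand-written in that assumed
    layout, not copied from the file, and parse_numeric_values is a
    hypothetical helper name.

```python
from fractions import Fraction

# Hand-written sample lines in the ASSUMED layout (not copied from the file).
SAMPLE = """\
# code point(s) ; numeric value # comment
0030 ; 0 # DIGIT ZERO
00BD ; 1/2 # VULGAR FRACTION ONE HALF
5341 ; 10 # CJK UNIFIED IDEOGRAPH-5341
767E ; 100 # CJK UNIFIED IDEOGRAPH-767E
"""


def parse_numeric_values(text):
    """Map each code point to its numeric value (hypothetical helper)."""
    values = {}
    for raw in text.splitlines():
        # Drop the trailing comment, then skip blank or comment-only lines.
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue
        fields = [field.strip() for field in line.split(";")]
        if len(fields) < 2 or not fields[1]:
            continue
        # The first field is either one code point or a "start..end" range.
        first, _, last = fields[0].partition("..")
        start = int(first, 16)
        end = int(last, 16) if last else start
        # Fraction accepts both "1/2" and "0.5" style values.
        value = Fraction(fields[1])
        for code_point in range(start, end + 1):
            values[code_point] = value
    return values


values = parse_numeric_values(SAMPLE)
for code_point, value in sorted(values.items()):
    print(f"U+{code_point:04X} {value}")
```

    Using Fraction rather than float keeps values such as 1/2 exact, whichever
    of the two spellings the file uses.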

    Second, where letters are given numeric values as part of a traditional
    numbering system (such as the Greek or Hebrew systems), those values are
    not marked in the Unicode data. If you are interested in non-decimal
    systems, I'd suggest consulting Georges Ifrah's book for background
    information.
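
    One quick way to see the difference, sketched with Python's standard
    unicodedata module. This assumes the interpreter's bundled Unicode data
    includes the ideograph numeric values, which may not hold for older
    versions. numeric_or_none is a hypothetical helper name.

```python
import unicodedata


def numeric_or_none(ch):
    """Return the character's numeric value, or None if it has none."""
    return unicodedata.numeric(ch, None)


# The four kanji from the question, then a Greek and a Hebrew letter that
# carry values only in their traditional numbering systems.
for ch in ("十", "六", "八", "百", "α", "א"):
    print(f"U+{ord(ch):04X}", numeric_or_none(ch))
```

    The ideographs should report values, while the Greek and Hebrew letters
    should report None, since their traditional values are not recorded as
    character properties.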

    Mark

    Tom Emerson wrote:

    >Kit Peters writes:
    >
    >
    >>As I mentioned in an earlier post, I am investigating the parsing of
    >>non-Western numerals. An example of non-Western numerals would certainly be
    >>kanji, but in looking through the 4.0 UnicodeData.txt, I see no entries for
    >>the kanji (Juu, Roku, Hachi, Hyaku). Why is this?
    >>
    >>
    >
    >The Unified Ideographs are documented in Unihan.txt, not UnicodeData.txt.
    >
    >Ideographs with numeric uses will have kPrimaryNumeric,
    >kAccountingNumeric, or kOtherNumeric values.
    >
    > -tree


