Re: writing Chinese dialects

From: Andrew West (andrewcwest@gmail.com)
Date: Fri Jan 26 2007 - 05:16:04 CST

  • Next message: Andrew West: "Re: writing Chinese dialects"

    On 26/01/07, "Arne Götje (高盛華)" <arne@linux.org.tw> wrote:
    >
    > Further more I'd like to notice, that Hong Kong has released the HKSCS
    > standard, which contains more than 4000 additional characters compared
    > to Big5. Some of these characters are in Extension A and B of Unicode.

    All 4,941 characters in HKSCS-2004 are now representable in Unicode
    without recourse to the PUA. Here is a summary of their mapping to
    Unicode:

    Latin-1 Supplement [0080..00FF] : 21
    Latin Extended-A [0100..017F] : 12
    Latin Extended-B [0180..024F] : 10
    IPA Extensions [0250..02AF] : 9
    Spacing Modifier Letters [02B0..02FF] : 1
    Cyrillic [0400..04FF] : 66
    Latin Extended Additional [1E00.1EFF] : 4
    Letterlike Symbols [2100..214F] : 2
    Number Forms [2150.218F] : 10
    Arrows [2190..21FF] : 3
    Miscellaneous Technical [2300..23FF] : 2
    Enclosed Alphanumerics [2460..24FF] : 20
    Box Drawing [2500..257F] : 33
    Dingbats [2700..27BF] : 1
    CJK Radicals Supplement [2E80..2FFF] : 28
    Kangxi Radicals [2F00..2FDF] : 1
    CJK Symbols and Punctuation [3000..303F] : 3
    Hiragana [3040..309F] : 87
    Katakana [30A0..30FF] : 89
    CJK Strokes [31C0..31EF] : 16
    Enclosed CJK Letters and Months [3200..32FF] : 1
    CJK Unified Ideographs Extension A [3400..4DBF] : 562
    CJK Unified Ideographs [4E00..9FFF] : 2,255
    CJK Compatibility Ideographs [F900..FFFF] : 8
    CJK Unified Ideographs Extension B [20000..2A6DF] : 1,682
    CJK Compatibility Ideographs Supplement [2F800..2FA1D] : 11
    Combining Sequences [Latin Extended-A + Combining Diacritical Marks] : 4

    See <http://www.info.gov.hk/digital21/eng/hkscs/document.html> dor details.

    Andrew



    This archive was generated by hypermail 2.1.5 : Fri Jan 26 2007 - 05:18:49 CST