Re: ISO 8859-11 (Thai) cross-mapping table

From: Doug Ewell (dewell@adelphia.net)
Date: Sat Oct 05 2002 - 17:52:06 EDT

  • Next message: Doug Ewell: "Re: Sporadic Unicode revisited"

    Elliotte Rusty Harold <elharo at metalab dot unc dot edu> wrote:

    >> You can get Unicode-format mapping tables for TIS 620 and many other
    >> encodings at http://crl.nmsu.edu/~mleisher/csets.html
    >
    > Thanks. Looking at that, it appears the mapping is imperfect. There
    > are about 10 characters in TIS-620 that are mapped to the Unicode
    > replacement character. This is from 1998 though. Has Unicode's Thai
    > support improved any in later versions?

    These 9 code positions (0xA0, 0xDB..0xDE, 0xFC..0xFF) appear to be
    undefined in TIS 620.2533. Reference [3] below does show a "word
    separator character" at 0xDC, which I interpret as U+200B ZERO WIDTH
    SPACE, but the other positions are still undefined. So this may not be
    a case of Unicode having to play catch-up with the Thai standard.

    -Doug Ewell
     Fullerton, California

    [1] http://www.langbox.com/codeset/tis620.html
    [2] http://www.nectec.or.th/it-standards/std620/std620.htm
    [3]
    http://www.tru64unix.compaq.com/docs/base_doc/DOCUMENTATION/V40E_HTML/SU
    PPDOCS/THAIDOC/THAICH2.HTM



    This archive was generated by hypermail 2.1.5 : Sat Oct 05 2002 - 18:34:04 EDT