Timothy Greenwood writes:
> There seems to be no mapping table from TIS620-2533 to/from Unicode. It
> looks as though it is just a simple direct mapping (TIS code - 0xA0 + 0x0E00
> = Unicode for range A0-FF), but it would be nice to have an officially
> blessed chart.
It's there, it's just not obviously there.
The closest thing is the Microsoft CP874 mapping, which is almost
identical to TIS620-2533. The difference lies in extra codepoints
added by Microsoft in CP874: several in the C1 range, including 0x80
(Euro) and 0x85 (horizontal ellipsis), plus 0xA0 (non-breaking space),
which sits just above C1.
http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP874.TXT
This table is correct, and will work for both pure TIS620-2533 and
CP874 data, since pure TIS620-2533 text simply never uses the extra
CP874 codepoints.
Note that this is a mapping for the ASCII version of TIS620-2533, not
the EBCDIC version. IBM and Apple also have their own variants.
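
For anyone who wants to convince themselves that the "simple direct
mapping" in the quoted message agrees with the CP874 table, here is a
rough Python sketch. It assumes Python's built-in "cp874" codec follows
the CP874.TXT table above; tis620_to_unicode, THAI_RANGES,
mismatches_against_cp874 and CP874_EXTRAS are hypothetical names made up
for this illustration. It also assumes the offset rule only applies to
the assigned Thai bytes (0xA1-0xDA and 0xDF-0xFB), not to the whole
A0-FF range.

```python
# Sketch of the offset rule from the quoted message, checked against
# Python's built-in cp874 codec (assumed to follow the CP874.TXT table).

# Byte ranges that TIS620-2533 assigns to Thai characters; 0xDB-0xDE and
# 0xFC-0xFF are treated as unassigned here.
THAI_RANGES = (range(0xA1, 0xDB), range(0xDF, 0xFC))


def tis620_to_unicode(byte: int) -> str:
    """Map one TIS620-2533 byte to a Unicode character (hypothetical helper)."""
    if byte < 0x80:
        return chr(byte)                  # ASCII half is unchanged
    if any(byte in r for r in THAI_RANGES):
        return chr(byte - 0xA0 + 0x0E00)  # direct offset into the Thai block
    raise ValueError(f"0x{byte:02X} is not assigned in TIS620-2533")


def mismatches_against_cp874() -> list:
    """Return the Thai bytes where the offset rule and cp874 disagree."""
    return [b for r in THAI_RANGES for b in r
            if bytes([b]).decode("cp874") != tis620_to_unicode(b)]


# The CP874-only additions mentioned above, decoded with the built-in codec.
CP874_EXTRAS = {b: bytes([b]).decode("cp874") for b in (0x80, 0x85, 0xA0)}
```

If mismatches_against_cp874() comes back empty, the offset rule and the
table agree on every assigned Thai byte, and CP874_EXTRAS shows what the
Microsoft additions decode to.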
-tree
--
Tom Emerson
Basis Technology Corp.
Zenkaku Language Hacker
http://www.basistech.com
"Beware the lollipop of mediocrity: lick it once and you suck forever"
This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:21:13 EDT