Thai conversion

From: Tom Emerson (tree@basistech.com)
Date: Fri Sep 15 2000 - 15:31:03 EDT


Timothy Greenwood writes:
> There seems to be no mapping table from TIS620-2533 to/from Unicode. It
> looks as though it is just a simple direct mapping (TIS code - 0xA0 + 0x0E00
> = Unicode for range A0-FF), but it would be nice to have an officially
> blessed chart.

It's there, it's just not obviously there.

The closest thing is the Microsoft CP874 mapping, which is almost
identical to TIS620-2533. The difference lies in the extra codepoints
Microsoft added in CP874: several in C1, including 0x80 (Euro) and
0x85 (horizontal ellipsis), plus 0xA0 (non-breaking space), which sits
just above C1.

http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP874.TXT

This table is correct, and will work for both pure TIS620-2533 and
CP874 data, since the Thai assignments are the same in each.

Note that this is a mapping for the ASCII-version of TIS620-2533, not
the EBCDIC version. IBM and Apple also have their own variants.
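
For what it's worth, here is a rough Python sketch of the direct
mapping Timothy describes, checked against the CP874 table. The
function names are mine (hypothetical, not from any official source),
and I'm assuming the "cp874" codec that ships with Python follows the
Microsoft table above. Note that the arithmetic only covers the
assigned Thai bytes (0xA1-0xDA and 0xDF-0xFB), not all of A0-FF.

```python
# Hypothetical helpers for illustration; not an official mapping.

def tis620_byte_to_unicode(b: int) -> str:
    """Map one byte of ASCII-based TIS620-2533 to a Unicode character."""
    if b < 0x80:
        return chr(b)  # the ASCII half is unchanged
    # Only 0xA1-0xDA and 0xDF-0xFB are assigned Thai characters.
    if 0xA1 <= b <= 0xDA or 0xDF <= b <= 0xFB:
        return chr(b - 0xA0 + 0x0E00)
    raise ValueError(f"byte 0x{b:02X} is not assigned in TIS620-2533")


def cp874_extras() -> dict:
    """Collect high bytes that CP874 decodes but the arithmetic rejects."""
    extras = {}
    for b in range(0x80, 0x100):
        try:
            tis620_byte_to_unicode(b)
        except ValueError:
            try:
                extras[b] = bytes([b]).decode("cp874")
            except UnicodeDecodeError:
                pass  # unassigned in CP874 as well
    return extras


if __name__ == "__main__":
    for byte, char in sorted(cp874_extras().items()):
        print(f"0x{byte:02X} -> U+{ord(char):04X}")
```

Running it lists the Microsoft additions, so it's an easy way to see
where the two encodings part ways.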

    -tree

-- 
Tom Emerson                                          Basis Technology Corp.
Zenkaku Language Hacker                            http://www.basistech.com
  "Beware the lollipop of mediocrity: lick it once and you suck forever"


