RE: Thai conversion

From: Chris Wendt (christw@microsoft.com)
Date: Fri Sep 15 2000 - 16:12:04 EDT


fyi A graphic representation of code page 874 (very small superset of
TIS-620) is on http://www.microsoft.com/globaldev/reference/sbcs/874.htm

-----Original Message-----
From: Tom Emerson [mailto:tree@basistech.com]
Sent: Friday, September 15, 2000 12:38 PM
To: Unicode List
Cc: Unicode List
Subject: Thai conversion

Timothy Greenwood writes:
> There seems to be no mapping table from TIS620-2533 to/from Unicode. It
> looks as though it is just a simple direct mapping (TIS code - 0xA0 +
0x0E00
> = Unicode for range A0-FF), but it would be nice to have an officially
> blessed chart.

It's there, it's just not obviously there.

The closest thing is the Microsoft CP874 mapping, which is almost
identical to TIS620-2533. The difference lies in extra codepoints in
C1 added by Microsoft in CP874, including 0x80 (Euro), 0x85
(horizontal ellipsis), and 0xA0 (non-breaking space).

http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP874.TXT

This table is correct, and will work for pure TIS620-2533 and CP874.

Note that this is a mapping for the ASCII-version of TIS620-2533, not
the EBCDIC version. IBM and Apple also have their own variants.

    -tree

-- 
Tom Emerson                                          Basis Technology Corp.
Zenkaku Language Hacker                            http://www.basistech.com
  "Beware the lollipop of mediocrity: lick it once and you suck forever"



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:21:13 EDT