From: Arthit Suriyawongkul (Arthit.Suriyawongkul@sun.com)
Date: Mon Oct 07 2002 - 02:24:05 EDT
Dear Elliotte,
ISO-8859-11 has been finished for almost a year :)
For reference:
ISO-8859-11 8-bit single-byte coded graphic character sets -- Part 11:
Latin/Thai alphabet
http://www.iso.ch/iso/en/CatalogueDetailPage.CatalogueDetail?CSNUMBER=28263&ICS1=35&ICS2=40&ICS3
or see its draft at
http://www.nectec.or.th/it-standards/iso8859-11/
ISO-8859-11 = TIS-620 + { NBSP }
NBSP = Non-Breaking Space (0xA0 / U+00A0)
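
The relation above can be sketched as a small byte-to-Unicode function. This is only an illustration, not an official mapping table: the function names are made up, it assumes the usual TIS-620 layout (0xA1-0xDA and 0xDF-0xFB map to U+0E01-U+0E3A and U+0E3F-U+0E5B), and it assumes bytes below 0xA0 map to the same code points, as in the other ISO 8859 mapping files.

```python
def iso8859_11_to_unicode(byte):
    """Return the Unicode code point for one ISO-8859-11 byte, or None if undefined."""
    if not 0 <= byte <= 0xFF:
        raise ValueError("not a single byte: %r" % (byte,))
    if byte < 0xA0:
        # Assumption: ASCII and the C1 control range map to themselves.
        return byte
    if byte == 0xA0:
        # The one addition over TIS-620: NBSP.
        return 0x00A0
    if 0xA1 <= byte <= 0xDA or 0xDF <= byte <= 0xFB:
        # Thai block: constant offset from 0xA1 to U+0E01.
        return byte - 0xA1 + 0x0E01
    # 0xDB-0xDE and 0xFC-0xFF are undefined.
    return None


# Every code point that has a byte in ISO-8859-11 under the assumptions above.
ENCODABLE = {iso8859_11_to_unicode(b) for b in range(256)} - {None}


def needs_escape(ch):
    """True if the character has no ISO-8859-11 byte and must be escaped on output."""
    return ord(ch) not in ENCODABLE
```

With such a table, an output routine could check needs_escape(ch) for each character and write a numeric character reference for anything that is not encodable.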
----
Regarding the "REPLACEMENT CHARACTER" entries in the mapping from
http://crl.nmsu.edu/~mleisher/csets.html:
IMHO, they should be replaced with "UNDEFINED", since the Unicode mapping
of CP874 (another Thai encoding, almost the same as Windows-874; these
are TIS-620 variants)
http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP874.TXT
also uses "UNDEFINED".
You may be interested in some comments from
http://bugzilla.mozilla.org/show_bug.cgi?id=127755 :)
regards,
Art
Elliotte Rusty Harold wrote:
> The Unicode data files at
> http://www.unicode.org/Public/MAPPINGS/ISO8859/ do not include a mapping
> for ISO-8859-11, Thai. Is there any particular reason for this? Is
> ISO-8859-11 unfinished or deprecated or unable to be mapped to Unicode
> or some such? If none of these things are true, is there a mapping chart
> for this set somewhere?
>
> I'm working on adding output support for the ISO sets to XOM
> (http://www.cafeconleche.org/XOM/) and need the mapping tables to figure
> out which characters need to be escaped in which character sets.
>
> --
> Elliotte Harold