From: Elliotte Rusty Harold (elharo@metalab.unc.edu)
Date: Sun Apr 20 2003 - 15:56:20 EDT
For XOM, I need to build up tables of which Unicode characters are
representable in different character sets. The ISO-8859 sets are
easy, relatively speaking. Now I'm moving into the tougher stuff like
Big5 and the other pre-Unicode Asian character sets.
Is there anywhere I can find or piece together a *complete* list of
Unicode characters that are available in Big5 (and other similar
sets)? I've looked at unihan.txt, and it has part of what I need but
not all of it. It specifies which Unicode Han characters are
available in which other character sets. However, most of these
character sets include various ASCII characters, Greek letters,
symbols, digits, and so forth. These do not appear to be listed in
unihan.txt.
-- +-----------------------+------------------------+-------------------+ | Elliotte Rusty Harold | elharo@metalab.unc.edu | Writer/Programmer | +-----------------------+------------------------+-------------------+ | Processing XML with Java (Addison-Wesley, 2002) | | http://www.cafeconleche.org/books/xmljava | | http://www.amazon.com/exec/obidos/ISBN%3D0201771861/cafeaulaitA | +----------------------------------+---------------------------------+ | Read Cafe au Lait for Java news: http://www.cafeaulait.org/ | | Read Cafe con Leche for XML news: http://www.cafeconleche.org/ | +----------------------------------+---------------------------------+
This archive was generated by hypermail 2.1.5 : Sun Apr 20 2003 - 16:30:09 EDT