Re: Endless endianness annoyance

From: Mark Leisher (mleisher@crl.nmsu.edu)
Date: Wed Dec 03 1997 - 13:11:08 EST


    Gianni> Use UTF-8 !

Some problems with this:

1. What if the UTF-8 is not normalized to a form expected by Unicode
   support on my platform?

   I would have to convert UTF-8 to UCS-2 anyway to make sure all
   characters are fully decomposed or fully composed, depending on what
   my Unicode support expects! Otherwise, the search pattern will
   probably fail to match text that it should match.

2. What if the data only comes in UCS-2 form?
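
To make both points concrete, here is a minimal sketch in Python. It is
only an illustration: the sample word, the helper name contains(), and
the choice of NFC as the target form are my own assumptions, and
Python's "utf-16-be"/"utf-16-le" codecs stand in for UCS-2, which is
only equivalent for characters in the Basic Multilingual Plane.

```python
import unicodedata

composed = "caf\u00e9"     # e-acute as the single code point U+00E9
decomposed = "cafe\u0301"  # "e" followed by U+0301 COMBINING ACUTE ACCENT

# Both strings are valid UTF-8 once encoded, but the bytes differ.
utf8_composed = composed.encode("utf-8")
utf8_decomposed = decomposed.encode("utf-8")


def contains(haystack_utf8: bytes, needle: str, form: str = "NFC") -> bool:
    """Decode UTF-8, normalize both sides to one form, then search.

    Hypothetical helper for illustration; `form` is whichever
    normalization form the platform's Unicode support expects.
    """
    text = unicodedata.normalize(form, haystack_utf8.decode("utf-8"))
    return unicodedata.normalize(form, needle) in text


# Point 1: without normalization the pattern misses text it should match.
naive_match = composed in utf8_decomposed.decode("utf-8")
normalized_match = contains(utf8_decomposed, composed)

# Point 2: data that arrives as 16-bit units must be decoded with the
# right byte order; the wrong order yields different characters.
ucs2_be = composed.encode("utf-16-be")
right_order = ucs2_be.decode("utf-16-be")
wrong_order = ucs2_be.decode("utf-16-le")
```

Here naive_match comes out false and normalized_match true, so using
UTF-8 on the wire does not remove the need for a decode-and-normalize
step before searching.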
------------------------------------------------------------------------
mleisher@crl.nmsu.edu
Mark Leisher
Computing Research Lab
New Mexico State University
Box 30001, Dept. 3CRL
Las Cruces, NM 88003

"A designer knows he has achieved perfection
not when there is nothing left to add, but
when there is nothing left to take away."
                 -- Antoine de Saint-Exupéry


