Re: 3rd-party cross-platform UTF-8 support

From: Marcin 'Qrczak' Kowalczyk (qrczak@knm.org.pl)
Date: Sat Sep 22 2001 - 06:06:56 EDT


Thu, 20 Sep 2001 12:46:49 -0700 (PDT), Kenneth Whistler <kenw@sybase.com> pisze:

> If you are expecting better performance from a library that takes UTF-8
> API's and then does all its internal processing in UTF-8 *without*
> converting to UTF-16, then I think you are mistaken. UTF-8 is a bad
> form for much of the kind of internal processing that ICU has to do
> for all kinds of things -- particularly for collation weighting, for
> example. Any library worth its salt would *first* convert to UTF-16
> (or UTF-32) internally, anyway, before doing any significant semantic
> manipulation of the characters.

Why would UTF-16 be easier for internal processing than UTF-8?
Both are variable-length encodings.

-- 
 __("<  Marcin Kowalczyk * qrczak@knm.org.pl http://qrczak.ids.net.pl/
 \__/
  ^^                      SYGNATURA ZASTĘPCZA
QRCZAK



This archive was generated by hypermail 2.1.2 : Sat Sep 22 2001 - 05:34:36 EDT