Re: 3rd-party cross-platform UTF-8 support

From: Andy Heninger (andyh@jtcsv.com)
Date: Mon Sep 24 2001 - 02:05:43 EDT


From: "Marcin 'Qrczak' Kowalczyk" <qrczak@knm.org.pl>

> Why would UTF-16 be easier for internal processing than UTF-8?
> Both are variable-length encodings.
>

Performance tuning is easier with UTF-16. You can optimize for
BMP characters, knowing that surrogate pairs are sufficiently uncommon
that it's OK for them take a bail-out slow path.

Andy Heninger
IBM, Cupertino, CA
heninger@us.ibm.com



This archive was generated by hypermail 2.1.2 : Mon Sep 24 2001 - 01:24:59 EDT