Re: 3rd-party cross-platform UTF-8 support

From: Tom Emerson (tree@basistech.com)
Date: Mon Sep 24 2001 - 13:35:53 EDT


Andy Heninger writes:
> Performance tuning is easier with UTF-16. You can optimize for
> BMP characters, knowing that surrogate pairs are sufficiently uncommon
> that it's OK for them take a bail-out slow path.

Sure, but if you are using UTF-16 (or any other multibyte encoding)
you loose the ability to index characters in an array in constant
time. For some applications that isn't desirable.

    -tree

-- 
Tom Emerson                                          Basis Technology Corp.
Sr. Sinostringologist                              http://www.basistech.com
  "Beware the lollipop of mediocrity: lick it once and you suck forever"



This archive was generated by hypermail 2.1.2 : Mon Sep 24 2001 - 12:24:32 EDT