Re: UTF-8S (was: Re: ISO vs Unicode UTF-8)

From: DougEwell2@cs.com
Date: Tue Jun 05 2001 - 22:04:52 EDT


In a message dated 2001-06-05 10:25:17 Pacific Daylight Time,
cbrown@xnetinc.com writes:

> Normally a strcmp function just loops through the string comparing them
> character by character. If the loop checks for surrogates and compares
> UTF-32 code points you will always get the same result for all encoding,
> the standard Unicode code point order.
>
> Ultimately this is the "do it right the first time" way of implementing
> Unicode.

This is what I've been trying to say all along. Define your own meaning of
"sorted" and go with that; don't ask the standardizers to put their blessing
on a weird sort order.
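
A minimal sketch of the loop described in the quoted text, assuming
0-terminated UTF-16 input. The names next_cp and u16_cmp_codepoint are
made up for illustration, and passing unpaired surrogates through
unchanged is a choice of this sketch, not something the standard
mandates:

```c
#include <stddef.h>
#include <stdint.h>

/* Decode one code point from a 0-terminated UTF-16 string and advance *i.
   Assumption of this sketch: an unpaired surrogate is returned as-is. */
static uint32_t next_cp(const uint16_t *s, size_t *i)
{
    uint32_t c = s[(*i)++];
    if (c >= 0xD800 && c <= 0xDBFF) {          /* high surrogate */
        uint32_t d = s[*i];                    /* may be the terminator */
        if (d >= 0xDC00 && d <= 0xDFFF) {      /* low surrogate follows */
            (*i)++;
            c = 0x10000 + ((c - 0xD800) << 10) + (d - 0xDC00);
        }
    }
    return c;
}

/* strcmp-style comparison in code point order.
   Returns <0, 0, or >0 like strcmp. */
int u16_cmp_codepoint(const uint16_t *a, const uint16_t *b)
{
    size_t i = 0, j = 0;
    for (;;) {
        uint32_t ca = next_cp(a, &i);
        uint32_t cb = next_cp(b, &j);
        if (ca != cb)
            return ca < cb ? -1 : 1;
        if (ca == 0)                           /* both strings ended */
            return 0;
    }
}
```

With this, U+FFFD sorts before U+10000, whereas a plain 16-bit code
unit comparison puts it after (0xFFFD > 0xD800).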

-Doug Ewell
 Fullerton, California


