Re: UTF8 vs. Unicode (UTF16) in code

From: Kenneth Whistler (kenw@sybase.com)
Date: Fri Mar 16 2001 - 21:29:45 EST


In response to my posting on this thread, William Overington asked:

> Yet the posting appended would seem to imply that at least one standard has
> reserved some (all?) of these codes either as "never to be used" codes or as
> "might someday be used" codes.
>
> So, my question is this. Which of the integer values from 0x110000 to
> 0x3FFFFFFF are reserved and by which standards please?

The International Standard ISO/IEC 10646-1:2000 reserves all the *code
positions* U-00110000 to U-7FFFFFFF. (Technically, this is not quite
correct yet, because ISO/IEC 10646-1:2000 allows some of those values to
be user-defined, but the amendment which has wording to reserve all
of them is in process right now.) Note that the 10646-1:2000 codespace
is 31-bit, not 30-bit, as implied by your question.

Nobody and no standard can reserve *integer values* in the abstract.
Integer values just are. They are used for all kinds of things,
over and over again.

--Ken



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:21:20 EDT