RE: Default endianness of Unicode, or not

From: Yves Arrouye (yves@realnames.com)
Date: Wed Apr 10 2002 - 19:58:19 EDT


> "D43 <ital>UTF-16 character encoding scheme:</ital> the Unicode
> CES that serializes a UTF-16 code unit sequence as a byte sequence
> in either big-endian or little-endian format.
>
> * In UTF-16 (the CES), the UTF-16 code unit sequence
> <004D 0430 4E8C D800 DF02> is serialized as
> <FE FF 00 4D 04 30 4E 8C D8 00 DF 02> or
> <FF FE 4D 00 30 04 8C 4E 00 D8 02 DF> or
> <00 4D 04 30 4E 8C D8 00 DF 02>."
>
> etc., etc.

So the semantics are the same as before: in the absence of a BOM or any
other indication of which byte order is used, assume big-endian.
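
To make the rule concrete, here is a minimal sketch in Python of a decoder for the UTF-16 encoding scheme as quoted above. The function name decode_utf16_ces is made up for illustration. It honors a leading BOM if there is one and otherwise assumes big-endian. It picks the explicit "utf-16-be" and "utf-16-le" codecs itself rather than relying on the generic "utf-16" codec, which, as far as I know, falls back to the platform's native order when no BOM is present.

```python
# Hypothetical helper, not part of any standard library API.
BOM_BE = b"\xfe\xff"  # U+FEFF serialized big-endian
BOM_LE = b"\xff\xfe"  # U+FEFF serialized little-endian


def decode_utf16_ces(data: bytes) -> str:
    """Decode bytes in the UTF-16 encoding scheme.

    A leading BOM selects the byte order and is dropped from the result.
    Without a BOM, big-endian is assumed.
    """
    if data[:2] == BOM_BE:
        return data[2:].decode("utf-16-be")
    if data[:2] == BOM_LE:
        return data[2:].decode("utf-16-le")
    # No indication of byte order: assume big-endian.
    return data.decode("utf-16-be")


# The three serializations from the quoted D43 example.
with_be_bom = bytes.fromhex("FE FF 00 4D 04 30 4E 8C D8 00 DF 02")
with_le_bom = bytes.fromhex("FF FE 4D 00 30 04 8C 4E 00 D8 02 DF")
without_bom = bytes.fromhex("00 4D 04 30 4E 8C D8 00 DF 02")

# <004D 0430 4E8C D800 DF02> as characters; the surrogate pair
# D800 DF02 is the single code point U+10302.
expected = "\u004D\u0430\u4E8C\U00010302"

for serialized in (with_be_bom, with_le_bom, without_bom):
    assert decode_utf16_ces(serialized) == expected
```

All three byte sequences decode to the same four characters, which is the point of the definition: the BOM-less form is only unambiguous because of the big-endian default.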

YA
 


