Re: UTF8 vs AL32UTF8

From: Peter_Constable@sil.org
Date: Tue Jun 12 2001 - 14:29:55 EDT


On 06/12/2001 01:13:48 PM Jianping Yang wrote:

>If you convert < ED A0 80 ED B0 80 > into UTF-16, what does it mean then? I
>think definitely it means U-00010000.

I'd say not if that 6-byte sequence is interpreted in terms of *UTF-8*.
UTF-8 has no 6-byte sequences. It must be something else, like the thing
informally designated in our discussions as UTF-8S.
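
To illustrate the distinction, here is a minimal sketch in Python. It is not from the thread itself, and the variable names are made up for the example. It assumes Python 3's built-in "utf-8" codec as the strict reading, and uses the "surrogatepass" error handler plus manual surrogate-pair arithmetic as a stand-in for the UTF-8S-style reading.

```python
data = bytes([0xED, 0xA0, 0x80, 0xED, 0xB0, 0x80])

# Strict UTF-8: ED A0 80 would encode a surrogate code point, so decoding fails.
try:
    data.decode("utf-8")
    strict_ok = True
except UnicodeDecodeError:
    strict_ok = False

# Lenient reading: let each 3-byte group decode to a lone surrogate code point.
surrogates = data.decode("utf-8", errors="surrogatepass")
units = [ord(ch) for ch in surrogates]

# Combine the two values as a UTF-16 surrogate pair (the UTF-8S-style reading).
high, low = units
scalar = 0x10000 + ((high - 0xD800) << 10) + (low - 0xDC00)

# In UTF-8 proper, that scalar value is a single 4-byte sequence.
utf8_proper = chr(scalar).encode("utf-8")

print(strict_ok, [hex(u) for u in units], hex(scalar), utf8_proper.hex(" "))
```

Under these assumptions the six bytes reach U-00010000 only by way of the surrogate pair D800 DC00; read as UTF-8, the same character is F0 90 80 80.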

- Peter

---------------------------------------------------------------------------
Peter Constable

Non-Roman Script Initiative, SIL International
7500 W. Camp Wisdom Rd., Dallas, TX 75236, USA
Tel: +1 972 708 7485
E-mail: <peter_constable@sil.org>


