On 06/12/2001 01:13:48 PM Jianping Yang wrote:
>If you convert < ED A0 80 ED B0 80 > into UTF-16, what does it mean then?
>I think definitely it means U-00010000.
I'd say not if that 6-byte sequence is interpreted in terms of *UTF-8*.
UTF-8 has no 6-byte sequences. It must be something else, like the thing
informally designated in our discussions as UTF-8S.
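
To make the distinction concrete, here is a minimal sketch in Python 3, assuming its built-in "utf-8" codec is an acceptable stand-in for a strict UTF-8 decoder. The helper names strict_utf8_accepts and decode_surrogate_pairs are hypothetical, and the second one only approximates what a UTF-8S-style reading might do (decode each 3-byte sequence to a surrogate, then pair the surrogates); it is not a definition of UTF-8S.

```python
# The six bytes from the quoted question: two 3-byte sequences,
# each of which encodes a single surrogate code unit (D800, DC00).
DATA = bytes([0xED, 0xA0, 0x80, 0xED, 0xB0, 0x80])


def strict_utf8_accepts(raw: bytes) -> bool:
    """Return True if Python's strict UTF-8 decoder accepts the bytes."""
    try:
        raw.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


def decode_surrogate_pairs(raw: bytes) -> str:
    """Hypothetical UTF-8S-style reading (an assumption for illustration).

    'surrogatepass' lets the decoder emit lone surrogates; a round trip
    through UTF-16 then joins each high/low pair into one code point.
    """
    surrogates = raw.decode("utf-8", errors="surrogatepass")
    utf16 = surrogates.encode("utf-16-le", errors="surrogatepass")
    return utf16.decode("utf-16-le")


if __name__ == "__main__":
    print(strict_utf8_accepts(DATA))
    print(hex(ord(decode_surrogate_pairs(DATA))))
    print("\U00010000".encode("utf-8").hex(" "))
```

Under those assumptions, the strict decoder rejects the byte sequence, the surrogate-pairing reading yields U-00010000, and the UTF-8 encoding of U-00010000 itself is the 4-byte sequence < F0 90 80 80 >.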
- Peter
---------------------------------------------------------------------------
Peter Constable
Non-Roman Script Initiative, SIL International
7500 W. Camp Wisdom Rd., Dallas, TX 75236, USA
Tel: +1 972 708 7485
E-mail: <peter_constable@sil.org>