RE: UTF-8S (was: Re: ISO vs Unicode UTF-8)

From: Misha.Wolf@reuters.com
Date: Tue Jun 05 2001 - 08:51:54 EDT


On 05/06/2001 13:03:03 Marco Cimarosti wrote:
[...]
> But how should this 6-byte sequence be interpreted by a standard UTF-8
> decoder? Does it become one or two code points?

That depends on where the decoder is. If it's inside an XML
parser, then it becomes neither of the above, but rather a
fatal error.

Misha

> _ Marco
>
>
>

-----------------------------------------------------------------
        Visit our Internet site at http://www.reuters.com

Any views expressed in this message are those of the individual
sender, except where the sender specifically states them to be
the views of Reuters Ltd.



This archive was generated by hypermail 2.1.2 : Fri Jul 06 2001 - 00:17:18 EDT