RE: UTF-8S (was: Re: ISO vs Unicode UTF-8)

From: Marco Cimarosti (marco.cimarosti@essetre.it)
Date: Tue Jun 05 2001 - 08:03:03 EDT


Mark Davis wrote:
> - I am well aware that one can accept 6-byte supplementary
> characters on
> input in UTF-8. (Did you really think I wasn't?)

(O, no, I know you knew!)

But how should this 6-byte sequence be interpreted by a standard UTF-8
decoder? Does it become one or two code points?

_ Marco



This archive was generated by hypermail 2.1.2 : Fri Jul 06 2001 - 00:17:18 EDT