From: Alexander Kh. (alexkh@writeme.com)
Date: Thu May 19 2005 - 21:42:06 CDT
From: "Philippe Verdy" <verdy_p@wanadoo.fr>
>
> The BOM is NOT a character.
Hm. Is that relevant in terms of statelessness?
> In a UTF-16 encoding *scheme* the leading BOM is fully ignorable.
> But in a UTF-16 encoding form, there's simply NO BOM and the
> codepoint U+FEFF is legal and represents ZWNBSP.
>
> You are mixing several levels in the Unicode character model.
In that case, let's split UTF-8 into several levels and put control
characters on a different level from codepoints. That will make the
encoding stateless :-) The text will probably have to be stored in
separate chunks, each starting with a control sequence, though.
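
For reference, the BOM distinction quoted above can be observed at the byte
level. The snippet below is only a sketch using Python's standard codecs, which
are my choice for illustration and not anything from the thread. It assumes that
Python's "utf-16" codec behaves like the UTF-16 encoding scheme (a leading BOM
selects the byte order and is dropped) and that "utf-16-le" is the closest
byte-level stand-in for the "no BOM" case, since it has a fixed byte order and
keeps a leading U+FEFF as an ordinary character.

```python
import codecs

# The bytes FF FE (U+FEFF in little-endian order) followed by "A" as UTF-16LE.
data = codecs.BOM_UTF16_LE + "A".encode("utf-16-le")

# "utf-16": the leading BOM is consumed to pick the byte order and is not
# returned as part of the decoded text.
as_scheme = data.decode("utf-16")

# "utf-16-le": the byte order is fixed, so no BOM handling takes place and
# U+FEFF survives as the first decoded character.
as_fixed = data.decode("utf-16-le")

print([hex(ord(c)) for c in as_scheme])  # ['0x41']
print([hex(ord(c)) for c in as_fixed])   # ['0xfeff', '0x41']
```

The same bytes therefore decode to one or two characters depending on which
layer is assumed, which is the point the quoted text makes about mixing levels.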
Best Regards.
Alexander Kh
This archive was generated by hypermail 2.1.5 : Thu May 19 2005 - 21:42:54 CDT