L2/01-265

 

From: Mark Davis [mark@macchiato.com]
Sent: Monday, June 25, 2001 10:45 AM

Subject: Re: UTC Agenda Item: Request to WG2 to allow FFFF, FFFE

 

===============================

 

 

The UTC should formally request that WG2 change its definition of UTF-8 to allow the representation of the code points U+FFFF and U+FFFE. These are currently disallowed in 10646, which is clearly an anomaly: the other noncharacters (U+1FFFE, U+1FFFF, etc.) as well as the new noncharacters U+FDD0..U+FDEF are allowed.
Moreover, these code points are legal in HTML: see the SGML declaration
(http://www.w3.org/TR/REC-html40/sgml/sgmldecl.html).
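To illustrate why the exclusion looks like an anomaly, here is a minimal Python sketch of the generic UTF-8 bit pattern applied uniformly to every code point, noncharacters included. The function names utf8_encode and is_noncharacter are hypothetical, chosen only for this illustration. The sketch also assumes the Unicode range limit of U+10FFFF and rejects surrogates, which may not match the 10646 text under discussion.

```python
def is_noncharacter(cp: int) -> bool:
    """True for U+FDD0..U+FDEF and for U+nFFFE / U+nFFFF on every plane."""
    if not 0 <= cp <= 0x10FFFF:
        return False
    return 0xFDD0 <= cp <= 0xFDEF or (cp & 0xFFFE) == 0xFFFE


def utf8_encode(cp: int) -> bytes:
    """Encode one code point with the generic UTF-8 bit pattern.

    Assumption for this sketch: only out-of-range values and surrogates
    are rejected; noncharacters get no special treatment.
    """
    if cp < 0 or cp > 0x10FFFF or 0xD800 <= cp <= 0xDFFF:
        raise ValueError("not a Unicode scalar value")
    if cp < 0x80:
        return bytes([cp])
    if cp < 0x800:
        return bytes([0xC0 | (cp >> 6), 0x80 | (cp & 0x3F)])
    if cp < 0x10000:
        return bytes([0xE0 | (cp >> 12),
                      0x80 | ((cp >> 6) & 0x3F),
                      0x80 | (cp & 0x3F)])
    return bytes([0xF0 | (cp >> 18),
                  0x80 | ((cp >> 12) & 0x3F),
                  0x80 | ((cp >> 6) & 0x3F),
                  0x80 | (cp & 0x3F)])


if __name__ == "__main__":
    for cp in (0xFFFE, 0xFFFF, 0x1FFFE, 0xFDD0):
        encoded = utf8_encode(cp).hex(" ").upper()
        print(f"U+{cp:04X} noncharacter={is_noncharacter(cp)} -> {encoded}")
    # U+FFFE  -> EF BF BE
    # U+FFFF  -> EF BF BF
    # U+1FFFE -> F0 9F BF BE
    # U+FDD0  -> EF B7 90
```

Nothing in the bit pattern distinguishes U+FFFE or U+FFFF from U+1FFFE or U+FDD0: all four come out of the same branches, so excluding only the first two requires an extra special case in the definition.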

 

The 10646 definition should be modified to allow all noncharacters to be represented in UTF-8.

 

===============================