Re: Is there Unicode mail out there?

From: Shigemichi Yazawa (yazawa@globalsight.com)
Date: Wed Jul 18 2001 - 14:49:07 EDT


> ----- Original Message -----
> From: "Bill Kurmey" <Bill.Kurmey@v-wave.com>
> To: "Mark Davis" <mark@macchiato.com>
> Sent: Wednesday, July 18, 2001 03:08
> Subject: Is there Unicode mail out there?
>
> > Am I missing something somewhere in the specifications on the W3C site?
> > Where is there a reference forbidding an XML processor from handling ANY
> > character that is defined in Unicode and ISO/IEC 10646?

Look at http://www.w3.org/TR/2000/REC-xml-20001006#NT-Char

[2] Char ::= #x9 | #xA | #xD | [#x20-#xD7FF]
           | [#xE000-#xFFFD] | [#x10000-#x10FFFF]
             /* any Unicode character, excluding the surrogate blocks,
                FFFE, and FFFF. */
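
To make the ranges concrete, here is a minimal Python sketch of a
membership test for production [2]. The names XML_CHAR_RANGES and
is_xml_char are made up for this illustration; only the ranges come
from the spec.

```python
# Ranges copied from production [2] (Char) of XML 1.0, Second Edition.
# The constant and function names are hypothetical, chosen for this sketch.
XML_CHAR_RANGES = (
    (0x9, 0x9),           # tab
    (0xA, 0xA),           # line feed
    (0xD, 0xD),           # carriage return
    (0x20, 0xD7FF),       # up to, but excluding, the surrogate blocks
    (0xE000, 0xFFFD),     # excludes FFFE and FFFF
    (0x10000, 0x10FFFF),  # supplementary planes
)


def is_xml_char(code_point: int) -> bool:
    """Return True if the code point matches production [2]."""
    return any(low <= code_point <= high for low, high in XML_CHAR_RANGES)


if __name__ == "__main__":
    # SOH, STX, ETX, tab, form feed, 'A', and the noncharacter FFFE.
    for cp in (0x01, 0x02, 0x03, 0x09, 0x0C, 0x41, 0xFFFE):
        verdict = "allowed" if is_xml_char(cp) else "not allowed"
        print(f"U+{cp:04X}: {verdict}")
```

Note that the test works on code points, not on bytes in a particular
encoding, which is how the production itself is written.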

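I think the XML spec is self-contradictory: the comment says "any
Unicode character", yet the production itself excludes every C0 control
character except #x9, #xA, and #xD.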

> > My concern stems from working with an email archive format which uses soh,
> > stx and etx as an envelope.

Good point. U+000C (form feed) is also used frequently in the bodies of
email messages and news articles. It may not make sense to allow control
characters in HTML, but it does make sense in XML when it is used as a
container for data, including legacy data like email archives.
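
The practical effect on such an archive can be checked by handing the
characters to an XML 1.0 parser. The sketch below uses Python's standard
xml.etree.ElementTree module; the helper name is_well_formed and the
<msg> element are hypothetical, picked only for this example. A
conforming XML 1.0 parser should reject soh, stx, etx, and form feed
whether they appear literally or as character references, since
references must also match Char.

```python
import xml.etree.ElementTree as ET


def is_well_formed(document: str) -> bool:
    """Return True if the parser accepts the document, False otherwise."""
    try:
        ET.fromstring(document)
    except ET.ParseError:
        return False
    return True


# Hypothetical sample documents; <msg> is not from any real archive format.
samples = {
    "plain text": "<msg>hello</msg>",
    "tab (U+0009)": "<msg>a\tb</msg>",
    "soh/stx/etx envelope": "<msg>\x01header\x02body\x03</msg>",
    "form feed (U+000C)": "<msg>page one\x0cpage two</msg>",
    "soh as a character reference": "<msg>&#1;</msg>",
}

if __name__ == "__main__":
    for label, document in samples.items():
        verdict = "accepted" if is_well_formed(document) else "rejected"
        print(f"{label}: {verdict}")
```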

-------------------
Shigemichi Yazawa
yazawa@globalsight.com



This archive was generated by hypermail 2.1.2 : Wed Jul 18 2001 - 15:35:57 EDT