Re: MS/Unix BOM FAQ again (small fix)

From: Doug Ewell (dewell@adelphia.net)
Date: Thu Apr 11 2002 - 01:35:25 EDT


Mark Davis <mark@macchiato.com> wrote:

> - when one of the BOM-allowing UTFs starts with a BOM, you know the
> encoding*, and you strip off the BOM when you get the content.
>
> *assuming that no UTF-16 file has U+0000 as the first character.

In the real world, this is a pretty good assumption -- almost as good,
in fact, as the one I've been stating for years: that no Unicode file
will have a zero-width no-break space (intended as such) as the first
character.

-Doug Ewell
 Fullerton, California



This archive was generated by hypermail 2.1.2 : Thu Apr 11 2002 - 00:24:06 EDT