Corrigendum #9

Doug Ewell doug at
Thu Jun 26 12:08:45 CDT 2014

Richard Wordingham <richard dot wordingham at ntlworld dot com> wrote:

> At present there is no certainty as to whether
> an interchanged file in the UTF-16 encoding scheme that appears to
> contain a BOM contains a BOM or starts with U+FFFE. The only
> promise is that such a file contains an even number of data bytes.
> Any such sequence is valid! Will the UTF-16 encoding scheme be
> withdrawn?

One might wonder, given how frequently we hear that unpaired surrogates
also occur in the wild and need to be tolerated.

Doug Ewell | Thornton, CO, USA | @DougEwell

More information about the Unicode mailing list