doug at ewellic.org
Thu Jun 26 12:08:45 CDT 2014
Richard Wordingham <richard dot wordingham at ntlworld dot com> wrote:
> At present there is no certainty as to whether
> an interchanged file in the UTF-16 encoding scheme that appears to
> contain a BOM contains a BOM or starts with U+FFFE. The only
> promise is that such a file contains an even number of data bytes.
> Any such sequence is valid! Will the UTF-16 encoding scheme be
One might wonder, given how frequently we hear that unpaired surrogates
also occur in the wild and need to be tolerated.
Doug Ewell | Thornton, CO, USA
http://ewellic.org | @DougEwell
More information about the Unicode