Re: Feedback on the proposal to change U+FFFD generation when decoding ill-formed UTF-8

From: Richard Wordingham via Unicode <unicode_at_unicode.org>
Date: Wed, 31 May 2017 06:47:46 +0100

On Tue, 30 May 2017 16:38:45 -0600
Karl Williamson via Unicode <unicode_at_unicode.org> wrote:

> Under Best Practices, how many REPLACEMENT CHARACTERs should the
> sequence <ED B0 82> generate? 0, 1, 2, 3, 4 ?
>
> In practice, how many do parsers generate?

See test 5.1.5 on Markus Kuhn's test page:
http://www.cl.cam.ac.uk/~mgk25/ucs/examples/UTF-8-test.txt
Firefox generates three replacement characters for it.
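
One quick way to see what a given decoder does with <ED B0 82> is to
decode the bytes with replacement enabled and count the U+FFFD
characters in the result. The following is only a minimal sketch
using CPython 3's built-in UTF-8 decoder; the helper name
count_replacements is made up for illustration, and the result says
nothing about how other parsers behave.

```python
REPLACEMENT = "\ufffd"  # U+FFFD REPLACEMENT CHARACTER


def count_replacements(data: bytes) -> int:
    """Decode data as UTF-8, substituting U+FFFD for ill-formed input,
    and return how many U+FFFD characters the decoder emitted.

    Assumes the input contains no well-formed encoding of U+FFFD
    itself (EF BF BD), which would be counted too.
    """
    text = data.decode("utf-8", errors="replace")
    return text.count(REPLACEMENT)


if __name__ == "__main__":
    sample = bytes([0xED, 0xB0, 0x82])  # the sequence from the question
    print(count_replacements(sample))   # CPython 3 prints 3
```

CPython 3 treats each of the three bytes as a separate error here,
since B0 is not a valid second byte after ED, and B0 and 82 are then
stray continuation bytes.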

Richard.
Received on Wed May 31 2017 - 00:48:08 CDT