Re: UTF-8 stress test file?

From: Simon Montagu (smontagu@smontagu.org)
Date: Sun Oct 10 2004 - 14:36:27 CST

  • Next message: Theodore H. Smith: "Re: UTF-8 stress test file?"

    Theodore H. Smith wrote:
    > I'd like to see a UTF-8 stress test file.
    >
    > It should consist of lines of UTF-8, separated each by a newline. Each
    > line should be malformed. Also, some idea of how to deal with the
    > malformed UTF-8 should be noted in a separate file.
    >
    > Really, I just want some way to verify that I can detect every kind of
    > UTF-8 wrongness. I have some code I adapted from Unicode.org, but I want
    > to make sure my adaptions haven't broken the code.

    http://www.cl.cam.ac.uk/~mgk25/ucs/examples/UTF-8-test.txt



    This archive was generated by hypermail 2.1.5 : Sun Oct 10 2004 - 14:37:53 CST