RE: Invalid code points

From: Phillips, Addison (addison@amazon.com)
Date: Sun May 31 2009 - 12:26:44 CDT

  • Next message: Doug Ewell: "Re: Invalid code points"

    The code points in the range U+0080 through U+009F are assigned to the C1 control characters, just like they are in ISO 8859-1. Unlike the other code points the article cites, these are real character assignments. Each character has a name, properties, and so forth. So those code points are definitely not "invalid".

    Addison

    Addison Phillips
    Globalization Architect -- Lab126

    Internationalization is not a feature.
    It is an architecture.

    > -----Original Message-----
    > From: unicode-bounce@unicode.org [mailto:unicode-bounce@unicode.org]
    > On Behalf Of Hans Aberg
    > Sent: Sunday, May 31, 2009 8:55 AM
    > To: Unicode Mailing List
    > Subject: Invalid code points
    >
    > This quote say that it depends on how you read the standard which
    > code
    > points are invalid; perhaps someone here can clarify :-):
    > http://en.wikipedia.org/wiki/UTF-8#Invalid_code_points
    >
    > In particular, it would be great to know if the range U+0080, …, U
    > +009F is invalid.
    >
    > Hans Aberg
    >
    >
    >



    This archive was generated by hypermail 2.1.5 : Sun May 31 2009 - 12:28:34 CDT