Re: Invalid code points

From: Doug Ewell (doug@ewellic.org)
Date: Sun May 31 2009 - 17:25:10 CDT

Next message: Ruszlán Gaszanov: "Re: Invalid code points"

Previous message: David Perry: "Old Italic in RTL ??"
In reply to: Hans Aberg: "Re: Invalid code points"
Next in thread: William J Poser: "Re: Invalid code points"
Reply: William J Poser: "Re: Invalid code points"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

Hans Aberg <haberg at math dot su dot se> wrote:

> I think also strictly speaking there are two UTF-8s: one which does
> not have the integer limitations that are used in Unicode. This could
> be used to convert integers sequences into byte sequences which then
> do not have Unicode character interpretation.

There is only one UTF-8, the one defined by Unicode and ISO/IEC 10646,
which maps valid Unicode/10646 scalar values to sequences of bytes.
Anything else is not UTF-8. Keep repeating this to yourself.

--
Doug Ewell  *  Thornton, Colorado, USA  *  RFC 4645  *  UTN #14
http://www.ewellic.org
http://www1.ietf.org/html.charters/ltru-charter.html
http://www.alvestrand.no/mailman/listinfo/ietf-languages  ˆ

Next message: Ruszlán Gaszanov: "Re: Invalid code points"
Previous message: David Perry: "Old Italic in RTL ??"
In reply to: Hans Aberg: "Re: Invalid code points"
Next in thread: William J Poser: "Re: Invalid code points"
Reply: William J Poser: "Re: Invalid code points"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

This archive was generated by hypermail 2.1.5 : Sun May 31 2009 - 17:27:49 CDT