From: Hans Aberg (haberg@math.su.se)
Date: Sun May 31 2009 - 14:45:04 CDT
On 31 May 2009, at 19:42, Doug Ewell wrote:
>> In particular, it would be great to know if the range U+0080, …, U
>> +009F is invalid.
>
> That bit is especially wrong. I can at least imagine why there
> might be confusion about the noncharacters and surrogate code
> points, but not the C1 controls.
It is a bit disappointing: I was looking for a beginning (escape) byte
sequence to tell that string isn't UTF-8, among other valid strings.
But perhaps it does not matter.
Hans
This archive was generated by hypermail 2.1.5 : Sun May 31 2009 - 14:47:58 CDT