From: Richard Wordingham (richard.wordingham@ntlworld.com)
Date: Tue May 06 2008 - 15:36:02 CDT
Andreas Prilop wrote on Tuesday, May 06, 2008 5:24 PM
> The problem was/is:
> What to do when a byte 0x90 is found in a file that has
>
> (a) erroneously charset=ISO-8859-1
>
> (b) erroneously charset=Windows-1252
>
> (c) no encoding/charset at all specified
>
> Surprisingly, the W3C validator gives up with Windows-1252
> but does perform a check with ISO-8859-1.
It's not surprising at all. These charsets designations have the *IANA*
definitions, which are not necessarily identical to international (e.g.
ISO-8859 series) or national (e.g. TIS-620) standards. Thus 0x90 is
undefined for Windows-1252 but merely an illegal character for HTML in the
IANA definition of ISO-88591.
Richard.
This archive was generated by hypermail 2.1.5 : Tue May 06 2008 - 15:39:14 CDT