From: Roozbeh Pournader (roozbeh@sharif.edu)
Date: Mon Feb 17 2003 - 02:18:40 EST
On Sun, 16 Feb 2003, Doug Ewell wrote:
> The Unicode home page includes the following line, right where it should
> be, in the <head> section:
>
> <meta http-equiv="Content-Type" content="text/html; charset=utf-8">
>
> Any User Agent that takes a page properly marked as UTF-8, as above, and
> still tries to autodetect a local code page, is badly misguided. How
> would it handle a real UTF-8-encoded euro sign (0xE2 0x82 0xAC)?
AFAICR, there is supposed to be no single non-ASCII character before that
<meta> tag. I really don't like to search the specs again, but I'm sure I
saw it somewhere. The HTML renderer sees those characters and thinks the
document has already started (since the <html>, <head> and <body> tags are
are not mandatory in HTML 4 Transitional). So it goes into autodetection
mode. The same situation happens with MS FrontPage 2000 (but I've already
explained that).
roozbeh
This archive was generated by hypermail 2.1.5 : Mon Feb 17 2003 - 02:54:28 EST