Re: 8859-1, 8859-15, 1252 and Euro

From: A. Vine (avine@eng.sun.com)
Date: Mon Feb 07 2000 - 18:05:24 EST


Tim Greenwood wrote:
>
> Pretty much all of the pages on the web, and the browsers, ignore the
> differences between ISO-8859-1 and Windows code page 1252.

I wish they would! I'm pretty sick of seeing question marks where there should
be quotes, apostrophes, bullets, em-dashes, etc.

>
> So what is a system that stores all data in Unicode and converts for web
> output to do with U+20AC? The formally correct process would seem to be to
> convert to 0x80 only for CP1252 (and the other CP12xx sets) to 0xa4 for
> ISO-8859-15 and to the 'not a character in this set' sign for ISO-8859-1.
> This may be formally correct, but would not help the majority of users. For
> that we would convert to 0x80 for ISO-8859-1 - it works even though 'wrong'.

Sure, if you don't care about Unix users.
Unless Linux does something different?

Andrea

-- 
Andrea Vine, avine@eng.sun.com, iPlanet i18n architect
A word is not a crystal, transparent and unchanging--it is the skin of a
living thought, and may vary greatly in color and content according to the
circumstances and time in which it is used. - Supreme Court Justice Holmes



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:58 EDT