RE: DEC multilingual code page, ISO 8859-1, etc.

From: Chris Pratley (chrispr@MICROSOFT.com)
Date: Wed Mar 29 2000 - 18:04:28 EST


See>>>

-----Original Message-----
From: Jungshik Shin [mailto:jshin@pantheon.yale.edu]
Sent: March 28, 2000 2:34 PM
To: Unicode List
Subject: RE: DEC multilingual code page, ISO 8859-1, etc.

On Tue, 28 Mar 2000, Chris Pratley wrote:

> As a counter-example to my own argument, with IE5 and Office2000 we did
> decide to take a leap and force Unicode on users. Not many people know
that
> IE5 and Office2000 send URLs in UTF-8 by default. The server is expected
to
> assume UTF-8 if it could be UTF-8, otherwise try to use its local encoding
> (IIS4 and 5 do this). We got significant complaints in Korea and Taiwan
> where there were apparently a significant number of ISPs supporting local
> characters in URLs by assuming the local encoding (KSC-5601or Big5) so we
> had to turn UTF-8 off by default there, but in most other areas it went
over

  IMHO, problems with Korean and Taiwanese ISPs are not MS IE's falut
but that of those who assumed the local encoding in the URL(filename/path
part) They should fix their web pages and MS IE should not retreat on
that front.
>>>Of course we'd like to do that, but in the real world this would be seen
a huge bug. Since we make software for a living and not just as a thought
experiment, we need to go with what the market wants, while carefully
guiding them toward the right thing. What I'd like to see happen here is
that the web servers from various makers are upgraded to assume UTF-8 if it
could be UTF-8 (as IIS4/5 do). That way the question will be moot after
awhile as these Korean ISPs upgrade to newer web servers. Of course that
requires some proactive action on the part of web server vendors.

  BTW, the name of the local encoding most widely used in Korea is NOT KS C
5601(which is just the old name of the coded character set KS X 1001
which comprises EUC-KR and other Korean encodings) BUT EUC-KR(one of
several possible encodings/character set encoding schemes for TWO coded
character sets US-ASCII/ISO 646 AND KS X 1001). Please, stop pass along
the wrong information just as Kano's infamous book does.

     Jungshik Shin



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:21:00 EDT