Re: ICU's uconv vs Linux iconv and UTF-8

From: Mark Davis (mark@macchiato.com)
Date: Fri Feb 01 2002 - 11:46:11 EST


>ICU's pedantic form

The goal for ICU is to be charset neutral, and support all of the
conversions that are in modern use. There are a large number of
variants of character sets; you can use the one you want. See:

http://oss.software.ibm.com/icu/charset/index.html

Mark

----- Original Message -----
From: "Dan Kogai" <dankogai@dan.co.jp>
To: "Nick Ing-Simmons" <nick.ing-simmons@elixent.com>
Cc: "Nick Ing-Simmons" <nick@ing-simmons.net>; "SADAHIRO Tomoyuki"
<bqw10602@nifty.com>; <perl-unicode@perl.org>; <unicode@unicode.org>
Sent: Friday, February 01, 2002 07:46
Subject: Re: ICU's uconv vs Linux iconv and UTF-8

> On 2002.02.02, at 00:37, Nick Ing-Simmons wrote:
> >> Oh, yes. This is the problem of the original Unicode 2.x map; it is
> >> not ASCII-preserving. I posted this problem to
> >> perl-unicode@perl.org when I first released Jcode. Several
> >> discussions later, I made Jcode so that it preserves ASCII by
> >> default and added $Jcode::Unicode::PEDANTIC to change the behavior.
> >
> > Ah. I take your point. If we used ICU's pedantic form,
> > both UNIX ~/foo and MS C:\Foo would get mangled.
>
> EXACTLY!
>
> > The other differences (having looked at diff in yudit) seem to be
> > mapping ¢ (cent), £ (pound), ¬ (not) and one of the longer dashes
> > to different width variants (full width for ICU).
> >
> > I am going off ICU ...
>
> As I mentioned to unicode@unicode.org, yet another problem is that
> ftp://ftp.unicode.org/Public/MAPPINGS/EASTASIA/ is now gone, so I
> don't have a practical way to check the mapping. I want the mapping
> back!
>
> Dan
>
>
>
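
The ASCII-preservation problem described in the quoted message can be
illustrated with a rough sketch. It assumes that the "pedantic" mapping
follows the JIS X 0201 Roman assignments, where byte 0x5C is YEN SIGN
(U+00A5) and byte 0x7E is OVERLINE (U+203E). The table and the function
name decode_single_bytes are hypothetical and written only for this
illustration; they are not the ICU, iconv, or Jcode API, and double-byte
Shift-JIS sequences are left out entirely.

```python
# Hypothetical single-byte tables for illustration only; real Shift-JIS
# converters also handle double-byte sequences, which are omitted here.

# "Pedantic" overrides (assumed to follow JIS X 0201 Roman): bytes 0x5C
# and 0x7E are not treated as ASCII backslash and tilde.
PEDANTIC_OVERRIDES = {
    0x5C: "\u00a5",  # YEN SIGN instead of REVERSE SOLIDUS
    0x7E: "\u203e",  # OVERLINE instead of TILDE
}


def decode_single_bytes(data: bytes, pedantic: bool) -> str:
    """Decode bytes in the range 0x00-0x7F with or without the overrides."""
    out = []
    for b in data:
        if b > 0x7F:
            raise ValueError(f"byte 0x{b:02X} is outside this sketch's range")
        if pedantic and b in PEDANTIC_OVERRIDES:
            out.append(PEDANTIC_OVERRIDES[b])
        else:
            # ASCII-preserving behavior: the byte maps to the same code point.
            out.append(chr(b))
    return "".join(out)


# The two path examples from the thread, decoded both ways.
for path in (b"~/foo", b"C:\\Foo"):
    preserved = decode_single_bytes(path, pedantic=False)
    mangled = decode_single_bytes(path, pedantic=True)
    print(f"{preserved!r} -> {mangled!r}")
```

With pedantic=False the paths come back unchanged; with pedantic=True
the tilde and the backslash are replaced by the overline and the yen
sign, which is the mangling the thread is complaining about.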


