From: Doug Ewell (dewell@adelphia.net)
Date: Mon Dec 15 2003 - 11:32:26 EST
Philippe Verdy <verdy underscore p at wanadoo dot fr> wrote:
> I would have expected to find these mappings:
>
> 0130; F; 0069; # LATIN SMALL LETTER DOTLESS I
> -> LATIN SMALL LETTER I
> 0130; T; 0130; # LATIN SMALL LETTER DOTLESS I
> -> LATIN SMALL LETTER DOTLESS I
>
> The rationale being that the locale-neutral mappings would not
> differentiate the "standard" small letter (soft-dotted) i, and the
> "Turkic" small letter dotless i, for the same reason that they do not
> differentiate their uppercase versions; and that the "Turkic" mappings
> should maintain this difference in both lowercase and uppercase pairs
> of letters.
Turkish and Azeri (and others) can only be cased correctly with
locale-specific mappings. The locale-neutral mappings cannot be
expected to consider U+0069 'i' and U+0130 'ı' equivalent, with all the
ambiguities that would bring. As you point out, 'i' and 'ı' are quite
different letters.
-Doug Ewell
Fullerton, California
http://users.adelphia.net/~dewell/
This archive was generated by hypermail 2.1.5 : Mon Dec 15 2003 - 12:14:58 EST