Re: Case mapping of dotless lowercase letters

From: Doug Ewell (dewell@adelphia.net)
Date: Mon Dec 15 2003 - 11:32:26 EST

  • Next message: Mark E. Shoulson: "Re: [OT reversing letters to avoid offence] Re: [Fwd: Re: Swastika to be banned by Microsoft?]"

    Philippe Verdy <verdy underscore p at wanadoo dot fr> wrote:

    > I would have expected to find these mappings:
    >
    > 0130; F; 0069; # LATIN SMALL LETTER DOTLESS I
    > -> LATIN SMALL LETTER I
    > 0130; T; 0130; # LATIN SMALL LETTER DOTLESS I
    > -> LATIN SMALL LETTER DOTLESS I
    >
    > The rationale being that the locale-neutral mappings would not
    > differentiate the "standard" small letter (soft-dotted) i, and the
    > "Turkic" small letter dotless i, for the same reason that they do not
    > differentiate their uppercase versions; and that the "Turkic" mappings
    > should maintain this difference in both lowercase and uppercase pairs
    > of letters.

    Turkish and Azeri (and others) can only be cased correctly with
    locale-specific mappings. The locale-neutral mappings cannot be
    expected to consider U+0069 'i' and U+0130 'ı' equivalent, with all the
    ambiguities that would bring. As you point out, 'i' and 'ı' are quite
    different letters.

    -Doug Ewell
     Fullerton, California
     http://users.adelphia.net/~dewell/



    This archive was generated by hypermail 2.1.5 : Mon Dec 15 2003 - 12:14:58 EST