From: Kent Karlsson (kentk@cs.chalmers.se)
Date: Wed Dec 17 2003 - 08:37:46 EST
[resending; better set the encoding to UTF-8...]
Peter Kirk wrote:
...
> used on Turkic text, violates the very sensible rule DO NOT USE
> COMBINING DOTS WITH I's, and leads to all sorts of potential
> confusion
> e.g. that both simple and full case folding and lowercasing
> applied to
> NFD Turkic text generate the nonsensical <i, dot above>. This
> could be a
> serious problem - although one that may not be worth fixing.
<i, dot above> is not non-sensical. It is used in Lithuanian for
such things as <i, dot above, tilde above>, as well as other
additonal accents above an i or a j that keeps its dot.
/kent k
Lithuanian alphabet (not listing all the uppercase
accented letters)
Aa (Àà, Áá Ãã Ąą {Ą́}{ą́}), Bb, Cc (CHch), Čč, Dd,
Ee (Ęę, Ėė è é ẽ ę {ę́} {ę̃} ė {ė́} {ė̃}), Ff, Gg, Hh,
Ii (Ì{i̇̀} Í{i̇́} Ĩ{i̇̃} Įį {Į́}{į̇́} {Į̃}{į̇̃}, Yy, Ýý, Ỹỹ),
Jj ({J̃}{j̇̃}), Kk, Ll ({l̃}), Mm ({m̃}), Nn (Ññ),
Oo (ò, ó, õ), Pp, [Qq], Rr (r̃), Ss, Šš, Tt,
Uu (ù ú ũ Ųų {ų́} {ų̃} Ūū {ū́}), Vv, [Ww], [Xx], Zz, Žž
This archive was generated by hypermail 2.1.5 : Wed Dec 17 2003 - 09:24:54 EST