From: Eric Muller (emuller@adobe.com)
Date: Sat Apr 26 2003 - 13:46:10 EDT
Kevin Brown wrote:
>Is there a database anywhere that cross-references Latin-based glyphs (eg
>E WITH DIAERESIS) to all the Latin-based writing systems they're used in
>(eg Dutch, French, Albanian, Cyrillic and Welsh) and vice versa? It would
>be really useful.
>
www.eki.ee, which has already been mentioned a few times, reports the
following for U+00CB Ë LATIN CAPITAL LETTER E WITH DIAERESIS:
name: LATIN CAPITAL LETTER E WITH DIAERESIS
old name: LATIN CAPITAL LETTER E DIAERESIS
Adobe glyph name: Edieresis
mnemonic name(s): <E:>
HTML 4 mnemonic name:Ë
category: Lu (Letter, Uppercase)
combining: 0
decomposition info: 0045 0308
comment:
found in charsets: 8859-1 (CB); 8859-10 (CB); 8859-14 (CB); 8859-15
(CB); 8859-2 (CB); 8859-3 (CB); 8859-4 (CB); 8859-9 (CB); CP1250 (CB);
CP1252 (CB); CP1254 (CB); CP1258 (CB); CP850 (D3); CP852 (D3); CP857
(D3); CP863 (94); ROMAN (E8); CP1116 (D3); CP1122 (73); SAMI_WIN (CB);
SAMI_MAC (E8); 8859-16 (CB);
found in languages: af [Afrikaans]; cy [Welsh]; fr [French]; fy
[Frisian]; lb [Luxembourgian]; nl [Dutch]; sq [Albanian]; sv [Swedish];
wo [Wolof]; yap [Yapese]; dink [Dinka]; sla [Kashubian];
used in romanization of: be_r [Belarusian (cyrillic)]; ru_r [Russian
(cyrillic)];
lowercase: 00EB
This database also covers Cyrillic and the languages written in it.
The underlying data can be accessed a number of ways: e.g. “what is
needed beyond basic Cyrillic to write Altai?” reports U+0458 ј CYRILLIC
SMALL LETTER JE, U+04A5 ҥ CYRILLIC SMALL LIGATURE EN GHE, U+04E7 ӧ
CYRILLIC SMALL LETTER O WITH DIAERESIS and U+04F1 ӱ CYRILLIC SMALL
LETTER U WITH DIAERESIS.
As far as I can tell, the underlying data is of good quality; of course,
there the devil is in the details, and there are certainly decisions
that could be argued. A wonderful resource.
Eric.
This archive was generated by hypermail 2.1.5 : Sat Apr 26 2003 - 14:33:09 EDT