CLDR Ticket #10092(new data)
Opened 4 weeks ago
Transliteration rules from German to ASCII
|Reported by:||ausi <martin@…>||Owned by:||anybody|
To convert German text into ASCII (e.g. for generating URLs), there is currently only the transform „Latin-ASCII“ available where the diaeresis (umlaut) ä, ö and ü get converted to a, o and u respectively.
For the German language there are transliteration rules that define that ä, ö and ü should get translated to ae, oe and ue respectively. I found the following references of this rule:
When it is not possible to use the umlauts (for example, when using a restricted character set) the characters Ä, Ö, Ü, ä, ö, ü should be transcribed as Ae, Oe, Ue, ae, oe, ue respectively
When replacing umlaut characters with plain ASCII, use ae, oe, etc. for German language
German names containing umlauts (ä, ö, ü) and/or ß are spelled in the correct way in the non-machine-readable zone of the passport, but with simple vowel + E and/or SS in the machine-readable zone
The Austrian passport can (but does not always) contain a note in German, English, and French that AE / OE/ UE / SS are the common transcriptions for Ä / Ö / Ü / ß.
I created a new XML file for transforms/de-ASCII.xml which is attached to this ticket.