Re: Encoding of personal names in official databases

From: Brendan_Murray/DUB/Lotus@lotus.com
Date: Tue Mar 30 1999 - 08:26:16 EST


Why not use UTF-8? This would provide a high level of compatibility with the
current system, in that ASCII range data remain as single-byte ASCII, while the
Æ, Ø and Å become 2 bytes. This encoding permits the use of all Unicode
characters, so things like the Turkish dotless-i and even CJK ideographs can
just as easily be encoded.

For example, your name would be:
     Currently: Trond Trosterud
     UTF-8: Trond Trosterud

while other names would change to look like (if viewed directly on Windows as
though they were encoded as CP 1252):
     Currently: Åge Sørensen
     UTF-8: Ã?ge Sørensen

Brendan



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:45 EDT