From: Keutgen, Walter (walter.keutgen@be.unisys.com)
Date: Thu Apr 06 2006 - 06:13:27 CST
Donald,
almost correct. See the CLDR instructions:
http://www.unicode.org/cldr/wiki?SurveyToolHelp/characters. PLEASE FOLLOW THE HYPERLINK [Exemplar Characters].
Please pay also attention to "tought as the alphabet" i.e. one may not exclude letters not used in genuine words of a language that uses Latin script e.g. "k" to be kept in "it". Probably one can derive a similar rule for other scripts.
Best regards
Walter Keutgen
International Engineering Centre
Unisys Belgium nv-sa
THIS COMMUNICATION MAY CONTAIN CONFIDENTIAL AND/OR OTHERWISE PROPRIETARY MATERIAL and is thus for use only by the intended recipient. If you received this in error, please contact the sender and delete the e-mail and its attachments from all computers.
-----Original Message-----
From: unicode-bounce@unicode.org [mailto:unicode-bounce@unicode.org] On Behalf Of Donald Z. Osborn
Sent: Donnerstag, den 6. April 2006 11:58
To: Simon Montagu
Cc: Unicode Mailing List
Subject: Re: CLDR: Bad exemplar chars for some locales
What is the general rule in this kind of situation? Another case is
languages in
Latin transcription that can have tone marks, but generally don't use
them, and
others that generally do but not necessarily, etc. The case is a bit
complicated where precomposed accented characters can be used for ASCII base
characters but don't exist for extended characters.
It almost seems like there ought to be an "tone and vowel mark"
category between
the standard set and the auxiliary set. But then again maybe (1) the standard
set does not have to do with frequency of usage (so points in Hebrew, accents
in Bambara etc. should be there if they are part of the transcription system)
and (2) I should look at the good old Effing Manual to clarify my
understanding
of "auxiliary" (which I take to mean characters "not used" in the
language that
might be for borrowed and transcribed foreign terms).
Don Osborn
Bisharat.net
PanAfrican Localisation Project
This archive was generated by hypermail 2.1.5 : Thu Apr 06 2006 - 06:31:34 CST