Marco.Cimarosti@icl.com scripsit:
> 1: In your example, you use GUJ as an abbreviation of Gujarati. Notice
> that there is a standard for 2-letter abbreviation of languages (from ISO, I
> think; there should be a link to it also on the Unicode site). This could be
> used in your proposal also to identify scripts (where each script is
> identified by the language it took its name from: Latin, Arabic, Greek,
> etc.). A handful of non conflicting 2-letter abbreviations should however be
> added for scripts that didn't take their name from a language (Cyrillic,
> Devanagari, Hangul, Kana, etc.).
I will attempt to forestall Michael by noting the existence of
an ISO pre-standard for script names; home page (in English and French)
at http://www.indigo.ie/egt/standards/iso15924/ .
Script names are 2 or 3 letters (or a number), conventionally capitalized
Xx or Xxx (to distinguish them visually from language and country names).
> CJK Ideographs currently have uninformative names like "CJK IDEOGRAPH 4E00".
> They could be more validly named using IDS's (Ideographic Description
> Sequences) with only Kangxi and Supplement radicals as DC's. E.g.:
The names will not be changed again, ever. No matter how bogus they are.
But feel free to prepare and publish such a table, which would be
extremely useful.
-- John Cowan cowan@ccil.org I am a member of a civilization. --David Brin
This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:53 EDT