source: trunk/common/properties/scriptMetadata.txt @ 13477

Revision 13412, 11.3 KB checked in by scherer, 6 weeks ago (diff)

cldrbug 9882: script metadata for new Unicode 10 scripts; change Aspirational to Limited_Use; merged from branches/markus/uni10b; add new script codes to validity and coverage levels

  • Property svn:eol-style set to native
  • Property svn:mime-type set to text/plain
# ScriptMetadata.txt
# Copyright © 1991-2016 Unicode, Inc.
# CLDR data files are interpreted according to the LDML specification (http://unicode.org/reports/tr35/)
# For terms of use, see http://www.unicode.org/copyright.html
#
# This file provides general information about scripts that may be useful to implementations processing text.
# The information is the best currently available, and may change between versions of CLDR.
#
# Format:
#       The data is not in XML; instead it uses the semicolon-delimited format from the Unicode Character Database (UCD).
#       This is so that parsers of the UCD can more easily be adapted to read the data.
#       Additional fields may be added in future versions; parsers may be designed to ignore those fields until they are revised.
#
# Field - Description
#
# 0 - Script Identifier
# 1 - Web Rank:
#               The approximate rank of this script from a large sample of the web,
#               in terms of the number of characters found in that script.
#               Ranks beyond 32 are not statistically significant.
# 2 - Sample Character:
#               A sample character for use in "Last Resort" style fonts.
#               For printing the combining mark for Zinh in a chart, U+25CC can be prepended.
#               See http://unicode.org/policies/lastresortfont_eula.html
# 3 - Origin country:
#               The approximate area where the script originated, expressed as a BCP47 region code.
# 4 - Density:
#               The approximate information density of characters in this script, based on comparison of bilingual texts.
# 5 - ID Usage:
#               The usage for IDs (tables 4-7) according to UAX #31.
#               For a description of values, see
#               http://unicode.org/reports/tr31/#Table_Candidate_Characters_for_Exclusion_from_Identifiers
# 6 - RTL:
#               YES if the script is RTL
#               Derived from whether the script contains RTL letters according to the Bidi_Class property
# 7 - LB letters:
#               YES if the major languages using the script allow linebreaks between letters (excluding hyphenation).
#               Derived from LB property.
# 8 - Shaping Required:
#               YES if shaping is required for the major languages using that script for NFC text.
#                       This includes not only ligation (and Indic conjuncts), Indic vowel splitting/reordering, and
#                       Arabic-style contextual shaping, but also cases where NSM placement is required, like Thai.
#               MIN if NSM placement is sufficient, not the more complex shaping.
#                       The NSM placement may only be necessary for some major languages using the script.
# 9 - Input Method Engine Required:
#               YES if the major languages using the script require IMEs.
#               In particular, users (of languages for that script) would be accustomed to using IMEs (such as Japanese)
#               and typical commercial products for those languages would need IME support in order to be competitive.
# 10 - Cased:
#               YES if in modern (or most recent) usage case distinctions are customary.
#
# Sometimes a script is included here before it is added in the Unicode Standard.
# Such scripts are marked with a "provisional" comment.
#
# Note: For the most likely language for each script, see
#               http://unicode.org/repos/cldr-tmp/trunk/diff/supplemental/likely_subtags.html
#
Zyyy; 1; 0040; ZZ; -1; RECOMMENDED; UNKNOWN; UNKNOWN; UNKNOWN; UNKNOWN; UNKNOWN
Latn; 2; 004C; IT; 1; RECOMMENDED; NO; NO; MIN; NO; YES
Hanb; 3; 5B57; CN; 3; RECOMMENDED; NO; YES; NO; YES; NO
Hani; 3; 5B57; CN; 3; RECOMMENDED; NO; YES; NO; YES; NO
Hans; 3; 5B57; CN; 3; RECOMMENDED; NO; YES; NO; YES; NO
Hant; 3; 5B57; CN; 3; RECOMMENDED; NO; YES; NO; YES; NO
Cyrl; 4; 042F; BG; 1; RECOMMENDED; NO; NO; MIN; NO; YES
Hira; 5; 304B; JP; 2; RECOMMENDED; NO; YES; NO; NO; NO
Jpan; 5; 304B; JP; 2; RECOMMENDED; NO; YES; NO; YES; NO
Kana; 6; 30AB; JP; 2; RECOMMENDED; NO; YES; NO; NO; NO
Thai; 7; 0E17; TH; 1; RECOMMENDED; NO; YES; MIN; NO; NO
Arab; 8; 0628; SA; 1; RECOMMENDED; YES; NO; YES; NO; NO
Hang; 9; AC00; KR; 3; RECOMMENDED; NO; NO; MIN; YES; NO
Jamo; 9; 1112; KR; 3; RECOMMENDED; NO; NO; MIN; YES; NO
Kore; 9; AC00; KR; 3; RECOMMENDED; NO; NO; MIN; YES; NO
Deva; 10; 0905; IN; 1; RECOMMENDED; NO; NO; YES; NO; NO
Grek; 11; 03A9; GR; 1; RECOMMENDED; NO; NO; NO; NO; YES
Hebr; 12; 05D0; IL; 1; RECOMMENDED; YES; NO; NO; NO; NO
Taml; 13; 0B95; IN; 1; RECOMMENDED; NO; NO; YES; NO; NO
Knda; 14; 0C95; IN; 1; RECOMMENDED; NO; NO; YES; NO; NO
Geor; 15; 10D3; GE; 1; RECOMMENDED; NO; NO; NO; NO; NO
Mlym; 16; 0D15; IN; 1; RECOMMENDED; NO; NO; YES; NO; NO
Telu; 17; 0C15; IN; 1; RECOMMENDED; NO; NO; YES; NO; NO
Armn; 18; 0531; AM; 1; RECOMMENDED; NO; NO; NO; NO; YES
Mymr; 19; 1000; MM; 1; RECOMMENDED; NO; YES; YES; NO; NO
Gujr; 20; 0A95; IN; 1; RECOMMENDED; NO; NO; YES; NO; NO
Beng; 21; 0995; BD; 1; RECOMMENDED; NO; NO; YES; NO; NO
Guru; 22; 0A15; IN; 1; RECOMMENDED; NO; NO; YES; NO; NO
Laoo; 23; 0EA5; LA; 1; RECOMMENDED; NO; YES; YES; NO; NO
Zinh; 24; 0308; ZZ; -1; RECOMMENDED; UNKNOWN; UNKNOWN; MIN; UNKNOWN; UNKNOWN
Khmr; 25; 1780; KH; 1; RECOMMENDED; NO; YES; YES; NO; NO
Tibt; 27; 0F40; CN; 1; RECOMMENDED; NO; NO; YES; NO; NO
Sinh; 28; 0D85; LK; 1; RECOMMENDED; NO; NO; YES; NO; NO
Ethi; 29; 12A0; ET; 2; RECOMMENDED; NO; NO; MIN; YES; NO
Thaa; 30; 078C; MV; 1; RECOMMENDED; YES; NO; YES; NO; NO
Orya; 31; 0B15; IN; 1; RECOMMENDED; NO; NO; YES; NO; NO
Zzzz; 32; FDD0; ZZ; -1; UNKNOWN; UNKNOWN; UNKNOWN; UNKNOWN; UNKNOWN; UNKNOWN
Adlm; 33; 1E909; GN; 1; LIMITED_USE; YES; NO; MIN; NO; YES
Aghb; 33; 10537; RU; 1; EXCLUSION; NO; NO; NO; NO; NO
Ahom; 33; 11717; IN; 1; EXCLUSION; NO; YES; YES; NO; NO
Armi; 33; 10840; IR; 1; EXCLUSION; YES; NO; NO; NO; NO
Avst; 33; 10B00; IR; 1; EXCLUSION; YES; NO; YES; NO; NO
Bali; 33; 1B05; ID; 1; LIMITED_USE; NO; NO; YES; NO; NO
Bamu; 33; A6A0; CM; 1; LIMITED_USE; NO; NO; MIN; YES; NO
Bass; 33; 16AE6; LR; 1; EXCLUSION; NO; NO; NO; NO; NO
Batk; 33; 1BC0; ID; 1; LIMITED_USE; NO; NO; YES; NO; NO
Bhks; 33; 11C0E; IN; 1; EXCLUSION; NO; NO; YES; NO; NO
Bopo; 33; 3105; CN; 2; RECOMMENDED; NO; YES; NO; NO; NO
Brah; 33; 11005; IN; 1; EXCLUSION; NO; NO; YES; NO; NO
Brai; 33; 280E; FR; -1; UNKNOWN; UNKNOWN; UNKNOWN; UNKNOWN; UNKNOWN; UNKNOWN
Bugi; 33; 1A00; ID; 1; EXCLUSION; NO; NO; MIN; NO; NO
Buhd; 33; 1743; PH; 1; EXCLUSION; NO; NO; YES; NO; NO
Cakm; 33; 11103; BD; 1; LIMITED_USE; NO; NO; YES; NO; NO
Cans; 33; 14C0; CA; 2; LIMITED_USE; NO; NO; NO; YES; NO
Cari; 33; 102A0; TR; 1; EXCLUSION; NO; NO; NO; NO; NO
Cham; 33; AA00; VN; 1; LIMITED_USE; NO; NO; YES; NO; NO
Cher; 33; 13C4; US; 2; LIMITED_USE; NO; NO; NO; NO; YES
Copt; 33; 03E2; EG; 1; EXCLUSION; NO; NO; MIN; NO; YES
Cprt; 33; 10800; CY; 1; EXCLUSION; YES; NO; NO; NO; NO
Dsrt; 33; 10414; US; 1; EXCLUSION; NO; NO; NO; NO; YES
Dupl; 33; 1BC20; FR; 1; EXCLUSION; NO; NO; NO; YES; NO
Egyp; 33; 13153; EG; 3; EXCLUSION; NO; NO; YES; YES; NO
Elba; 33; 10500; AL; 1; EXCLUSION; NO; NO; NO; NO; NO
Glag; 33; 2C00; BG; 1; EXCLUSION; NO; NO; NO; NO; YES
Gonm; 33; 11D10; IN; 1; EXCLUSION; NO; NO; YES; NO; NO  # provisional data for future Unicode 10.0 script
Goth; 33; 10330; UA; 1; EXCLUSION; NO; NO; NO; NO; NO
Gran; 33; 11315; IN; 1; EXCLUSION; NO; NO; NO; NO; NO
Hano; 33; 1723; PH; 1; EXCLUSION; NO; NO; YES; NO; NO
Hatr; 33; 108F4; IQ; 1; EXCLUSION; YES; NO; NO; NO; NO
Hluw; 33; 14400; TR; 1; EXCLUSION; NO; NO; NO; YES; NO
Hmng; 33; 16B1C; LA; 1; EXCLUSION; NO; NO; NO; NO; NO
Hung; 33; 10CA1; HU; 1; EXCLUSION; YES; NO; NO; NO; YES
Ital; 33; 10300; IT; 1; EXCLUSION; NO; NO; NO; NO; NO
Java; 33; A984; ID; 1; LIMITED_USE; NO; NO; YES; NO; NO
Kali; 33; A90A; MM; 1; LIMITED_USE; NO; NO; MIN; NO; NO
Khar; 33; 10A00; PK; 1; EXCLUSION; YES; NO; YES; NO; NO
Khoj; 33; 11208; IN; 1; EXCLUSION; NO; NO; NO; NO; NO
Kthi; 33; 11083; IN; 1; EXCLUSION; NO; NO; MIN; NO; NO
Lana; 33; 1A20; TH; 1; LIMITED_USE; NO; YES; YES; NO; NO
Lepc; 33; 1C00; IN; 1; LIMITED_USE; NO; NO; YES; NO; NO
Limb; 33; 1900; IN; 1; LIMITED_USE; NO; NO; YES; NO; NO
Lina; 33; 10647; GR; 1; EXCLUSION; NO; NO; NO; YES; NO
Linb; 33; 10000; GR; 1; EXCLUSION; NO; NO; NO; YES; NO
Lisu; 33; A4D0; CN; 1; LIMITED_USE; NO; NO; NO; YES; NO
Lyci; 33; 10280; TR; 1; EXCLUSION; NO; NO; NO; NO; NO
Lydi; 33; 10920; TR; 1; EXCLUSION; YES; NO; NO; NO; NO
Mahj; 33; 11152; IN; 1; EXCLUSION; NO; NO; NO; NO; NO
Mand; 33; 0840; IR; 1; LIMITED_USE; YES; NO; YES; NO; NO
Mani; 33; 10AD8; CN; 1; EXCLUSION; YES; NO; NO; NO; NO
Marc; 33; 11C72; CN; 1; EXCLUSION; NO; NO; YES; NO; NO
Mend; 33; 1E802; SL; 1; EXCLUSION; YES; NO; NO; YES; NO
Merc; 33; 109A0; SD; 1; EXCLUSION; YES; NO; NO; NO; NO
Mero; 33; 10980; SD; 1; EXCLUSION; YES; NO; NO; NO; NO
Modi; 33; 1160E; IN; 1; EXCLUSION; NO; NO; NO; NO; NO
Mong; 33; 1826; MN; 1; LIMITED_USE; NO; NO; YES; NO; NO
Mroo; 33; 16A4F; BD; 1; EXCLUSION; NO; NO; NO; NO; NO
Mtei; 33; ABC0; IN; 1; LIMITED_USE; NO; NO; YES; NO; NO
Mult; 33; 1128F; PK; 1; EXCLUSION; NO; NO; NO; NO; NO
Narb; 33; 10A95; SA; 1; EXCLUSION; YES; NO; NO; NO; NO
Nbat; 33; 10896; JO; 1; EXCLUSION; YES; NO; NO; NO; NO
Newa; 33; 11412; NP; 1; LIMITED_USE; NO; NO; YES; NO; NO
Nkoo; 33; 07CA; GN; 1; LIMITED_USE; YES; NO; YES; NO; NO
Nshu; 33; 1B1C4; CN; 2; EXCLUSION; NO; YES; NO; YES; NO  # provisional data for future Unicode 10.0 script
Ogam; 33; 168F; IE; 1; EXCLUSION; NO; NO; NO; NO; NO
Olck; 33; 1C5A; IN; 1; LIMITED_USE; NO; NO; NO; NO; NO
Orkh; 33; 10C00; MN; 1; EXCLUSION; YES; NO; NO; NO; NO
Osge; 33; 104B5; US; 1; LIMITED_USE; NO; NO; NO; NO; YES
Osma; 33; 10480; SO; 1; EXCLUSION; NO; NO; NO; NO; NO
Palm; 33; 10873; SY; 1; EXCLUSION; YES; NO; NO; NO; NO
Pauc; 33; 11AC0; MM; 1; EXCLUSION; NO; NO; NO; NO; NO
Perm; 33; 1036B; RU; 1; EXCLUSION; NO; NO; NO; NO; NO
Phag; 33; A840; CN; 1; EXCLUSION; NO; NO; YES; NO; NO
Phli; 33; 10B60; IR; 1; EXCLUSION; YES; NO; NO; NO; NO
Phlp; 33; 10B8F; CN; 1; EXCLUSION; YES; NO; NO; NO; NO
Phnx; 33; 10900; LB; 1; EXCLUSION; YES; NO; NO; NO; NO
Plrd; 33; 16F00; CN; 1; LIMITED_USE; NO; NO; YES; NO; NO
Prti; 33; 10B40; IR; 1; EXCLUSION; YES; NO; NO; NO; NO
Rjng; 33; A930; ID; 1; EXCLUSION; NO; NO; YES; NO; NO
Runr; 33; 16A0; SE; 1; EXCLUSION; NO; NO; NO; NO; NO
Samr; 33; 0800; IL; 1; EXCLUSION; YES; NO; MIN; NO; NO
Sarb; 33; 10A60; YE; 1; EXCLUSION; YES; NO; NO; NO; NO
Saur; 33; A882; IN; 1; LIMITED_USE; NO; NO; YES; NO; NO
Sgnw; 33; 1D850; US; 1; EXCLUSION; NO; NO; NO; YES; NO
Shaw; 33; 10450; GB; 1; EXCLUSION; NO; NO; NO; NO; NO
Shrd; 33; 11183; IN; 1; EXCLUSION; NO; NO; YES; NO; NO
Sidd; 33; 1158E; IN; 1; EXCLUSION; NO; NO; NO; NO; NO
Sind; 33; 112BE; IN; 1; EXCLUSION; NO; NO; NO; NO; NO
Sora; 33; 110D0; IN; 1; EXCLUSION; NO; NO; NO; NO; NO
Soyo; 33; 11A5C; MN; 1; EXCLUSION; NO; NO; YES; NO; NO  # provisional data for future Unicode 10.0 script
Sund; 33; 1B83; ID; 1; LIMITED_USE; NO; NO; YES; NO; NO
Sylo; 33; A800; BD; 1; LIMITED_USE; NO; NO; YES; NO; NO
Syrc; 33; 0710; SY; 1; LIMITED_USE; YES; NO; YES; NO; NO
Tagb; 33; 1763; PH; 1; EXCLUSION; NO; NO; NO; NO; NO
Takr; 33; 11680; IN; 1; EXCLUSION; NO; NO; YES; NO; NO
Tale; 33; 1950; CN; 1; LIMITED_USE; NO; YES; NO; NO; NO
Talu; 33; 1980; CN; 1; LIMITED_USE; NO; YES; YES; NO; NO
Tang; 33; 18229; CN; 3; EXCLUSION; NO; YES; NO; YES; NO
Tavt; 33; AA80; VN; 1; LIMITED_USE; NO; YES; YES; NO; NO
Tfng; 33; 2D30; MA; 1; LIMITED_USE; NO; NO; NO; NO; NO
Tglg; 33; 1703; PH; 1; EXCLUSION; NO; NO; MIN; NO; NO
Tirh; 33; 11484; IN; 1; EXCLUSION; NO; NO; NO; NO; NO
Ugar; 33; 10380; SY; 1; EXCLUSION; NO; NO; NO; NO; NO
Vaii; 33; A549; LR; 2; LIMITED_USE; NO; NO; NO; YES; NO
Wara; 33; 118B4; IN; 1; EXCLUSION; NO; NO; NO; NO; YES
Xpeo; 33; 103A0; IR; 1; EXCLUSION; NO; NO; NO; NO; NO
Xsux; 33; 12000; IQ; 3; EXCLUSION; NO; NO; NO; YES; NO
Yiii; 33; A288; CN; 3; LIMITED_USE; NO; YES; NO; YES; NO
Zanb; 33; 11A0B; MN; 1; EXCLUSION; NO; NO; YES; NO; NO  # provisional data for future Unicode 10.0 script

# EOF
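
The format notes in the file header (semicolon-delimited fields, "#" comments, possible extra fields in future versions) suggest a reader along the following lines. This is only a sketch, not part of CLDR: the names ScriptInfo and parse_script_metadata are hypothetical, the field order is taken from the "Field - Description" list above, and the input is assumed to be the raw file lines without any repository-browser line-number prefixes. The YES/NO columns are kept as strings because the data also uses MIN and UNKNOWN.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator


@dataclass(frozen=True)
class ScriptInfo:
    """One data row; attribute names are illustrative, not defined by CLDR."""
    code: str           # field 0: script identifier
    web_rank: int       # field 1
    sample_char: str    # field 2, converted from hex to a character
    origin_region: str  # field 3
    density: int        # field 4 (-1 appears in the data for unknown)
    id_usage: str       # field 5
    rtl: str            # field 6: YES / NO / UNKNOWN
    lb_letters: str     # field 7
    shaping: str        # field 8: YES / MIN / NO / UNKNOWN
    ime: str            # field 9
    cased: str          # field 10


def parse_script_metadata(lines: Iterable[str]) -> Iterator[ScriptInfo]:
    """Yield one ScriptInfo per data line, skipping comments and blank lines."""
    for raw in lines:
        # Everything from '#' onward is a comment (full-line or trailing).
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue
        fields = [f.strip() for f in line.split(";")]
        if len(fields) < 11:
            raise ValueError(f"expected at least 11 fields: {raw!r}")
        # Fields past index 10 are ignored, as the format notes allow.
        yield ScriptInfo(
            code=fields[0],
            web_rank=int(fields[1]),
            sample_char=chr(int(fields[2], 16)),
            origin_region=fields[3],
            density=int(fields[4]),
            id_usage=fields[5],
            rtl=fields[6],
            lb_letters=fields[7],
            shaping=fields[8],
            ime=fields[9],
            cased=fields[10],
        )
```

With such a reader, a caller could for example collect the right-to-left scripts with [s.code for s in parse_script_metadata(f) if s.rtl == "YES"], where f is an open handle on the file. For Zinh, whose sample character is a combining mark, a chart renderer might prepend U+25CC as the header suggests.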