[Unicode]   Common Locale Data Repository : Bug Tracking Home | Site Map | Search
 

source: trunk/common/properties/scriptMetadata.txt @ 12419

Revision 12404, 11.2 KB checked in by scherer, 5 days ago (diff)

cldrbug 8745: add Unicode 9 scripts with their script metadata; cldrbug 9138: fix some language codes for scripts, including cldrbug 8922: mro/mru; merged from branches/markus/uni90 plus further integration fixes

  • Property svn:eol-style set to native
  • Property svn:mime-type set to text/plain
Line 
1# ScriptMetadata.txt
2# Copyright © 1991-2016 Unicode, Inc.
3# CLDR data files are interpreted according to the LDML specification (http://unicode.org/reports/tr35/)
4# For terms of use, see http://www.unicode.org/copyright.html
5#
6# This file provides general information about scripts that may be useful to implementations processing text.
7# The information is the best currently available, and may change between versions of CLDR.
8#
9# Format:
10#       The data is not in XML; instead it uses the semicolon-delimited format from the Unicode Character Database (UCD).
11#       This is so that parsers of the UCD can more easily be adapted to read the data.
12#       Additional fields may be added in future versions; parsers may be designed to ignore those fields until they are revised.
13#
14# Field - Description
15#
16# 0 - Script Identifier
17# 1 - Web Rank:
18#               The approximate rank of this script from a large sample of the web,
19#               in terms of the number of characters found in that script.
20#               Below 32 the ranking is not statistically significant.
21# 2 - Sample Character:
22#               A sample character for use in "Last Resort" style fonts.
23#               For printing the combining mark for Zinh in a chart, U+25CC can be prepended.
24#               See http://unicode.org/policies/lastresortfont_eula.html
25# 3 - Origin country:
26#               The approximate area where the script originated, expressed as a BCP47 region code.
27# 4 - Density:
28#               The approximate information density of characters in this script, based on comparison of bilingual texts.
29# 5 - ID Usage:
30#               The usage for IDs (tables 4-7) according to UAX #31.
31#               For a description of values, see
32#               http://unicode.org/reports/tr31/#Table_Candidate_Characters_for_Exclusion_from_Identifiers
33# 6 - RTL:
34#               YES if the script is RTL
35#               Derived from whether the script contains RTL letters according to the Bidi_Class property
36# 7 - LB letters:
37#               YES if the major languages using the script allow linebreaks between letters (excluding hyphenation).
38#               Derived from LB property.
39# 8 - Shaping Required:
40#               YES if shaping is required for the major languages using that script for NFC text.
41#                       This includes not only ligation (and Indic conjuncts), Indic vowel splitting/reordering, and
42#                       Arabic-style contextual shaping, but also cases where NSM placement is required, like Thai.
43#               MIN if NSM placement is sufficient, not the more complex shaping.
44#                       The NSM placement may only be necessary for some major languages using the script.
45# 9 - Input Method Engine Required:
46#               YES if the major languages using the script require IMEs.
47#               In particular, users (of languages for that script) would be accustomed to using IMEs (such as Japanese)
48#               and typical commercial products for those languages would need IME support in order to be competitive.
49# 10- Cased
50#               YES if in modern (or most recent) usage case distinctions are customary.
51#
52# Sometimes a script is included here before it is added in the Unicode Standard.
53# Such scripts are marked with a "provisional" comment.
54#
55# Note: For the most likely language for each script, see
56#               http://unicode.org/repos/cldr-tmp/trunk/diff/supplemental/likely_subtags.html
57#
58Zyyy; 1; 0040; ZZ; -1; RECOMMENDED; UNKNOWN; UNKNOWN; UNKNOWN; UNKNOWN; UNKNOWN
59Latn; 2; 004C; IT; 1; RECOMMENDED; NO; NO; MIN; NO; YES
60Hani; 3; 5B57; CN; 3; RECOMMENDED; NO; YES; NO; YES; NO
61Hans; 3; 5B57; CN; 3; RECOMMENDED; NO; YES; NO; YES; NO
62Hant; 3; 5B57; CN; 3; RECOMMENDED; NO; YES; NO; YES; NO
63Cyrl; 4; 042F; BG; 1; RECOMMENDED; NO; NO; MIN; NO; YES
64Hira; 5; 304B; JP; 2; RECOMMENDED; NO; YES; NO; NO; NO
65Jpan; 5; 304B; JP; 2; RECOMMENDED; NO; YES; NO; YES; NO
66Kana; 6; 30AB; JP; 2; RECOMMENDED; NO; YES; NO; NO; NO
67Thai; 7; 0E17; TH; 1; RECOMMENDED; NO; YES; MIN; NO; NO
68Arab; 8; 0628; SA; 1; RECOMMENDED; YES; NO; YES; NO; NO
69Hang; 9; AC00; KR; 3; RECOMMENDED; NO; NO; MIN; YES; NO
70Kore; 9; AC00; KR; 3; RECOMMENDED; NO; NO; MIN; YES; NO
71Deva; 10; 0905; IN; 1; RECOMMENDED; NO; NO; YES; NO; NO
72Grek; 11; 03A9; GR; 1; RECOMMENDED; NO; NO; NO; NO; YES
73Hebr; 12; 05D0; IL; 1; RECOMMENDED; YES; NO; NO; NO; NO
74Taml; 13; 0B95; IN; 1; RECOMMENDED; NO; NO; YES; NO; NO
75Knda; 14; 0C95; IN; 1; RECOMMENDED; NO; NO; YES; NO; NO
76Geor; 15; 10D3; GE; 1; RECOMMENDED; NO; NO; NO; NO; NO
77Mlym; 16; 0D15; IN; 1; RECOMMENDED; NO; NO; YES; NO; NO
78Telu; 17; 0C15; IN; 1; RECOMMENDED; NO; NO; YES; NO; NO
79Armn; 18; 0531; AM; 1; RECOMMENDED; NO; NO; NO; NO; YES
80Mymr; 19; 1000; MM; 1; RECOMMENDED; NO; YES; YES; NO; NO
81Gujr; 20; 0A95; IN; 1; RECOMMENDED; NO; NO; YES; NO; NO
82Beng; 21; 0995; BD; 1; RECOMMENDED; NO; NO; YES; NO; NO
83Guru; 22; 0A15; IN; 1; RECOMMENDED; NO; NO; YES; NO; NO
84Laoo; 23; 0EA5; LA; 1; RECOMMENDED; NO; YES; YES; NO; NO
85Zinh; 24; 0308; ZZ; -1; RECOMMENDED; UNKNOWN; UNKNOWN; MIN; UNKNOWN; UNKNOWN
86Khmr; 25; 1780; KH; 1; RECOMMENDED; NO; YES; YES; NO; NO
87Tibt; 27; 0F40; CN; 1; RECOMMENDED; NO; NO; YES; NO; NO
88Sinh; 28; 0D85; LK; 1; RECOMMENDED; NO; NO; YES; NO; NO
89Ethi; 29; 12A0; ET; 2; RECOMMENDED; NO; NO; MIN; YES; NO
90Thaa; 30; 078C; MV; 1; RECOMMENDED; YES; NO; YES; NO; NO
91Orya; 31; 0B15; IN; 1; RECOMMENDED; NO; NO; YES; NO; NO
92Zzzz; 32; FDD0; ZZ; -1; UNKNOWN; UNKNOWN; UNKNOWN; UNKNOWN; UNKNOWN; UNKNOWN
93Cans; 33; 14C0; CA; 2; ASPIRATIONAL; NO; NO; NO; YES; NO
94Syrc; 34; 0710; SY; 1; LIMITED_USE; YES; NO; YES; NO; NO
95Bopo; 35; 3105; CN; 2; RECOMMENDED; NO; YES; NO; NO; NO
96Nkoo; 36; 07CA; GN; 1; LIMITED_USE; YES; NO; YES; NO; NO
97Cher; 37; 13C4; US; 2; LIMITED_USE; NO; NO; NO; NO; YES
98Yiii; 38; A288; CN; 3; ASPIRATIONAL; NO; YES; NO; YES; NO
99Samr; 39; 0800; IL; 1; EXCLUSION; YES; NO; MIN; NO; NO
100Copt; 40; 03E2; EG; 1; EXCLUSION; NO; NO; MIN; NO; YES
101Mong; 41; 1826; MN; 1; ASPIRATIONAL; NO; NO; YES; NO; NO
102Glag; 42; 2C00; BG; 1; EXCLUSION; NO; NO; NO; NO; YES
103Vaii; 43; A549; LR; 2; LIMITED_USE; NO; NO; NO; YES; NO
104Bali; 44; 1B05; ID; 1; LIMITED_USE; NO; NO; YES; NO; NO
105Tfng; 45; 2D30; MA; 1; ASPIRATIONAL; NO; NO; NO; NO; NO
106Bamu; 46; A6A0; CM; 1; LIMITED_USE; NO; NO; MIN; YES; NO
107Batk; 47; 1BC0; ID; 1; LIMITED_USE; NO; NO; YES; NO; NO
108Cham; 48; AA00; VN; 1; LIMITED_USE; NO; NO; YES; NO; NO
109Java; 49; A984; ID; 1; LIMITED_USE; NO; NO; YES; NO; NO
110Kali; 50; A90A; MM; 1; LIMITED_USE; NO; NO; MIN; NO; NO
111Lepc; 51; 1C00; IN; 1; LIMITED_USE; NO; NO; YES; NO; NO
112Limb; 52; 1900; IN; 1; LIMITED_USE; NO; NO; YES; NO; NO
113Lisu; 53; A4D0; CN; 1; LIMITED_USE; NO; NO; NO; YES; NO
114Mand; 54; 0840; IR; 1; LIMITED_USE; YES; NO; YES; NO; NO
115Mtei; 55; ABC0; IN; 1; LIMITED_USE; NO; NO; YES; NO; NO
116Talu; 56; 1980; CN; 1; LIMITED_USE; NO; YES; YES; NO; NO
117Olck; 57; 1C5A; IN; 1; LIMITED_USE; NO; NO; NO; NO; NO
118Saur; 58; A882; IN; 1; LIMITED_USE; NO; NO; YES; NO; NO
119Sund; 59; 1B83; ID; 1; LIMITED_USE; NO; NO; YES; NO; NO
120Sylo; 60; A800; BD; 1; LIMITED_USE; NO; NO; YES; NO; NO
121Tale; 61; 1950; CN; 1; LIMITED_USE; NO; YES; NO; NO; NO
122Lana; 62; 1A20; TH; 1; LIMITED_USE; NO; YES; YES; NO; NO
123Tavt; 63; AA80; VN; 1; LIMITED_USE; NO; YES; YES; NO; NO
124Avst; 64; 10B00; IR; 1; EXCLUSION; YES; NO; YES; NO; NO
125Brah; 65; 11005; IN; 1; EXCLUSION; NO; NO; YES; NO; NO
126Bugi; 66; 1A00; ID; 1; EXCLUSION; NO; NO; MIN; NO; NO
127Buhd; 67; 1743; PH; 1; EXCLUSION; NO; NO; YES; NO; NO
128Cari; 68; 102A0; TR; 1; EXCLUSION; NO; NO; NO; NO; NO
129Xsux; 69; 12000; IQ; 3; EXCLUSION; NO; NO; NO; YES; NO
130Cprt; 70; 10800; CY; 1; EXCLUSION; YES; NO; NO; NO; NO
131Dsrt; 71; 10414; US; 1; EXCLUSION; NO; NO; NO; NO; YES
132Egyp; 72; 13153; EG; 3; EXCLUSION; NO; NO; YES; YES; NO
133Goth; 73; 10330; UA; 1; EXCLUSION; NO; NO; NO; NO; NO
134Hano; 74; 1723; PH; 1; EXCLUSION; NO; NO; YES; NO; NO
135Armi; 75; 10840; IR; 1; EXCLUSION; YES; NO; NO; NO; NO
136Phli; 76; 10B60; IR; 1; EXCLUSION; YES; NO; NO; NO; NO
137Prti; 77; 10B40; IR; 1; EXCLUSION; YES; NO; NO; NO; NO
138Kthi; 78; 11083; IN; 1; EXCLUSION; NO; NO; MIN; NO; NO
139Khar; 79; 10A00; PK; 1; EXCLUSION; YES; NO; YES; NO; NO
140Linb; 80; 10000; GR; 1; EXCLUSION; NO; NO; NO; YES; NO
141Lyci; 81; 10280; TR; 1; EXCLUSION; NO; NO; NO; NO; NO
142Lydi; 82; 10920; TR; 1; EXCLUSION; YES; NO; NO; NO; NO
143Ogam; 83; 168F; IE; 1; EXCLUSION; NO; NO; NO; NO; NO
144Ital; 84; 10300; IT; 1; EXCLUSION; NO; NO; NO; NO; NO
145Xpeo; 85; 103A0; IR; 1; EXCLUSION; NO; NO; NO; NO; NO
146Sarb; 86; 10A60; YE; 1; EXCLUSION; YES; NO; NO; NO; NO
147Orkh; 87; 10C00; MN; 1; EXCLUSION; YES; NO; NO; NO; NO
148Osma; 88; 10480; SO; 1; EXCLUSION; NO; NO; NO; NO; NO
149Phag; 89; A840; CN; 1; EXCLUSION; NO; NO; YES; NO; NO
150Phnx; 90; 10900; LB; 1; EXCLUSION; YES; NO; NO; NO; NO
151Rjng; 91; A930; ID; 1; EXCLUSION; NO; NO; YES; NO; NO
152Runr; 92; 16A0; SE; 1; EXCLUSION; NO; NO; NO; NO; NO
153Shaw; 93; 10450; GB; 1; EXCLUSION; NO; NO; NO; NO; NO
154Tglg; 94; 1703; PH; 1; EXCLUSION; NO; NO; MIN; NO; NO
155Tagb; 95; 1763; PH; 1; EXCLUSION; NO; NO; NO; NO; NO
156Ugar; 96; 10380; SY; 1; EXCLUSION; NO; NO; NO; NO; NO
157Cakm; 97; 11103; BD; 1; LIMITED_USE; NO; NO; YES; NO; NO
158Merc; 98; 109A0; SD; 1; EXCLUSION; YES; NO; NO; NO; NO
159Mero; 99; 10980; SD; 1; EXCLUSION; YES; NO; NO; NO; NO
160Plrd; 100; 16F00; CN; 1; ASPIRATIONAL; NO; NO; YES; NO; NO
161Shrd; 101; 11183; IN; 1; EXCLUSION; NO; NO; YES; NO; NO
162Sora; 102; 110D0; IN; 1; EXCLUSION; NO; NO; NO; NO; NO
163Takr; 103; 11680; IN; 1; EXCLUSION; NO; NO; YES; NO; NO
164Brai; 104; 280E; FR; -1; UNKNOWN; UNKNOWN; UNKNOWN; UNKNOWN; UNKNOWN; UNKNOWN
165Aghb; 105; 10537; RU; 1; EXCLUSION; NO; NO; NO; NO; NO
166Bass; 106; 16AE6; LR; 1; EXCLUSION; NO; NO; NO; NO; NO
167Dupl; 107; 1BC20; FR; 1; EXCLUSION; NO; NO; NO; YES; NO
168Elba; 108; 10500; AL; 1; EXCLUSION; NO; NO; NO; NO; NO
169Gran; 109; 11315; IN; 1; EXCLUSION; NO; NO; NO; NO; NO
170Hmng; 110; 16B1C; LA; 1; EXCLUSION; NO; NO; NO; NO; NO
171Khoj; 111; 11208; IN; 1; EXCLUSION; NO; NO; NO; NO; NO
172Lina; 112; 10647; GR; 1; EXCLUSION; NO; NO; NO; YES; NO
173Mahj; 113; 11152; IN; 1; EXCLUSION; NO; NO; NO; NO; NO
174Mani; 114; 10AD8; CN; 1; EXCLUSION; YES; NO; NO; NO; NO
175Mend; 115; 1E802; SL; 1; EXCLUSION; YES; NO; NO; YES; NO
176Modi; 116; 1160E; IN; 1; EXCLUSION; NO; NO; NO; NO; NO
177Mroo; 117; 16A4F; BD; 1; EXCLUSION; NO; NO; NO; NO; NO
178Narb; 118; 10A95; SA; 1; EXCLUSION; YES; NO; NO; NO; NO
179Nbat; 119; 10896; JO; 1; EXCLUSION; YES; NO; NO; NO; NO
180Palm; 120; 10873; SY; 1; EXCLUSION; YES; NO; NO; NO; NO
181Pauc; 121; 11AC0; MM; 1; EXCLUSION; NO; NO; NO; NO; NO
182Perm; 122; 1036B; RU; 1; EXCLUSION; NO; NO; NO; NO; NO
183Phlp; 123; 10B8F; CN; 1; EXCLUSION; YES; NO; NO; NO; NO
184Sidd; 124; 1158E; IN; 1; EXCLUSION; NO; NO; NO; NO; NO
185Sind; 125; 112BE; IN; 1; EXCLUSION; NO; NO; NO; NO; NO
186Tirh; 126; 11484; IN; 1; EXCLUSION; NO; NO; NO; NO; NO
187Wara; 127; 118B4; IN; 1; EXCLUSION; NO; NO; NO; NO; YES
188Ahom; 128; 11717; IN; 1; EXCLUSION; NO; YES; YES; NO; NO
189Hluw; 129; 14400; TR; 1; EXCLUSION; NO; NO; NO; YES; NO
190Hatr; 130; 108F4; IQ; 1; EXCLUSION; YES; NO; NO; NO; NO
191Mult; 131; 1128F; PK; 1; EXCLUSION; NO; NO; NO; NO; NO
192Hung; 132; 10CA1; HU; 1; EXCLUSION; YES; NO; NO; NO; YES
193Sgnw; 133; 1D850; US; 1; EXCLUSION; NO; NO; NO; YES; NO
194Adlm; 134; 1E909; GN; 1; LIMITED_USE; YES; NO; MIN; NO; YES  # provisional data for future Unicode 9.0 script
195Bhks; 135; 11C0E; IN; 1; EXCLUSION; NO; NO; YES; NO; NO  # provisional data for future Unicode 9.0 script
196Marc; 136; 11C72; CN; 1; EXCLUSION; NO; NO; YES; NO; NO  # provisional data for future Unicode 9.0 script
197Osge; 137; 104B5; US; 1; LIMITED_USE; NO; NO; NO; NO; YES  # provisional data for future Unicode 9.0 script
198Tang; 138; 18229; CN; 3; EXCLUSION; NO; YES; NO; YES; NO  # provisional data for future Unicode 9.0 script
199Newa; 139; 11412; NP; 1; LIMITED_USE; NO; NO; YES; NO; NO  # provisional data for future Unicode 9.0 script
200
201# EOF
Note: See TracBrowser for help on using the repository browser.