Unicode CLDR Version 35 Language/Locale Data Released

From: <announcements_at_unicode.org>
Date: Wed, 27 Mar 2019 14:14:08 -0700

Unicode CLDR 35 provides an update to the key
building blocks for software supporting the world's languages. CLDR data
is used by all major software systems
<http://cldr.unicode.org/index#TOC-Who-uses-CLDR-> for their software
internationalization and localization, adapting software to the
conventions of different languages for common software tasks.

CLDR 35 included a limited Survey Tool data collection phase
<https://www.unicode.org/cldr/charts/35/supplemental/locale_coverage.html>.
The following summarizes the changes in the release.

*Data*: 70,000+ new data fields, 13,400+ revised data fields
*Basic coverage*: New languages at *Basic* coverage: Cebuano (ceb),
Hausa (ha), Igbo (ig), Yoruba (yo)
*Modern coverage*: Somali (so) and Javanese (jv) increased
coverage from *Moderate* to *Modern*
*Emoji 12.0*: Names and annotations (search keywords) for 90+ new emoji
<http://blog.unicode.org/2019/02/unicode-emoji-12-final-for-2019.html>;
also includes fixes for previous names and keywords
*Collation*: Collation updated to *Unicode 12.0*, including new emoji;
Japanese single-character (ligature) era names added to collation and
search collation
*Measurement units*: 23 additional units
<http://www.unicode.org/cldr/charts/35/delta/supplemental-data.html#unit>
*Date formats*: Two additional flexible formats, and 20 new interval
formats
*Japanese calendar*: In the Japanese locale, updated to use Gannen (元年)
year numbering for non-numeric formats (which include 年), and to
consistently use narrow eras in numeric date formats such as “H31/3/27”.
*Region Names*: Many names updated to local equivalents of “North
Macedonia” (MK
<https://www.unicode.org/cldr/charts/35/by_type/locale_display_names.territories__europe_.html#216cb1286c47a733>)
and “Eswatini” (SZ
<https://www.unicode.org/cldr/charts/35/by_type/locale_display_names.territories__africa_.html#6e49aa3c9aa50dc9>).
*Segmentation*: Enhanced Grapheme Cluster Boundary rules for 6 Indic
scripts: Gujr, Telu, Mlym, Orya, Beng, Deva.
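
To illustrate the Segmentation item, here is a rough Python sketch. It
assumes the enhancement is about keeping consonant conjuncts (consonant
+ virama + consonant) inside one grapheme cluster, which this
announcement does not spell out. The function name `clusters` and the
flag `keep_conjuncts` are hypothetical, and the rules are a heavy
simplification of the real boundary rules, not the CLDR data itself.

```python
import unicodedata

# Canonical combining class 9 is the class assigned to virama characters.
VIRAMA_CCC = 9

def is_mark(ch):
    """True for combining marks (nonspacing, spacing, enclosing)."""
    return unicodedata.category(ch) in ("Mn", "Mc", "Me")

def clusters(text, keep_conjuncts):
    """Split text into simplified clusters.

    Marks always attach to the preceding cluster. When keep_conjuncts
    is True, a letter that directly follows a virama also stays in the
    preceding cluster (an assumed approximation of the enhanced rules).
    """
    out = []
    for ch in text:
        prev = out[-1][-1] if out else ""
        joins_conjunct = (
            keep_conjuncts
            and prev != ""
            and unicodedata.combining(prev) == VIRAMA_CCC
            and unicodedata.category(ch) == "Lo"
        )
        if out and (is_mark(ch) or joins_conjunct):
            out[-1] += ch
        else:
            out.append(ch)
    return out

# Devanagari KA + VIRAMA + SSA + VOWEL SIGN I
word = "\u0915\u094D\u0937\u093F"
legacy = clusters(word, keep_conjuncts=False)
enhanced = clusters(word, keep_conjuncts=True)
print(len(legacy), len(enhanced))
```

Production code should rely on a segmentation library built from the
released data rather than on hand-written rules like these.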

A dot release, version 35.1, is expected in April, with further changes
for the Japanese calendar.
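
The Japanese calendar conventions described above (Gannen numbering in
formats that include 年, narrow eras in numeric formats) can be
sketched in Python. This is a minimal illustration, not the CLDR data
or any library's API: the era table covers only two eras, the names
`ERAS`, `era_year`, `format_long` and `format_numeric` are hypothetical,
and the long pattern (era, year, 年, month, 月, day, 日) is an assumed
typical layout.

```python
from datetime import date

# (era name, narrow abbreviation, first day of the era), newest first.
# Only two eras are listed; this table is illustrative, not complete.
ERAS = [
    ("平成", "H", date(1989, 1, 8)),
    ("昭和", "S", date(1926, 12, 25)),
]

def era_year(d):
    """Return (era name, narrow era, year within the era) for a date."""
    for name, narrow, start in ERAS:
        if d >= start:
            return name, narrow, d.year - start.year + 1
    raise ValueError("date precedes the eras in this sketch")

def format_long(d):
    """Non-numeric format: the first year of an era is written 元年."""
    name, _, year = era_year(d)
    year_text = "元" if year == 1 else str(year)
    return f"{name}{year_text}年{d.month}月{d.day}日"

def format_numeric(d):
    """Numeric format: narrow era letter and plain digits."""
    _, narrow, year = era_year(d)
    return f"{narrow}{year}/{d.month}/{d.day}"

print(format_numeric(date(2019, 3, 27)))  # H31/3/27
print(format_long(date(1989, 1, 8)))      # 平成元年1月8日
```

Note that the numeric format keeps the digit 1 for a first year; in this
sketch only the format containing 年 switches to 元.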

For details, see Detailed Specification Changes
<https://sites.google.com/site/cldr/index/downloads/cldr-35?pli=1#TOC-Detailed-Specification-Changes>,
Detailed Structure Changes
<https://sites.google.com/site/cldr/index/downloads/cldr-35?pli=1#TOC-Detailed-Structure-Changes>,
and Detailed Data Changes
<https://sites.google.com/site/cldr/index/downloads/cldr-35?pli=1#TOC-Detailed-Data-Changes>.

------------------------------------------------------------------------
/Over 136,000 characters are available for adoption
<http://unicode.org/consortium/adopt-a-character.html>, to help the
Unicode Consortium’s work on digitally disadvantaged languages/

http://blog.unicode.org/2019/03/unicode-cldr-version-35-languagelocale.html

Received on Wed Mar 27 2019 - 16:20:34 CDT