[Unicode]   Common Locale Data Repository : Bug Tracking Home | Site Map | Search
 
Modify

CLDR Ticket #9151(accepted data)

Opened 20 months ago

Last modified 19 months ago

Latvian collation chart misses rules for letters Ā Ē Ī Ū

Reported by: LazyDelphiBuilder@… Owned by: markus
Component: collation Data Locale: LV
Phase: rc Review:
Weeks: Data Xpath:
Xref:

Description

I checked collation charts versions 27 & 28 - they miss rules for latvian letters Āā Ēē Īī Ūū
http://www.unicode.org/cldr/charts/28/collation/lv.html

In practice, it leads to incorrect (not matching to order of letters in Latvian Alphabet) sorting order.

Example test case (for Ā letter):
Ab 1
Az 2
Āb 3
Āz 4
Cb 5

Sorted list of these 5 strings must be ordered according to nubmers next to letters. But in reality this order is not guaranteed, cause existing rules do not specify difference amongst Ā and A letters (same is for a and ā, Ē and E, ē and e, Ī and I, ī and i, Ū and U, ū and u).

Links:
Latvian Alphabet: https://en.wikipedia.org/wiki/Latvian_orthography
The Latvian Language Agency (goverment organisation): http://valoda.lv/en/Agenturas_darbiba/The_Latvian_Language_Agency/653/mid_668

p.s. the "funny thing" is that this rule impacts huge areas of computer software, e.g. Android and MacOs devices, databases etc. And I really wonder why the "bug" is still not fixed

Attachments

Change History

comment:1 Changed 19 months ago by emmons

  • Owner changed from anybody to markus
  • Phase changed from dsub to rc
  • Type changed from charts to data
  • Status changed from new to accepted
  • Milestone changed from UNSCH to upcoming

Markus please investigate

View

Add a comment

Modify Ticket

Action
as accepted
Author


E-mail address and user name can be saved in the Preferences.

 
Note: See TracTickets for help on using tickets.