CLDR Ticket #9151 (accepted data)

Opened 2 years ago

Last modified 2 years ago

Latvian collation chart misses rules for letters Ā Ē Ī Ū

Reported by: LazyDelphiBuilder@… Owned by: markus
Component: collation Data Locale: LV
Phase: rc Review:
Weeks: Data Xpath:


I checked the collation charts for versions 27 and 28: they are missing rules for the Latvian letters Āā Ēē Īī Ūū.

In practice, this leads to an incorrect sort order, one that does not match the order of letters in the Latvian alphabet.

Example test case (for Ā letter):
Ab 1
Az 2
Āb 3
Āz 4
Cb 5

A sorted list of these 5 strings must be ordered according to the numbers next to the strings. In reality this order is not guaranteed, because the existing rules do not specify a difference between the letters Ā and A (the same holds for a and ā, Ē and E, ē and e, Ī and I, ī and i, Ū and U, ū and u).
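
For illustration only, the expected order of the test case above can be reproduced with a small sort key. This is a minimal sketch, not CLDR's or ICU's implementation: the alphabet string follows the Latvian alphabet as listed on the Wikipedia page linked below, and the names LATVIAN_ALPHABET, RANK and latvian_key are hypothetical. It assumes that each macron letter is a separate letter that sorts directly after its base letter, which is the behaviour requested in this ticket.

```python
import unicodedata

# Assumption: the 33-letter Latvian alphabet, with each macron letter
# placed directly after its base letter (the order this ticket expects).
LATVIAN_ALPHABET = "aābcčdeēfgģhiījkķlļmnņoprsštuūvzž"

# Map every letter to its position in the alphabet.
RANK = {ch: i for i, ch in enumerate(LATVIAN_ALPHABET)}


def latvian_key(s):
    """Hypothetical sort key: compare letter by letter in alphabet order."""
    # NFC makes sure "Ā" is one precomposed code point, not "A" + macron.
    lowered = unicodedata.normalize("NFC", s).lower()
    # Characters outside the alphabet sort after all Latvian letters.
    primary = tuple(RANK.get(ch, len(RANK) + ord(ch)) for ch in lowered)
    # The original string is only a tie-breaker to keep the order stable.
    return (primary, s)


words = ["Cb", "Āz", "Ab", "Āb", "Az"]
sorted_words = sorted(words, key=latvian_key)
print(sorted_words)  # ['Ab', 'Az', 'Āb', 'Āz', 'Cb']
```

The printed order matches the numbers 1 to 5 in the test case. A real fix would belong in the Latvian collation data itself rather than in application code like this.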

Latvian Alphabet: https://en.wikipedia.org/wiki/Latvian_orthography
The Latvian Language Agency (government organisation): http://valoda.lv/en/Agenturas_darbiba/The_Latvian_Language_Agency/653/mid_668

P.S. The "funny thing" is that this rule affects huge areas of computer software, e.g. Android and macOS devices, databases, etc. I really wonder why the "bug" is still not fixed.


Change History

comment:1 Changed 2 years ago by emmons

  • Owner changed from anybody to markus
  • Phase changed from dsub to rc
  • Type changed from charts to data
  • Status changed from new to accepted
  • Milestone changed from UNSCH to upcoming

Markus, please investigate.

