
CLDR Ticket #5859 (accepted data)

Opened 5 years ago

Last modified 3 years ago

Ambiguous Collation of ll before Middle Dot

Reported by: richard.wordingham@…
Owned by: markus
Component: collation
Data Locale: cy es sq
Phase: Review:
Weeks: Data Xpath:
Xref:

Description

Albanian, Spanish (traditional) and Welsh have collation tailorings for 'll', 'Ll' and 'LL'. If the root weightings for 'l·' and 'L·' are interpreted as contractions, as implied by allkeys_CLDR.txt and the published definition of FractionalUCA.txt (that definition is probably wrong, which is to be dealt with under ticket 5850), then longest-match contraction handling consumes the tailored 'll' first, and the sequence ll· gets the weight CE(ll)CE(·) rather than CE(ll)CE(l|·). To correct this anomaly, the following three prefix rules need to be added for each of the three locales:

&\u0FDD0p=ll|·
&\u0FDD0p=Ll|·
&\u0FDD0p=LL|·

(The use of 'p' is correct for UCA 6.2.0 and CLDR Version 23.)

Applications interpreting prefix rules as equivalent to single contractions will then derive the same collations as implementations interpreting '004C | 00B7' in FractionalUCA.txt as an ICU prefix rule.
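
The discrepancy can be illustrated with a toy lookup. This is only a sketch under stated assumptions, not ICU or CLDR code: the function collation_elements, the table names and the string labels such as "CE(l|·)" are hypothetical stand-ins for real collation elements, and the patched table assumes that an implementation treating the new rule as a single contraction ends up with the same elements as the prefix interpretation. Only the lowercase case is shown; 'Ll' and 'LL' behave the same way.

```python
MIDDLE_DOT = "\u00B7"


def collation_elements(text, contractions, prefix_rules=None):
    """Toy left-to-right lookup: longest contraction match, then prefix context.

    `contractions` maps a string to its list of CE labels.
    `prefix_rules` maps (prefix, string) to the CE labels used for `string`
    when the text before it ends with `prefix`. Both tables are hypothetical.
    """
    prefix_rules = prefix_rules or {}
    result = []
    i = 0
    while i < len(text):
        # Find the longest entry in the table that starts at position i.
        for length in range(len(text) - i, 0, -1):
            chunk = text[i:i + length]
            if chunk in contractions:
                break
        else:
            raise KeyError(f"no mapping for {text[i]!r}")
        ces = contractions[chunk]
        # A prefix rule overrides the mapping when the preceding text matches;
        # the longest matching prefix wins.
        best = -1
        for (prefix, target), prefixed_ces in prefix_rules.items():
            if target == chunk and text[:i].endswith(prefix) and len(prefix) > best:
                best = len(prefix)
                ces = prefixed_ces
        result.extend(ces)
        i += len(chunk)
    return result


# Tailored locale: 'll' is a contraction with its own collation element.
base = {"l": ["CE(l)"], "ll": ["CE(ll)"], MIDDLE_DOT: ["CE(·)"]}

# Interpretation 1: root 'l·' is read as an ordinary contraction.
as_contraction = {**base, "l" + MIDDLE_DOT: ["CE(l)", "CE(l|·)"]}

# Interpretation 2: root 'l | ·' is read as a prefix (context-before) rule.
as_prefix = {("l", MIDDLE_DOT): ["CE(l|·)"]}

# Interpretation 1 after the proposed fix, with 'll|·' read as one contraction.
patched = {**as_contraction, "ll" + MIDDLE_DOT: ["CE(ll)", "CE(l|·)"]}

word = "ll" + MIDDLE_DOT
print(collation_elements(word, as_contraction))   # 'll' wins, '·' loses its context
print(collation_elements(word, base, as_prefix))  # '·' still sees the preceding 'l'
print(collation_elements(word, patched))          # agrees with the prefix reading
```

In this model the two interpretations already agree on plain 'l·' and diverge only once the tailored 'll' contraction is present, which is why the extra rules are needed per locale rather than in the root.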

Change History

comment:1 Changed 5 years ago by emmons

  • Owner changed from anybody to markus
  • Status changed from new to assigned

comment:2 Changed 3 years ago by markus

  • Type changed from defect to data

comment:3 Changed 2 years ago by srl

  • Status changed from assigned to accepted