[Unicode]   Common Locale Data Repository : Bug Tracking Home | Site Map | Search

CLDR Ticket #6738(closed defect: fixed)

Opened 5 years ago

Last modified 4 years ago

collation starred relations should only contain NFD-inert characters

Reported by: markus Owned by: markus
Component: collation Data Locale:
Phase: Review: pedberg
Weeks: 0.1 Data Xpath:



When we introduced collation starred relations (compact syntax), I think we said we wanted to forbid characters that are not NFD-inert, so that there is no ambiguity when rule strings are decomposed or otherwise normalized. If we still have consensus on this, then we should document and enforce it.

For example, fa.txt contains a starred rule with decomposable characters: <<*أٲإٳؤ In escaped form, it is <<*\u0623\u0672\u0625\u0673\u0624. U+0623 ARABIC LETTER ALEF WITH HAMZA ABOVE and U+0625 ARABIC LETTER ALEF WITH HAMZA BELOW are composites.


Change History

comment:1 Changed 5 years ago by emmons

  • Owner changed from anybody to markus
  • Priority changed from assess to medium
  • Status changed from new to assigned
  • Milestone changed from UNSCH to 25rc

TC agrees with Markus's assessment.

comment:2 Changed 4 years ago by markus

  • Cc mark, yoshito, emmons added
  • Status changed from assigned to reviewing
  • Review set to pedberg

comment:3 Changed 4 years ago by pedberg

  • Cc markus added
  • Status changed from reviewing to closed
  • Xref set to 7060
  • Resolution set to fixed

Filed cldrbug 7060: to add a test for this.

comment:4 Changed 4 years ago by emmons

  • Milestone 25rc deleted

Milestone 25rc deleted


Add a comment

Modify Ticket

as closed
Next status will be 'new'
Next status will be 'closed'

E-mail address and user name can be saved in the Preferences.

Note: See TracTickets for help on using tickets.