[Unicode]   Common Locale Data Repository : Bug Tracking Home | Site Map | Search

CLDR Ticket #10799(accepted data)

Opened 8 months ago

Last modified 5 months ago

Urdu collator doesn't place U+0647 and U+0622 in the correct order

Reported by: shehzaad@… Owned by: markus
Component: collation Data Locale: ur
Phase: rc Review:
Weeks: 0.4 Data Xpath:


Copied by markus from IcuBug:13499 (with U+ notation fixed) --

U+0647 should be between U+0648 and U+06BE.
Right now U+0647 ends up at the end of the sorted list of the characters in the entire alphabet:

U+0622 appears after U+0627, when is should appear as the first character. (U+0622 is a variation of U+0627 and in dictionaries appears as the first letter). See https://ia600901.us.archive.org/23/items/FerozUlLughat/Feroz-Ul-Lughat%20Jame%20By%20Maulvi%20Ferozuddin.pdf (the de-facto authoritative dictionary of Urdu).

The expected order of the alphabet should be:

{"\u0622", "\u0627","\u0628","\u067E","\u062A","\u0679","\u062B",


Change History

comment:1 Changed 7 months ago by mark

  • Owner changed from anybody to markus
  • Priority changed from assess to major
  • Status changed from new to accepted
  • Milestone changed from UNSCH to 33

comment:2 Changed 5 months ago by markus

  • Keywords punt33 added
  • Milestone changed from 33 to 34

Add a comment

Modify Ticket

as accepted

E-mail address and user name can be saved in the Preferences.

Note: See TracTickets for help on using tickets.