[Unicode]   Common Locale Data Repository : Bug Tracking Home | Site Map | Search
 
Modify

CLDR Ticket #10799(accepted data)

Opened 5 months ago

Last modified 2 months ago

Urdu collator doesn't place U+0647 and U+0622 in the correct order

Reported by: shehzaad@… Owned by: markus
Component: collation Data Locale: ur
Phase: rc Review:
Weeks: 0.4 Data Xpath:
Xref:

Description

Copied by markus from IcuBug:13499 (with U+ notation fixed) --

U+0647 should be between U+0648 and U+06BE.
Right now U+0647 ends up at the end of the sorted list of the characters in the entire alphabet:

U+0622 appears after U+0627, when is should appear as the first character. (U+0622 is a variation of U+0627 and in dictionaries appears as the first letter). See https://ia600901.us.archive.org/23/items/FerozUlLughat/Feroz-Ul-Lughat%20Jame%20By%20Maulvi%20Ferozuddin.pdf (the de-facto authoritative dictionary of Urdu).

The expected order of the alphabet should be:

{"\u0622", "\u0627","\u0628","\u067E","\u062A","\u0679","\u062B",
 "\u062C","\u0686","\u062D","\u062E","\u062F","\u0688",
 "\u0630","\u0631","\u0691","\u0632","\u0698","\u0633",
 "\u0634","\u0635","\u0636","\u0637","\u0638","\u0639",
 "\u063A","\u0641","\u0642","\u06A9","\u06AF","\u0644",
 "\u0645","\u0646","\u0648","\u0647","\u06BE","\u0621",
 "\u06CC","\u06D2"};

Attachments

Change History

comment:1 Changed 5 months ago by mark

  • Owner changed from anybody to markus
  • Priority changed from assess to major
  • Status changed from new to accepted
  • Milestone changed from UNSCH to 33

comment:2 Changed 2 months ago by markus

  • Keywords punt33 added
  • Milestone changed from 33 to 34
View

Add a comment

Modify Ticket

Action
as accepted
Author


E-mail address and user name can be saved in the Preferences.

 
Note: See TracTickets for help on using tickets.