[Unicode] Common Locale Data Repository: Bug Tracking

CLDR Ticket #10460 (closed data: fixed)

Opened 14 months ago

Last modified 10 months ago

ar_IQ.xml has a typo in one instance of the month name October

Reported by: maiku.fabian@…
Owned by: kristi
Component: unknown
Data Locale:
Phase: dsub
Review: fredrik
Weeks:
Data Xpath:
Xref:

Description

http://unicode.org/repos/cldr/trunk/common/main/ar_IQ.xml

contains the full month name for “October” 4 times:

<calendar type="gregorian">
  <months>
    <monthContext type="format">
      <monthWidth type="abbreviated">
        ...
        <month type="10">تشرین الأول</month>
        ...
      </monthWidth>
      <monthWidth type="narrow">
        ...
      </monthWidth>
      <monthWidth type="wide">
        ...
        <month type="10">تشرين الأول</month>
        ...
      </monthWidth>
    </monthContext>
    <monthContext type="stand-alone">
      <monthWidth type="abbreviated">
        ...
        <month type="10">تشرين الأول</month>
        ...
      </monthWidth>
      <monthWidth type="narrow">
        ...
      </monthWidth>
      <monthWidth type="wide">
        ...
        <month type="10">تشرين الأول</month>
        ...
      </monthWidth>
    </monthContext>
  </months>
</calendar>

The sequence in "format" "abbreviated" is different from the other 3 sequences.
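A comparison like this can also be automated. The following is a minimal sketch, not part of CLDR tooling: it assumes a reduced fragment (`SAMPLE`, built here from only the four `month type="10"` entries quoted above, with the elided months left out) and a hypothetical helper `group_by_spelling` that groups the (context, width) pairs by the exact string they contain. In locales where abbreviated and wide names legitimately differ, such grouping would need to be done per width instead.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# The two spellings, written with escapes so the differing letter is visible.
FARSI_YEH_SPELLING = "\u062a\u0634\u0631\u06cc\u0646 \u0627\u0644\u0623\u0648\u0644"
ARABIC_YEH_SPELLING = "\u062a\u0634\u0631\u064a\u0646 \u0627\u0644\u0623\u0648\u0644"

# Illustrative fragment only: just the month "10" entries quoted in the ticket.
SAMPLE = (
    '<calendar type="gregorian"><months>'
    '<monthContext type="format">'
    f'<monthWidth type="abbreviated"><month type="10">{FARSI_YEH_SPELLING}</month></monthWidth>'
    f'<monthWidth type="wide"><month type="10">{ARABIC_YEH_SPELLING}</month></monthWidth>'
    '</monthContext>'
    '<monthContext type="stand-alone">'
    f'<monthWidth type="abbreviated"><month type="10">{ARABIC_YEH_SPELLING}</month></monthWidth>'
    f'<monthWidth type="wide"><month type="10">{ARABIC_YEH_SPELLING}</month></monthWidth>'
    '</monthContext>'
    '</months></calendar>'
)


def group_by_spelling(root, month="10"):
    """Map each distinct month string to the (context, width) pairs using it."""
    groups = defaultdict(list)
    for context in root.iter("monthContext"):
        for width in context.iter("monthWidth"):
            for entry in width.iter("month"):
                if entry.get("type") == month:
                    groups[entry.text].append((context.get("type"), width.get("type")))
    return dict(groups)


groups = group_by_spelling(ET.fromstring(SAMPLE))
for spelling, places in groups.items():
    code_points = " ".join(f"{ord(ch):04x}" for ch in spelling)
    print(places, code_points)
```

A spelling used in only one place, while all other places agree, is a candidate for review rather than proof of an error.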

The sequence in "format" "abbreviated" has these Unicode code points:

~$ echo -n تشرین الأول | iconv -f utf-8 -t utf16le | od -t x2
0000000 062a 0634 0631 06cc 0646 0020 0627 0644
0000020 0623 0648 0644
0000026
~$

and the other 3 sequences have these Unicode code points:

~$ echo -n تشرين الأول | iconv -f utf-8 -t utf16le | od -t x2
0000000 062a 0634 0631 064a 0646 0020 0627 0644
0000020 0623 0648 0644
0000026
~$

The difference is the 4th code point which is ی U+06CC ARABIC LETTER FARSI YEH
in the sequence in "format" "abbreviated" but
ي U+064A ARABIC LETTER YEH in the other 3 sequences.
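The same comparison can be reproduced without `iconv` and `od`. This is a minimal sketch using Python's standard `unicodedata` module; the variable names and the helper `differing_positions` are illustrative, and the strings are spelled out as escapes taken from the two dumps above.

```python
import unicodedata

# Code points copied from the two od dumps above.
FORMAT_ABBREVIATED = "\u062a\u0634\u0631\u06cc\u0646\u0020\u0627\u0644\u0623\u0648\u0644"
OTHER_THREE = "\u062a\u0634\u0631\u064a\u0646\u0020\u0627\u0644\u0623\u0648\u0644"


def differing_positions(a, b):
    """Return (index, char_a, char_b) for every position where a and b differ."""
    return [(i, x, y) for i, (x, y) in enumerate(zip(a, b)) if x != y]


diffs = differing_positions(FORMAT_ABBREVIATED, OTHER_THREE)
for index, x, y in diffs:
    # index is 0-based, so the "4th code point" is reported as index 3
    print(
        f"index {index}: U+{ord(x):04X} {unicodedata.name(x)}"
        f" vs U+{ord(y):04X} {unicodedata.name(y)}"
    )
```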

I think U+06CC in the "format" "abbreviated" sequence is a typo.
The sequences seem to render identically, but ی U+06CC ARABIC LETTER FARSI YEH
does not seem to belong here.

ی U+06CC ARABIC LETTER FARSI YEH also cannot be converted to ISO-8859-6,
another hint that this character is wrong for Arabic.
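The ISO-8859-6 observation can be checked with a short script. This is a minimal sketch that assumes Python's built-in `iso8859_6` codec as a stand-in for the `iconv` conversion; `unencodable` is a hypothetical helper name, and the strings are the two spellings written as escapes.

```python
FORMAT_ABBREVIATED = "\u062a\u0634\u0631\u06cc\u0646 \u0627\u0644\u0623\u0648\u0644"
OTHER_THREE = "\u062a\u0634\u0631\u064a\u0646 \u0627\u0644\u0623\u0648\u0644"


def unencodable(text, encoding="iso8859_6"):
    """Return the characters of text that the given codec cannot encode."""
    rejected = []
    for ch in text:
        try:
            ch.encode(encoding)
        except UnicodeEncodeError:
            rejected.append(ch)
    return rejected


for label, text in (("format/abbreviated", FORMAT_ABBREVIATED), ("other three", OTHER_THREE)):
    rejected = [f"U+{ord(ch):04X}" for ch in unencodable(text)]
    print(label, "rejected:", rejected)
```

Failing to encode in a legacy Arabic charset is only a hint, as the description says: it shows the character is outside that repertoire, not that it is wrong for every Arabic-script locale.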

Change History

comment:1 Changed 14 months ago by carlos@…

I agree with Mike. This looks like a typo: the medial form of ARABIC LETTER FARSI YEH has two dots below, just like ARABIC LETTER YEH, so the two letters are easy to confuse in this position.

comment:2 Changed 12 months ago by mark

  • Owner changed from anybody to kristi
  • Priority changed from assess to medium
  • Type changed from unknown to data
  • Status changed from new to accepted
  • Milestone changed from UNSCH to 32

comment:3 Changed 11 months ago by kristi

  • Status changed from accepted to reviewing
  • Review set to fredrik

comment:4 Changed 11 months ago by fredrik

Reached out to our Arabic vetter to double-check.

comment:5 Changed 10 months ago by fredrik

  • Status changed from reviewing to closed
  • Resolution set to fixed