[Unicode]   Common Locale Data Repository : Bug Tracking Home | Site Map | Search
 
Modify

CLDR Ticket #10738(closed data: fixed)

Opened 12 months ago

Last modified 6 weeks ago

Wrong glyph in Kyrgyz (Kirghiz) XML

Reported by: a@… Owned by: mark
Component: main Data Locale: KY
Phase: dsub Review: kristi
Weeks: Data Xpath:
Xref:

ticket:10739

Description

In http://unicode.org/repos/cldr/trunk/common/main/ky.xml

<exemplarCharacters>
[а б г д е ё ж з и й к л м н ӊ о ө п р с т у ү х ч ш ъ ы э ю я]
</exemplarCharacters>

ӊ ‎04CA CYRILLIC SMALL LETTER EN WITH TAIL
is wrong

The correct character should be:
ң ‎04A3 CYRILLIC SMALL LETTER EN WITH DESCENDER

Uppercase shows the correct characters.
<exemplarCharacters type="index">
[А Б В Г Д Е Ё Ж З И Й К Л М Н Ң О Ө П Р С Т У Ү Ф Х Ц Ч Ш Щ Ъ Ы Ь Э Ю Я]
</exemplarCharacters>

Proof link 1: https://www.paratype.com/help/language/language1.asp?langCode=41
Proof link 2: https://www.eki.ee/letter/chardata.cgi?ucode=04A3

Attachments

Change History

comment:1 Changed 9 months ago by srl

  • Xref set to 10739

comment:2 Changed 6 months ago by kristi

  • Owner changed from anybody to mark
  • Status changed from new to accepted
  • Milestone changed from UNSCH to 34

comment:3 Changed 5 months ago by mark

My reservation is that the translators produced these results, so they were probably on keyboards. On the other hand, the fact that we had 04CA as the character may have caused translators to change their text on input.

So the question is: what forms are used by the most common keyboards used by Kyrgyz users?

Once we establish what the preferred form is, we can use DAIP to pick the right form automatically. That means that we don't have to apply this fix before submission. That is, the DAIP can automatically change:

ӊ ‎04CA CYRILLIC SMALL LETTER EN WITH TAIL

to

ң ‎04A3 CYRILLIC SMALL LETTER EN WITH DESCENDER

or vice versa (depending on the right form).

Last edited 5 months ago by mark (previous) (diff)

comment:4 Changed 5 months ago by mark

  • Cc kristi, chiara, fredrik added

comment:5 Changed 5 months ago by kristi

The index characters contains Ң (U+04A2 Cyrillic capital letter EN with descender), which is fine, but the main letters contains ӊ (U+04CA Cyrillic small letter EN with tail) instead of ң (U+04A3 Cyrillic small letter EN with descender).

The result is that
There are lots of warnings are for terms that contain the letter that is missing from the core data. For example: http://st.unicode.org/cldr-apps/v#/ky/Activities/cd73a73601d9c8a

comment:6 Changed 5 months ago by mark

  • Component changed from unknown to survey

comment:7 Changed 5 months ago by mark

  • Type changed from unknown to survey

comment:8 Changed 2 months ago by mark

  • Status changed from accepted to reviewing
  • Review set to kristi

comment:9 Changed 6 weeks ago by pedberg

  • Type changed from survey to data
  • Component changed from survey to main

comment:10 Changed 6 weeks ago by kristi

  • Status changed from reviewing to closed
  • Resolution set to fixed
View

Add a comment

Modify Ticket

Action
as closed
Next status will be 'new'
Next status will be 'closed'
Author


E-mail address and user name can be saved in the Preferences.

 
Note: See TracTickets for help on using tickets.