CLDR Ticket #10738 (accepted survey)

Opened 9 months ago

Last modified 7 weeks ago

Wrong glyph in Kyrgyz (Kirghiz) XML

Reported by: a@…
Owned by: mark
Component: survey
Data Locale: KY
Phase: dsub
Review:
Weeks:
Data Xpath:
Xref: ticket:10739

Description

In http://unicode.org/repos/cldr/trunk/common/main/ky.xml

<exemplarCharacters>
[а б г д е ё ж з и й к л м н ӊ о ө п р с т у ү х ч ш ъ ы э ю я]
</exemplarCharacters>

ӊ (U+04CA CYRILLIC SMALL LETTER EN WITH TAIL) is wrong.

The correct character should be:
ң (U+04A3 CYRILLIC SMALL LETTER EN WITH DESCENDER)

The uppercase index set shows the correct character:
<exemplarCharacters type="index">
[А Б В Г Д Е Ё Ж З И Й К Л М Н Ң О Ө П Р С Т У Ү Ф Х Ц Ч Ш Щ Ъ Ы Ь Э Ю Я]
</exemplarCharacters>

Proof link 1: https://www.paratype.com/help/language/language1.asp?langCode=41
Proof link 2: https://www.eki.ee/letter/chardata.cgi?ucode=04A3
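
The difference between the two letters is easy to miss visually, so a quick check of the code points can help. The following is a minimal sketch using Python's standard `unicodedata` module; the constant names and the `describe` helper are illustrative and are not part of CLDR.

```python
import unicodedata

EN_WITH_TAIL = "\u04ca"        # character currently in the main exemplar set
EN_WITH_DESCENDER = "\u04a3"   # character the reporter proposes
INDEX_CAPITAL = "\u04a2"       # capital letter already in the index exemplar set


def describe(ch: str) -> str:
    """Return 'U+XXXX NAME' for a single character."""
    return f"U+{ord(ch):04X} {unicodedata.name(ch)}"


for ch in (EN_WITH_TAIL, EN_WITH_DESCENDER, INDEX_CAPITAL):
    print(describe(ch))

# Lowercasing the index capital yields the descender form, not the tail form.
print(INDEX_CAPITAL.lower() == EN_WITH_DESCENDER)
print(INDEX_CAPITAL.lower() == EN_WITH_TAIL)

# Compatibility normalization does not unify the two lowercase letters.
print(unicodedata.normalize("NFKC", EN_WITH_TAIL) == EN_WITH_DESCENDER)
```

Because the two letters are separate code points with separate case pairs, the main and index exemplar sets quoted above are not case-consistent with each other.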

Attachments

Change History

comment:1 Changed 6 months ago by srl

  • Xref set to 10739

comment:2 Changed 2 months ago by kristi

  • Owner changed from anybody to mark
  • Status changed from new to accepted
  • Milestone changed from UNSCH to 34

comment:3 Changed 2 months ago by mark

My reservation is that translators produced these results, so the characters probably came from their keyboards. On the other hand, the fact that we had 04CA as the exemplar character may have caused translators to change their text on input.

So the question is: what forms are used by the most common keyboards used by Kyrgyz users?

Once we establish what the preferred form is, we can use DAIP to pick the right form automatically. That means that we don't have to apply this fix before submission. That is, the DAIP can automatically change:

ӊ (U+04CA CYRILLIC SMALL LETTER EN WITH TAIL)

to

ң (U+04A3 CYRILLIC SMALL LETTER EN WITH DESCENDER)

or vice versa (depending on the right form).
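
The kind of automatic replacement described here could look roughly like the sketch below. This is not the DAIP code; `make_normalizer` is a hypothetical helper, and the choice of U+04A3 as the preferred form is an assumption that would be swapped if the keyboard question is answered the other way.

```python
EN_WITH_TAIL = "\u04ca"
EN_WITH_DESCENDER = "\u04a3"


def make_normalizer(preferred: str, other: str):
    """Build a function that rewrites `other` to `preferred` in both cases."""
    table = str.maketrans({
        other: preferred,
        other.upper(): preferred.upper(),
    })
    return lambda text: text.translate(table)


# Assumption: the descender form is the preferred one.
normalize_en = make_normalizer(preferred=EN_WITH_DESCENDER, other=EN_WITH_TAIL)

sample = "\u0436\u0430\u04ca\u044b"  # sample string containing U+04CA
print([f"U+{ord(c):04X}" for c in normalize_en(sample)])
```

Swapping the two arguments gives the "vice versa" direction without any other change, which is why the decision about the preferred form can be made after submission.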

Last edited 2 months ago by mark

comment:4 Changed 2 months ago by mark

  • Cc kristi, chiara, fredrik added

comment:5 Changed 2 months ago by kristi

The index exemplar set contains Ң (U+04A2 CYRILLIC CAPITAL LETTER EN WITH DESCENDER), which is fine, but the main exemplar set contains ӊ (U+04CA CYRILLIC SMALL LETTER EN WITH TAIL) instead of ң (U+04A3 CYRILLIC SMALL LETTER EN WITH DESCENDER).

The result is that there are many warnings for terms that contain the letter missing from the core data. For example: http://st.unicode.org/cldr-apps/v#/ky/Activities/cd73a73601d9c8a
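
A rough illustration of why such warnings appear: any term typed with U+04A3 falls outside a main exemplar set that lists U+04CA. The sketch below is not the Survey Tool's actual check; `chars_outside` is a hypothetical helper, the exemplar string is copied from the `ky.xml` excerpt in the description, and the sample term is only an illustrative string.

```python
# Main exemplar set as quoted in the ticket description (includes U+04CA).
BASIC = "абгдежзийклмнопрстухчшъыэюя"
EXTRA = "\u0451\u04e9\u04af"  # U+0451, U+04E9, U+04AF from the same set
CURRENT_MAIN = set(BASIC + EXTRA + "\u04ca")

# Same set with U+04CA replaced by U+04A3, as the reporter proposes.
PROPOSED_MAIN = (CURRENT_MAIN - {"\u04ca"}) | {"\u04a3"}


def chars_outside(term: str, exemplars: set) -> list:
    """Return the sorted letters of `term` that are not in `exemplars`."""
    return sorted({c for c in term.lower() if c.isalpha() and c not in exemplars})


term = "\u0436\u0430\u04a3\u044b"  # sample term typed with U+04A3
print([f"U+{ord(c):04X}" for c in chars_outside(term, CURRENT_MAIN)])
print([f"U+{ord(c):04X}" for c in chars_outside(term, PROPOSED_MAIN)])
```

With the set as currently published, the sample term reports one letter outside the exemplars; with the proposed set it reports none.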

comment:6 Changed 7 weeks ago by mark

  • Component changed from unknown to survey

comment:7 Changed 7 weeks ago by mark

  • Type changed from unknown to survey