[Unicode]   Common Locale Data Repository : Bug Tracking Home | Site Map | Search

CLDR Ticket #6785(closed: fixed)

Opened 5 years ago

Last modified 4 months ago

Fix some replacements

Reported by: mark Owned by: mark
Component: other Data Locale:
Phase: Review: pedberg
Weeks: Data Xpath:

Description (last modified by mark) (diff)

  1. Moldovan, and fix to LDML.
  1. The line

<languageAlias type="mo" replacement="ro" reason="deprecated"/> <!-- Moldovan -->

This would give better results as:

<languageAlias type="mo" replacement="ro_MD" reason="deprecated"/> <!-- Moldovan -->

Like we do with

<languageAlias type="sh" replacement="sr_Latn" reason="legacy"/> <!-- Serbo-Croatian -->

  1. In any event, the text in http://www.unicode.org/reports/tr35/#BCP_47_Language_Tag_Conversion must be fixed to have the right algorithm for replacement, so that it handles 'sh' properly. The key is that while the original base language will always be changed by languageAlias, any other existing subtags will not. So "sh-TR" => "sr-Latn-TR", but "sh-Cyrl" => "sr-Cyrl".
  1. We sometimes have multiple replacements:

<territoryAlias type="SU" replacement="RU AM AZ BY EE GE KZ KG LV LT MD TJ TM UA UZ" reason="deprecated"/> <!-- Union of Soviet Socialist Republics -->

Right now, the first listed one is the most likely, in the absence of other information. However, we could specify a slightly more sophisticated replacement algorithm for territory replacement.

  1. If there is a single territory in the replacement, use it.
  2. Otherwise, look up the most likely territory for the base language code (and script, if there is one).
  3. If that likely territory is in the list, use it.
  4. Otherwise, use the first territory in the list.

Thus, for example "hy-SU" (Armenian as used in the Soviet Union) becomes "hy-AM" (Armenian as used in Armenia).

This is not a high priority fix, but that could change it


Change History

comment:1 Changed 5 years ago by mark

  • Description modified (diff)

comment:2 Changed 5 years ago by emmons

  • Status changed from new to assigned
  • Component changed from unknown to data
  • Priority changed from assess to major
  • Milestone changed from UNSCH to 25M1
  • Owner changed from anybody to mark
  • type changed from unknown to enhancement

comment:3 Changed 5 years ago by mark

Split off documentation part of this into ticket:6901

Last edited 5 years ago by mark (previous) (diff)

comment:4 Changed 5 years ago by mark

  • Review set to pedberg

comment:5 Changed 5 years ago by pedberg

  • Status changed from assigned to closed
  • Resolution set to fixed

comment:6 Changed 5 years ago by emmons

  • Milestone 25M1 deleted

Milestone 25M1 deleted

comment:7 Changed 4 months ago by mark

  • Component changed from main to other

Add a comment

Modify Ticket

as closed
Next status will be 'new'
Next status will be 'closed'

E-mail address and user name can be saved in the Preferences.

Note: See TracTickets for help on using tickets.