[Unicode]   Common Locale Data Repository : Bug Tracking Home | Site Map | Search

CLDR Ticket #9000(accepted data)

Opened 3 years ago

Last modified 10 months ago

Add subdivision names in more languages

Reported by: mark Owned by: mark
Component: other Data Locale:
Phase: dvet Review:
Weeks: Data Xpath:


The goal is not complete coverage, but rather to cover the subdivisions in the main languages of each country, plus other subdivisions where the data is relatively easy to extract.

Here's a draft process, but this will need development and refinement as we go along.

  1. Start with a goal of at least one official (de jure or de facto) language for each country. However, based on availability and quality of data, that scope could be expanded (eg adding the names of German Länder in Russian). Initially limit to only "modern coverage" CLDR languages.
  2. Document the process used to clean up the English names (techniques for resolving conflicts, producing more customary names: eg "State of California" => "California") so that translators have something to start with.
  3. Extract native language subdivision names for subdivisions based on Wikipedia data and/or other sources.
  4. Produce spreadsheets for each language listing the subdivision code, English name, and possible native names, maybe also wikipedia links.
  5. Distributed these to translators for verification, probably in online-spreadsheet form.
  6. Process the resulting data into XML format. Residual conflicts are sent back to translators for review.

Note: we've found it best to do (e) and (f) for one or two languages first, to verify that the process works, before opening it up to more languages.


Change History

comment:1 Changed 3 years ago by emmons

  • Status changed from new to accepted
  • Component changed from unknown to other
  • Priority changed from assess to medium
  • Phase changed from dsub to final
  • Milestone changed from UNSCH to 29
  • Owner changed from anybody to mark
  • Type changed from unknown to data

comment:2 Changed 2 years ago by mark

  • Milestone changed from 29 to 30

comment:3 Changed 2 years ago by mark

  • Phase changed from final to dsub

comment:4 Changed 2 years ago by mark

  • Phase changed from dsub to dvet

comment:5 Changed 2 years ago by mark

  • Milestone changed from 30 to 31

comment:6 Changed 18 months ago by mark

  • Milestone changed from 31 to 32

Will wait til ST cycle

comment:7 Changed 10 months ago by mark

  • Milestone changed from 32 to UNSCH

Probably moot, given wikipedia import...


Add a comment

Modify Ticket

as accepted

E-mail address and user name can be saved in the Preferences.

Note: See TracTickets for help on using tickets.