[Unicode]   Common Locale Data Repository : Bug Tracking Home | Site Map | Search
 
Modify

CLDR Ticket #9000(accepted data)

Opened 23 months ago

Last modified 7 months ago

Add subdivision names in more languages

Reported by: mark Owned by: mark
Component: other Data Locale:
Phase: dvet Review:
Weeks: Data Xpath:
Xref:

Description

The goal is not complete coverage, but rather to cover the subdivisions in the main languages of each country, plus other subdivisions where the data is relatively easy to extract.

Here's a draft process, but this will need development and refinement as we go along.

  1. Start with a goal of at least one official (de jure or de facto) language for each country. However, based on availability and quality of data, that scope could be expanded (eg adding the names of German Länder in Russian). Initially limit to only "modern coverage" CLDR languages.
  2. Document the process used to clean up the English names (techniques for resolving conflicts, producing more customary names: eg "State of California" => "California") so that translators have something to start with.
  3. Extract native language subdivision names for subdivisions based on Wikipedia data and/or other sources.
  4. Produce spreadsheets for each language listing the subdivision code, English name, and possible native names, maybe also wikipedia links.
  5. Distributed these to translators for verification, probably in online-spreadsheet form.
  6. Process the resulting data into XML format. Residual conflicts are sent back to translators for review.

Note: we've found it best to do (e) and (f) for one or two languages first, to verify that the process works, before opening it up to more languages.

Attachments

Change History

comment:1 Changed 23 months ago by emmons

  • Status changed from new to accepted
  • Component changed from unknown to other
  • Priority changed from assess to medium
  • Phase changed from dsub to final
  • Milestone changed from UNSCH to 29
  • Owner changed from anybody to mark
  • Type changed from unknown to data

comment:2 Changed 19 months ago by mark

  • Milestone changed from 29 to 30

comment:3 Changed 17 months ago by mark

  • Phase changed from final to dsub

comment:4 Changed 17 months ago by mark

  • Phase changed from dsub to dvet

comment:5 Changed 13 months ago by mark

  • Milestone changed from 30 to 31

comment:6 Changed 7 months ago by mark

  • Milestone changed from 31 to 32

Will wait til ST cycle

View

Add a comment

Modify Ticket

Action
as accepted
Author


E-mail address and user name can be saved in the Preferences.

 
Note: See TracTickets for help on using tickets.