[Unicode]   Common Locale Data Repository : Bug Tracking Home | Site Map | Search
 
Modify

CLDR Ticket #9915(closed data: no-time-to-do-this)

Opened 17 months ago

Last modified 17 months ago

Extend Territory-Language Information to all living languages

Reported by: mats.gbproject@… Owned by: anybody
Component: supplemental Data Locale:
Phase: dsub Review:
Weeks: Data Xpath:
Xref:

Description

I wonder if we could make it possible to add territory-language information for all living languages?
http://www.unicode.org/cldr/charts/latest/supplemental/territory_language_information.html

This would be a long-term project, but we could manage to get the data a little by little.

We could base it on ISO 639-3 codes for languages that are not already in the tabels.

Attachments

Change History

comment:1 Changed 17 months ago by srl

I wonder if we could make it possible

It is theoretically possible… but not clear what is supposed to happen with this ticket. Are you volunteering to provide data?

comment:2 Changed 17 months ago by mats.gbproject@…

There are 2 credible open source dataset we could use to get an initial mapping of all the languages in the world. There is a lot of data in Glottolog which is licensed under a creative commons:
glottolog.org/glottolog/language

We could also use the dataset in the Palaso Library which is licensed under a MIT license:
https://raw.githubusercontent.com/sillsdev/libpalaso/master/SIL.WritingSystems/Resources/LanguageIndex.txt
github.com/sillsdev/libpalaso/wiki/SIL.WritingSystems
It also have some interesting data about writing systems used for different languages:
github.com/sillsdev/libpalaso/blob/master/SIL.WritingSystems/Resources/alltags.txt

I think this would be a first great step for CLDR to better support more languages! For sure I'm volunteering to compare and analyze the data and try get overview of how they diverge and adapt the data to be implemented into CLDR. We also need to make sure languages are mapped correctly into the territories supported by CLDR that are not part of ISO 3166

(I had to unlink some sources as the ticket system faulty detect my post as spam because of "too many external links")

comment:3 Changed 17 months ago by emmons

  • Status changed from new to closed
  • Resolution set to no-time-to-do-this

This is way too big of a maintenance effort for us to CLDR to consider. CLDR TC discussion of 2016-12-07 consensus is that we don't have the resources to do this.

View

Add a comment

Modify Ticket

Action
as closed
Next status will be 'new'
Next status will be 'closed'
Author


E-mail address and user name can be saved in the Preferences.

 
Note: See TracTickets for help on using tickets.