Dataset for all ISO639 code sorted by country/territory?

Philippe Verdy verdy_p at
Sat Sep 17 06:35:20 CDT 2016

Not all languages are sorted that way, only those for which data has been
released in CLDR.
And languages frequently belong to several countries/territories at the
same time, with different official or recognized status (itself independent
of the number of actual speakers, which is very frequently only roughly
estimated).
Some countries publish official statistics about their national or
regional languages, but these statistics are frequently old, or
underestimated or overestimated for political reasons. Some languages are
counted together as if they were only one, or simply discarded when they
are considered locally as secondary languages, even where the official
language is only superficially understood yet counted as the primary one.
Statistics also tend to omit native speakers living abroad in a diaspora,
and secondary learners of a language taught in foreign countries.
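
For the languages that CLDR does cover, here is a minimal sketch of how the
territory data can be read and inverted, so that one language shows up under
every territory listing it. It assumes the territoryInfo / territory /
languagePopulation layout of CLDR's supplementalData.xml. The embedded
fragment and its percentages are made-up placeholders, not real CLDR
figures, and the two function names are only illustrative.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Illustrative fragment only: element and attribute names follow the layout
# of CLDR's supplementalData.xml, but the percentages are made-up placeholders.
SAMPLE = """
<supplementalData>
  <territoryInfo>
    <territory type="BE">
      <languagePopulation type="nl" populationPercent="55" officialStatus="official"/>
      <languagePopulation type="fr" populationPercent="38" officialStatus="official"/>
    </territory>
    <territory type="FR">
      <languagePopulation type="fr" populationPercent="95" officialStatus="official"/>
      <languagePopulation type="br" populationPercent="1"/>
    </territory>
  </territoryInfo>
</supplementalData>
"""


def languages_by_territory(xml_text):
    """Map territory code -> list of (language, percent, status or None)."""
    root = ET.fromstring(xml_text)
    result = defaultdict(list)
    for territory in root.iter("territory"):
        for lp in territory.iter("languagePopulation"):
            result[territory.get("type")].append(
                (
                    lp.get("type"),
                    float(lp.get("populationPercent")),
                    lp.get("officialStatus"),  # None when no status is recorded
                )
            )
    return dict(result)


def territories_by_language(by_territory):
    """Invert the mapping: language code -> sorted list of territory codes."""
    inverted = defaultdict(list)
    for territory, entries in by_territory.items():
        for language, _percent, _status in entries:
            inverted[language].append(territory)
    return {lang: sorted(codes) for lang, codes in inverted.items()}


if __name__ == "__main__":
    by_territory = languages_by_territory(SAMPLE)
    for lang, codes in sorted(territories_by_language(by_territory).items()):
        print(lang, codes)
```

The official status is kept separate from the population percentage, since
the two are independent of each other.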

2016-09-17 11:19 GMT+02:00 Mats Blakstad <mats.gbproject at>:

> Hi
> Is there any dataset that contains all languages in the world sorted by
> country/territory?
> I found this at Unicode, but it seems to contain only the most spoken
> languages in each country and not the smaller ones:
> information.html
> Thanks in advance for help.
> Best regards
> Mats Blakstad