[Unicode] Common Locale Data Repository: Bug Tracking

CLDR Ticket #8372 (accepted data)

Opened 2 years ago

Last modified 5 months ago

Bogus Kaithi values?

Reported by: mark
Owned by: rick
Component: supplemental
Data Locale:
Phase: rc
Review:
Weeks:
Data Xpath:


Description (last modified by markus)

141: <likelySubtag from="bh" to="bh_Kthi_IN"/> 
2,151: <likelySubtag from="und_Kthi" to="bh_Kthi_IN"/> 

These look bogus.

Kaithi is a 'historic' script (at least according to http://en.wikipedia.org/wiki/Kaithi), so it shouldn't be the customary script for any modern language.

'bh' is a collection, but the languages encompassed in that collection are modern, and none would thus appear to use Kaithi as the predominant script.

I'm guessing Devanagari, but this needs research.


Change History

comment:1 Changed 2 years ago by emmons

  • Status changed from new to accepted
  • Priority changed from assess to medium
  • Phase changed from dsub to rc
  • Milestone changed from UNSCH to 28
  • Owner changed from anybody to rick
  • Type set to data

comment:2 Changed 22 months ago by rick

This seems to originate in Script_Metadata.csv, where Kaithi script is listed with Bhojpuri as the likely language. But it's marked as ID Usage "Exclusion". I don't know what it should be, or what change would fix this bug...

Last edited 22 months ago by rick

comment:3 Changed 22 months ago by rick

Possibly this problem could be fixed by an override in "GenerateMaximalLocales.java".

Maybe adding a line like this to the LANGUAGE_OVERRIDES list:

{ "bh", "bh_Deva_IN" },

would fix it...? I don't know.
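
A minimal standalone sketch of what such an override might do, assuming the overrides are simply applied on top of the generated mappings. Only the LANGUAGE_OVERRIDES name and the { "bh", "bh_Deva_IN" } entry come from the comment above; the class name, the applyOverrides helper, and the map-based representation are hypothetical and are not taken from the actual CLDR tool.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Standalone illustration only; not the actual GenerateMaximalLocales code.
public class LikelySubtagOverrideSketch {

    // Same shape as the proposed line: { source, maximized target }.
    public static final String[][] LANGUAGE_OVERRIDES = {
        { "bh", "bh_Deva_IN" },
    };

    // Hypothetical helper: returns a copy of the generated mappings
    // in which every override pair replaces (or adds) its source key.
    public static Map<String, String> applyOverrides(
            Map<String, String> generated, String[][] overrides) {
        Map<String, String> result = new LinkedHashMap<>(generated);
        for (String[] pair : overrides) {
            result.put(pair[0], pair[1]);
        }
        return result;
    }

    public static void main(String[] args) {
        // The two entries quoted in the ticket description.
        Map<String, String> generated = new LinkedHashMap<>();
        generated.put("bh", "bh_Kthi_IN");
        generated.put("und_Kthi", "bh_Kthi_IN");

        Map<String, String> fixed = applyOverrides(generated, LANGUAGE_OVERRIDES);
        for (Map.Entry<String, String> e : fixed.entrySet()) {
            System.out.println("<likelySubtag from=\"" + e.getKey()
                    + "\" to=\"" + e.getValue() + "\"/>");
        }
    }
}
```

Under these assumptions, an override keyed on "bh" would change only the first entry; the und_Kthi mapping would still point at bh_Kthi_IN and would need to be handled separately.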

comment:4 Changed 21 months ago by emmons

  • Component changed from unknown to supplemental

comment:5 Changed 21 months ago by rick

  • Milestone changed from 28 to 29

comment:6 Changed 19 months ago by emmons

  • Milestone changed from 29 to upcoming

comment:7 Changed 8 months ago by rick

I believe the right answer is Devanagari (Deva) for the language(s) formerly written with Kaithi. The language most likely associated with Kaithi is "bho" (Bhojpuri), not "bh" or "bih".

comment:8 Changed 5 months ago by markus

  • Cc markus added
  • Xref set to 9168
  • Description modified

Is this obsolete since ticket:9168 went into CLDR 30?

