Unicode Common Locale Data Repository: Bug Tracking

CLDR Ticket #5577 (closed: fixed)

Opened 6 years ago

Last modified 5 years ago

Thai collation should put Thai first, before other scripts

Reported by: pedberg
Owned by: pedberg
Component: collation
Data Locale:
Phase:
Review: markus
Weeks:
Data Xpath:




The default Thai collation should put Thai first, before other scripts. We can use block (script) reordering for this.


Change History

comment:1 Changed 6 years ago by pedberg

  • Owner changed from anybody to pedberg
  • Priority changed from assess to major
  • Status changed from new to assigned
  • Milestone changed from UNSCH to 23dres

There was an old bug, or perhaps a Unicode PRI, that considered this and concluded that it should probably be done for most non-Latin-script languages except CJK languages. It might be worth finding this info and making the change more globally.

comment:2 Changed 6 years ago by pedberg

  • Keywords Apple-10916886 added
  • Status changed from assigned to accepted
  • Xref set to 4020

The old bugs and PRI are:

  • cldrbug 3070, to create Unicode PRI #178 about this (PRI active around May 2011); closed.
  • cldrbug 3522, with PRI #178 feedback about Japanese; closed as a dup of cldrbug 4020.
  • cldrbug 4020, with the text of PRI #178 and the feedback in cldrbug 3522. As far as I can tell from UTC agendas and PRI feedback, there was no other feedback on PRI #178. This ticket is currently assigned to Mark with milestone future.

I think we need more information about the conventions for different scripts/languages before doing this across all languages (except CJK), so I will leave the more global change to cldrbug 4020 and just use this ticket for Thai (for which we have specific feedback that this is desired) and some highly likely cases from cldrbug 4020 (sr and sr_Latn).

# th
<reorder>Thai Latn</reorder>

# sr
<reorder>Cyrl Latn</reorder>

# sr_Latn
<reorder>Latn Cyrl</reorder>

Last edited 6 years ago by markus

comment:3 Changed 6 years ago by pedberg

I am checking with some of our localizers to see if I should do this for selected other non-Latin-script languages.

comment:4 Changed 6 years ago by pedberg

I have confirmation from our Russian and Hebrew experts that it should be done for those languages. No other feedback yet on this.

comment:5 Changed 6 years ago by pedberg

  • Xref changed from 4020 to 4020, 5599

OK, for this ticket I will do th, sr, sr_Latn, ru, and he; others can be done under cldrbug 4020. And actually, the XML syntax in comment 2 above is bogus due to an error in UTS #35 (filed cldrbug 5599 about that). So what we actually want to do is add a reorder attribute to the settings element, as follows:

# th
    <settings ... reorder="Thai Latn">
# sr
    <settings ... reorder="Cyrl Latn">
# sr_Latn
    <settings ... reorder="Latn Cyrl">
# ru
    <settings ... reorder="Cyrl Latn">
# he
    <settings ... reorder="Hebr Latn">
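For readers unfamiliar with script reordering, here is a minimal sketch of the intended effect of a value such as reorder="Thai Latn". It is a toy model, not the CLDR/ICU collation algorithm: the names DEFAULT_ORDER, CODE_TO_NAME, script_order, and sort_key are hypothetical, the default script order is only an approximation of the root collation order for six scripts, and a character's script is guessed from the first word of its Unicode character name.

```python
import unicodedata

# Assumption: rough default order of a few scripts, approximating the root collation.
DEFAULT_ORDER = ["LATIN", "GREEK", "CYRILLIC", "HEBREW", "ARABIC", "THAI"]

# Hypothetical mapping from 4-letter script codes to Unicode character-name prefixes.
CODE_TO_NAME = {
    "Latn": "LATIN", "Grek": "GREEK", "Cyrl": "CYRILLIC",
    "Hebr": "HEBREW", "Arab": "ARABIC", "Thai": "THAI",
}


def script_order(reorder):
    """Listed scripts come first; the remaining scripts follow in default order."""
    listed = [CODE_TO_NAME[code] for code in reorder.split()]
    return listed + [s for s in DEFAULT_ORDER if s not in listed]


def sort_key(text, reorder=""):
    """Toy sort key: (script rank, code point) per character. Not real collation."""
    order = script_order(reorder)

    def char_key(ch):
        # Crude script detection: first word of the character name.
        script = unicodedata.name(ch, "").split(" ")[0]
        rank = order.index(script) if script in order else len(order)
        return (rank, ord(ch))

    return [char_key(ch) for ch in text]


words = ["zebra", "กา", "apple", "ขวด"]

# Default order: Latin before Thai.
print(sorted(words, key=sort_key))
# ['apple', 'zebra', 'กา', 'ขวด']

# With reorder="Thai Latn": Thai before Latin.
print(sorted(words, key=lambda w: sort_key(w, "Thai Latn")))
# ['กา', 'ขวด', 'apple', 'zebra']
```

A real implementation would reorder collation weights per script group rather than compare code points; the sketch only shows which script ends up first.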

comment:6 Changed 6 years ago by pedberg

  • Review set to mark

Actually, sr_Latn is just an alias to hr, so we make this change in hr, where it also makes sense. These changes will also cover bs and bs_Cyrl, which are aliased to hr and sr respectively.
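To make the alias coverage concrete, here is a small sketch of how a lookup might resolve the aliases described in this comment. The REORDER and ALIASES tables and the reorder_for helper are hypothetical illustrations, not CLDR data structures; the values are the ones proposed in comment 5, with the sr_Latn value assumed to land on hr.

```python
# Hypothetical table of proposed reorder values (from comment 5; hr assumed
# to take the value originally listed for sr_Latn).
REORDER = {
    "th": "Thai Latn",
    "sr": "Cyrl Latn",
    "hr": "Latn Cyrl",
    "ru": "Cyrl Latn",
    "he": "Hebr Latn",
}

# Collation aliases as described in this comment.
ALIASES = {"sr_Latn": "hr", "bs": "hr", "bs_Cyrl": "sr"}


def reorder_for(locale):
    """Return the reorder value for a locale, following one level of aliasing."""
    target = ALIASES.get(locale, locale)
    return REORDER.get(target)  # None when no reordering is defined


for loc in ("sr_Latn", "bs", "bs_Cyrl", "th", "de"):
    print(loc, "->", reorder_for(loc))
```

Because the aliased locales share the target's collation data, setting the value once on hr and sr is enough in this model.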

comment:7 Changed 6 years ago by pedberg

Got feedback that we should do this for Arabic too, so I added that.

comment:8 Changed 6 years ago by pedberg

  • Review changed from mark to markus

Removed the redundant Latn entries from the reorder values.
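The redundancy presumably comes from how reordering treats unlisted scripts: they follow the listed ones in their default order, and Latin is the first script in that default order. The sketch below illustrates this under stated assumptions; DEFAULT_ORDER is an approximate, truncated default script order and expand_reorder is a hypothetical helper, not an ICU or CLDR function.

```python
# Assumption: approximate default order for a handful of scripts.
DEFAULT_ORDER = ["Latn", "Grek", "Cyrl", "Hebr", "Arab", "Thai"]


def expand_reorder(reorder):
    """Expand a reorder value into a full script order:
    listed scripts first, then the rest in default order."""
    listed = reorder.split()
    return listed + [s for s in DEFAULT_ORDER if s not in listed]


# A trailing Latn changes nothing, because Latin already leads the remainder.
print(expand_reorder("Thai Latn") == expand_reorder("Thai"))  # True
print(expand_reorder("Cyrl Latn") == expand_reorder("Cyrl"))  # True

# A leading Latn is different: "Latn Cyrl" pulls Cyrillic ahead of Greek,
# so it is not equivalent to the default order.
print(expand_reorder("Latn Cyrl") == expand_reorder(""))  # False
```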

comment:9 Changed 6 years ago by markus

  • Status changed from accepted to closed
  • Resolution set to fixed

comment:10 Changed 5 years ago by emmons

  • Milestone 23dres deleted


