Unicode Common Locale Data Repository: Bug Tracking

CLDR Ticket #5588 (accepted spec)

Opened 4 years ago

Last modified 2 years ago

add variant of alternate=shifted that is consistent with the rest of the UCA

Reported by: markus
Owned by: markus
Component: collation
Data Locale:
Phase:
Review:
Weeks: 0.5
Data Xpath:


I looked into the details of the alternate=Shifted and Shift-Trimmed collation options, and found that they produce strange and surprising (to me) results. I wrote an ICU User Guide chapter explaining the various options and their results: http://userguide.icu-project.org/collation/customization/ignorepunct

I propose that we add a variant of "Shifted" that is consistent with the rest of the UCA, that is, one that yields behavior on the quaternary level consistent with the normal behavior on the secondary and tertiary levels. It is the last option in the new chapter. Implementation-wise it is a minor tweak of Shifted.

I am not sure what to call this option. It should probably have been the "Shifted" behavior when that was first introduced. For this proposed addition, I suggested Shifted-High, Mark suggested Variable-After. Shifted-consistent-with-the-rest-of-UCA?? See the new ICU User Guide chapter for details.


Change History

comment:1 Changed 4 years ago by markus

Mark and I talked this over some more (and refined the new ICU User Guide text).

We think that normally Shifted is good because it sorts, for example, "de luge" < "deluge", which we think is customary.

We think that when a string without variable characters should sort before variants of that string that contain variable characters, Variable-After would be better than Shift-Trimmed.

As long as there is no strong demand for either one, we probably do not need to take immediate action.
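
For illustration, the difference between Shifted and Shift-Trimmed described above can be reproduced with a toy model of the quaternary level. This is only a sketch under stated assumptions: it is not ICU, the weights in VARIABLE and primary() are hypothetical placeholders rather than real DUCET values, and secondary and tertiary levels are left out because the sample strings do not differ there. It follows the UCA rule that, under Shifted, a variable character contributes its primary weight as a low quaternary weight while other characters contribute FFFF, and that Shift-Trimmed drops trailing FFFF weights. The proposed Variable-After option is not modeled.

```python
# Toy model of UCA alternate=Shifted and Shift-Trimmed. NOT ICU, not real DUCET data.

# Hypothetical primary weights for the variable characters used in the example.
VARIABLE = {" ": 0x0209, "-": 0x020D}

# Quaternary weight that Shifted assigns to non-variable characters.
HIGH = 0xFFFF


def primary(ch):
    """Hypothetical primary weight for a non-variable character."""
    return 0x1000 + ord(ch)


def sort_key(s, trimmed=False):
    """Return (primary weights, quaternary weights) for string s."""
    # Variable characters are ignored on the primary level.
    primaries = [primary(c) for c in s if c not in VARIABLE]
    # On the quaternary level, variables get their (low) primary weight,
    # everything else gets the high weight FFFF.
    quaternaries = [VARIABLE.get(c, HIGH) for c in s]
    if trimmed:
        # Shift-Trimmed: remove trailing FFFF weights.
        while quaternaries and quaternaries[-1] == HIGH:
            quaternaries.pop()
    return (primaries, quaternaries)


words = ["deluge", "de luge", "deluge-"]
shifted = sorted(words, key=sort_key)
shift_trimmed = sorted(words, key=lambda s: sort_key(s, trimmed=True))

print(shifted)        # ['de luge', 'deluge', 'deluge-']
print(shift_trimmed)  # ['deluge', 'de luge', 'deluge-']
```

In this model Shifted places "de luge" before "deluge" because the space's low quaternary weight is compared against FFFF at the third position, while Shift-Trimmed places "deluge" first because its trimmed quaternary list is empty.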

comment:2 Changed 4 years ago by emmons

  • Owner changed from anybody to markus
  • Status changed from new to assigned

Not high priority - UNSCH for now.

comment:3 Changed 2 years ago by markus

  • Type changed from enhancement to spec
  • Component changed from uca to collation

comment:4 Changed 2 years ago by srl

  • Status changed from assigned to accepted
