CLDR Ticket #6326(accepted data)
review almost-after-last-Latin collation tailorings
|Reported by:||markus||Owned by:||markus|
We should review all of the collation tailorings that appear to try to tailor near "the last character in the Latin script" or maybe "before the Latin clicks" or similar.
If there remain cases where we do want to tailor near "the last character in the Latin script", then we should add a CLDR release task, for after any Unicode release, to look for the currently-last one and see if the tailorings need to be updated to a newly-later-sorting character.
- They may need to be updated
- Because of them, we have wanted to add syntax like &[last Latn], see ticket:5549 and ticket:2821.
For example, sv.xml has &[before 1]ǀ<å<<<Å<ä ... which currently puts å etc. between U+0296 inverted glottal stop and U+01C0 dental click. There are currently 7 more Latin characters after the dental click.
It is not clear whether it really makes sense to tailor near "the last character in the Latin script no matter what is there". Why not tailor to a specific character? And why tailor primary-before something near the end of the Latin script, rather than after? Would we want this only for Latin, or also for other scripts?
- Owner changed from anybody to markus
- Priority changed from assess to medium
- Status changed from new to assigned
- Milestone changed from UNSCH to 24rc
- Xref changed from 5549 2821 to 5549 2821 5710
- Milestone changed from 24rc to 25dsub