CLDR Ticket #9771(accepted data)
collation index: add beyond-last-reordered-CJK index labels
|Reported by:||markus||Owned by:||markus|
Most CJK collation tailorings reorder only a subset of the Han ideographs, to limit the data size. Any not-reordered ideograph sorts after the last reordered one.
In a CJK collation index, a not-reordered ideograph shows up in the last CJK index bucket, which is misleading. For example, with the short Chinese stroke order, the two-stroke ideograph 㐅=U+3405 lands in the 48劃 bucket because the short-stroke Han tailoring ends with
<'\uFDD0\u2830' # INDEX 48 <*龘 # 48
I suggest that we add another index label at the end of each of the Han tailorings, maybe something like "?劃".
I have not tried if this would work out of the box with ICU, and I am open to other ideas.
- Owner changed from anybody to markus
- Priority changed from assess to major
- Status changed from new to accepted
- Milestone changed from UNSCH to 30