[Unicode]   Common Locale Data Repository : Bug Tracking Home | Site Map | Search
 
Modify

CLDR Ticket #10201(accepted data)

Opened 4 months ago

Last modified 4 months ago

Danish collation: thorn should be secondary not tertiary

Reported by: markus Owned by: markus
Component: collation Data Locale: da
Phase: rc Review:
Weeks: 0.1 Data Xpath:
Xref:

ticket:10178

Description

The thorn is tertiary-after th (case-like difference) while most of the other variations are secondary-after their regular bases (as usual for ligatures and such). Unless the Danish sorting standard indicates something about the strength of the thorn difference, it should be secondary.

We have &th<<<þ &TH<<<Þ

Customary would be &th<<þ<<<Þ

It is customary for ligatures and such letter-form variations to trump case & font variations. For example, long-s ſ is secondary-after s; sharp-s ß is secondary-after ss; etc.: http://www.unicode.org/charts/collation/chart_Latin.html

Also, a common way to do case-insensitive matching is to set strength=secondary. With the current rules, that makes th match thorn. With the usual secondary difference, they don't match. Of course, if you also ignore accents (strength=primary), they match either way.

Danish has secondary differences for most of the other near-equivalences (y vs. ü, æ vs. ä, ø vs. ö vs. ő).

Note: Danish å vs. aa is currently also only a tertiary difference. We might consider it at the same time as th vs. þ.

Difference in sort orders:
(th uses fullwidth ASCII as one example of other tertiary variants.)

Current thorn with [caseFirst upper]

  • OE
  • oe
  • Œ
  • œ
  • TH
  • Þ
  • th
  • þ
  • th
  • Y
  • y
  • Ü
  • ü

Customary thorn with [caseFirst upper]

  • OE
  • oe
  • Œ
  • œ
  • TH
  • th
  • th
  • Þ
  • þ
  • Y
  • y
  • Ü
  • ü

Attachments

Change History

comment:1 Changed 4 months ago by emmons

  • Owner changed from anybody to markus
  • Phase changed from dsub to rc
  • Priority changed from assess to medium
  • Status changed from new to accepted
  • Milestone changed from UNSCH to 32
View

Add a comment

Modify Ticket

Action
as accepted
Author


E-mail address and user name can be saved in the Preferences.

 
Note: See TracTickets for help on using tickets.