Re: [OT] o-circumflex

From: Asmus Freytag (asmusf@ix.netcom.com)
Date: Sat Sep 08 2001 - 20:55:45 EDT


At 02:45 PM 9/8/01 -0700, Mark Davis wrote:
>If you use a Danish tailoring of the UCA that equates Å and AA (at least at
>a primary and secondary level), then they will sort the same way. A string
>search that uses the same tailoring will also find "Ålborg" when given
>"Aalborg" (and vice versa).

But if you do this, all compound words starting with "data" and continuing
with another word starting with "a" will be sorted incorrectly!

To achieve this effect, you would have to mark which AAs are A-Rings and
which ones are accidental adjacencies. In Danish one can use the SHY (soft
hyphen) to break the latter, as these accidental pairs occur at legal word
break points. In fact, that's the recommended solution, but it requires
that the input data are in a sepecific form.

A./



This archive was generated by hypermail 2.1.2 : Sat Sep 08 2001 - 21:47:52 EDT