Alternative sorting for digraphs (Was Re: [OT] o-circumflex)

From: Mark Davis (mark@macchiato.com)
Date: Mon Sep 10 2001 - 12:07:31 EDT


A SHY means that the word can be hyphenated as "Bei-jing" at a line end.
It is not clear, to me at least, that this is safe in all cases for all
languages with digraphs that sort separately, although it may be a solution
for some.

A ZWNJ will break ligatures and cursive connections. While probably safe in
Danish or Dutch, it is unclear to me that this is safe in all languages
where this situation occurs. There are digraphs in Urdu, for example. While
I don't know their sorting order, if they do sort separately then ZWNJ can't
be used to express the alternative sorting, since it would give the wrong
rendering.
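
To make the sorting side of this concrete, here is a toy sketch in Python.
It assumes a hypothetical Dutch-style rule in which a contiguous "ij" sorts
as "y", and it assumes that SHY (U+00AD) and ZWNJ (U+200C) are otherwise
ignored for sorting. The names toy_key and IGNORABLE are invented for this
illustration and belong to no real collation library; actual implementations
may treat these characters differently. The sketch says nothing about
rendering or line breaking, which is where the concerns above lie.

```python
SHY = "\u00ad"    # SOFT HYPHEN
ZWNJ = "\u200c"   # ZERO WIDTH NON-JOINER
IGNORABLE = {SHY, ZWNJ}  # assumption: both are skipped when sorting


def toy_key(word):
    """Toy sort key: a contiguous 'ij' is one unit that sorts as 'y'.

    An intervening SHY or ZWNJ keeps 'i' and 'j' apart, so the digraph
    rule does not fire and the two letters sort separately.
    """
    s = word.lower()
    out = []
    i = 0
    while i < len(s):
        if s.startswith("ij", i):
            out.append("y")      # digraph rule applies
            i += 2
        elif s[i] in IGNORABLE:
            i += 1               # contributes nothing to the key
        else:
            out.append(s[i])
            i += 1
    return "".join(out)


words = ["Bezet", "Beijing", "Bei" + ZWNJ + "jing", "Bever"]
ordered = sorted(words, key=toy_key)
```

Under these assumptions the plain spelling gets the key "beying", while the
spellings with SHY or ZWNJ get "beijing" and so sort ahead of "Bever".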

Mark
—————

Πόλλ’ ἠπίστατο ἔργα, κακῶς δ’ ἠπίστατο πάντα — Όμήρου Μαργίτῃ
[http://www.macchiato.com]
----- Original Message -----
From: "John Wilcock" <john@tradoc.fr>
To: <unicode@unicode.org>
Sent: Monday, September 10, 2001 8:39 AM
Subject: Re: [OT] o-circumflex

> On Mon, 10 Sep 2001 16:42:45 +0200, Keld Jørn Simonsen wrote:
> > But maybe you are driving for a yet more complex sorting, one that can
> > sort according to multiple rules? Beijing should then not be sorted as
> > Beÿing?
>
> I haven't followed this discussion from the beginning, so apologies if
> I'm missing the point, but it seems to me that the Beijing case in
> Dutch is no different from the ekstraarbejde case in Danish - a SHY or
> ZWNJ is all that is needed to stop Beijing sorting with Bey.
>
>
> John.
>
> --
> -- Over 1500 webcams from ski resorts around the world - http://www.snoweye.com/
> -- Translate your technical documents and web pages - http://www.tradoc.fr/
>
>


