Re: AbstractCharacter class

From: Kenneth Whistler (kenw@sybase.com)
Date: Tue Jul 29 1997 - 13:35:33 EDT


John Cowan wrote, and Jörg Knappen replied:
>
> > Thus LATIN CAPITAL LETTER O plus COMBINING DOT BELOW plus
> > COMBINING CIRCUMFLEX BELOW plus COMBINING CIRCUMFLEX (to make
> > up an example) can be reduced to LATIN CAPITAL LETTER O WITH
> > CIRCUMFLEX AND DOT BELOW (U+1ED8) plus COMBINING CIRCUMFLEX BELOW,
> > but if DOT BELOW comes after CIRCUMFLEX BELOW, the shortest reduction
> > is to LATIN CAPITAL LETTER O WITH CIRCUMFLEX plus COMBINING DOT
> > BELOW plus COMBINING CIRCUMFLEX BELOW.
>
>
> Hmm... I think first of all, a canonical order of all combining marks is
> needed. The combining marks fall into three classes: strike-through, below,
> and above (maybe those wide combining marks form a fourth class). Note that
> you cannot reorder the combining marks within one class without changing
> the character:

Rick McGowan noted:

> There already is such a canonical order, and it's documented in the
> book; and the data is available via FTP.

In particular, the canonical ordering algorithm is given in Section 3.9, pp. 3-9 to
3-11, and the lists of combining classes used by that algorithm can be found
in Section 4.2, pp. 4-2 to 4-10.
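
For anyone who wants to see the reordering at work without the book at hand,
here is a minimal sketch in Python. It is not the text of Section 3.9, only an
illustration of the idea: within each run of combining marks, do a stable sort
by combining class. The function name canonical_order is my own invention, and
the classes come from Python's unicodedata module, which tracks a later version
of the character database than the book does; I am assuming the classes of the
three marks from the example above are the same in both.

```python
import unicodedata


def canonical_order(text: str) -> str:
    """Stable-sort every run of combining marks by combining class.

    Hypothetical helper for illustration; characters with class 0
    (starters) are never moved and end the current run.
    """
    out = []
    run = []  # pending marks with a non-zero combining class
    for ch in text:
        if unicodedata.combining(ch) == 0:
            # sorted() is stable, so marks of equal class keep their order
            out.extend(sorted(run, key=unicodedata.combining))
            run = []
            out.append(ch)
        else:
            run.append(ch)
    out.extend(sorted(run, key=unicodedata.combining))
    return "".join(out)


DOT_BELOW = "\u0323"         # COMBINING DOT BELOW, class 220
CIRCUMFLEX_BELOW = "\u032D"  # COMBINING CIRCUMFLEX ACCENT BELOW, class 220
CIRCUMFLEX = "\u0302"        # COMBINING CIRCUMFLEX ACCENT, class 230

# Different classes: the below mark (220) moves ahead of the above mark (230).
reordered = canonical_order("O" + CIRCUMFLEX + DOT_BELOW)

# Same class: the two below marks stay in the order they were typed.
unchanged = canonical_order("O" + CIRCUMFLEX_BELOW + DOT_BELOW)
```

The second case is the one Jörg is pointing at: two marks in the same class are
never swapped, so DOT BELOW before CIRCUMFLEX BELOW and CIRCUMFLEX BELOW before
DOT BELOW remain distinct sequences after ordering. In Python,
unicodedata.normalize("NFD", s) applies the same reordering after decomposing.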

--Ken


