From: John Cowan (jcowan@reutershealth.com)
Date: Tue Oct 28 2003 - 15:35:19 CST
jim scripsit:
> Unicode encodes U+1E20 and U+1E21 as combinations of lower and uppercase
> _g_ with macron. The forms have canonical decomposition to _g_ or _G_
> followed by U+0304. This seems to rule out being able to consider a bar
> above and a bar below as variants of the same character within Unicode.
True. But that doesn't mean that the glyph that a particular font uses for
the sequence <g, COMBINING MACRON BELOW> can't have the bar above the g.
This is a pure rendering question.
> IPA specifications also indicate that U+0325 COMBINING RING BELOW and
> some other diacritics normally placed beneath a character may instead be
> displayed above a character for typographical reasons.
So a smart IPA-specific font could render <g, U+0325> with the ring above.
> But Unicode specifications currently say nothing about the possibility
> of moving under-diacritics to an over-character position for
> typographical reasons except for combination of _g_ and cedilla.
Nothing needs to be said, because glyphs are not normative.
> Perhaps we need instead special search folding between upper position
> and lower position diacritics that are otherwise identical in form,
> e.g. between U+0304 (COMBINING MACRON) and U+0331 (COMBINING MACRON
> BELOW), between U+ 030A (COMBING RING ABOVE) and U+0325 (COMBINING RING
> BELOW) and so forth for any diacritics where an upper form and a lower
> form may have the same meaning.
That also can be done as a collation tailoring.
-- Principles. You can't say A is John Cowan <jcowan@reutershealth.com> made of B or vice versa. All mass http://www.reutershealth.com is interaction. --Richard Feynman http://www.ccil.org/~cowan
This archive was generated by hypermail 2.1.5 : Thu Jan 18 2007 - 15:54:25 CST