From: Kent Karlsson (kentk@cs.chalmers.se)
Date: Thu Jul 03 2003 - 09:50:05 EDT
(Disregarding your netiquette breach of quoting an off-list message on
a list...)
> Unicode characters can be said "deprecated", or strongly discouraged
> howeer they are still valid, and then it's best to describe 
> what should be
> their correct behavior. My question was there only for completeness,
> something that the Public Review Issues is supposed to enhance and
> document officially, even for "deprecated" characters.
There is no point in doing undue work for characters that shouldn't be
used
(whether deprecated or not).
> > The (typographic) dot(s) above should be removed if there is a
> > combining character of class 230 [centred above] in a combining
> > sequence starting with a soft-dotted character.  The file
> > UCD-4.0.0.html only says "An accent placed on these characters...";
> > but the "on" here should be interpreted as "class 230".  That could
> > be clarified. 
> 
> Thanks for admitting that the current description may easily 
> be misread
> as meaning "any diacritic". 
Well, it was not my formulation.  I've always referred to "(current)
combining class 230".  I might not have written it explicitly every time
though.
> With such misreading, a simple 
> font renderer
> may just check the presence of the first diacritic to use a 
> dotless glyph,
Not likely, though.  I do think typographers are smart too. ;-)
> I would like to have exact comments of what "on" means: does it *only*
> refer to the class 230? What is the impact of format controls inserted
> in a combining sequence, 
They break the combining sequence.  Applying a combining character
to a format control is, while legal, not something that has a
well-defined
behaviour.
...
> - Hangul syllables are very well defined
Hangul is a problem case.  But I will not go into that here and now.
                /kent k
This archive was generated by hypermail 2.1.5 : Thu Jul 03 2003 - 10:58:03 EDT