Re: missing characters: combining marks above runs of more than 2 base letters

From: Kent Karlsson <kent.karlsson14_at_telia.com>
Date: Sun, 20 Nov 2011 22:43:19 +0100

Den 2011-11-20 20:50, skrev "Peter Constable" <petercon_at_microsoft.com>:

> Note that UTR 20 discusses semantic and presentation effects that are suitable
> for representation as characters versus markup and makes the point that, in
> XML, effects that involve spans of text should be represented using markup
> rather than characters that set and unset state. Those are, of course,
> recommendations about a markup language, not plain text. But the argument used
> works in both directions: things that involve spans of text are best handled
> as markup, while things that are very local (e.g. spanning no more than a
> grapheme cluster) may be more suitable for representation as characters.

And yet we have, apart from bidi controls, characters whose effect in
various ways spans several other characters/grapheme clusters:

0600;ARABIC NUMBER SIGN;Cf;0;AN;;;;;N;;;;;
0601;ARABIC SIGN SANAH;Cf;0;AN;;;;;N;;;;;
0602;ARABIC FOOTNOTE MARKER;Cf;0;AN;;;;;N;;;;;
0603;ARABIC SIGN SAFHA;Cf;0;AN;;;;;N;;;;;
0604;ARABIC SIGN SAMVAT;Cf;0;AN;;;;;N;;;;;

06DD;ARABIC END OF AYAH;Cf;0;AN;;;;;N;;;;;

070F;SYRIAC ABBREVIATION MARK;Cf;0;AL;;;;;N;;;;;

FFF9;INTERLINEAR ANNOTATION ANCHOR;Cf;0;ON;;;;;N;;;;;
FFFA;INTERLINEAR ANNOTATION SEPARATOR;Cf;0;ON;;;;;N;;;;;
FFFB;INTERLINEAR ANNOTATION TERMINATOR;Cf;0;ON;;;;;N;;;;;

110BD;KAITHI NUMBER SIGN;Cf;0;L;;;;;N;;;;;

1D173;MUSICAL SYMBOL BEGIN BEAM;Cf;0;BN;;;;;N;;;;;
1D174;MUSICAL SYMBOL END BEAM;Cf;0;BN;;;;;N;;;;;
1D175;MUSICAL SYMBOL BEGIN TIE;Cf;0;BN;;;;;N;;;;;
1D176;MUSICAL SYMBOL END TIE;Cf;0;BN;;;;;N;;;;;
1D177;MUSICAL SYMBOL BEGIN SLUR;Cf;0;BN;;;;;N;;;;;
1D178;MUSICAL SYMBOL END SLUR;Cf;0;BN;;;;;N;;;;;
1D179;MUSICAL SYMBOL BEGIN PHRASE;Cf;0;BN;;;;;N;;;;;
1D17A;MUSICAL SYMBOL END PHRASE;Cf;0;BN;;;;;N;;;;;

Ok, Ruby does already have XHTML/(HTML5) markup that seems better.

    /Kent K
 
> Peter
Received on Sun Nov 20 2011 - 15:46:24 CST

This archive was generated by hypermail 2.2.0 : Sun Nov 20 2011 - 15:46:25 CST