In TUS, in http://www.unicode.org/versions/Unicode6.1.0/ch03.pdf
D56 Combining character sequence: A maximal character sequence
consisting of either a
base character followed by a sequence of one or more characters where each
is a
combining character, zero width joiner, or zero width non-joiner; or a
sequence of one or more characters where each is a combining character,
zero
width joiner, or zero width non-joiner.
...
So 'a' is the base character for the acute. The clauses with zwj/nj are
really designed for Indic and similar scripts. The rendering isn't
specified for other cases, but where the zwj/nj are not defined to have an
effect, they should* be ignored for normal rendering. That being said, this
is such an edge case that I don't think the 'should' is enough to jump
through hoops for.
------------------------------
Mark <https://plus.google.com/114199149796022210033>
*
*
*— Il meglio è l’inimico del bene —*
**
On Mon, Feb 27, 2012 at 09:37, Eric Mader <emader_at_icu-project.org> wrote:
> Hello,
>
> I searched the list archives and didn't find anything that addressed this
> exact issue. If I see a sequence like a + ZWNJ + ACUTE, should it be
> rendered as a followed by an acute accent over a dotted circle, or should
> it be rendered as A-ACUTE?
>
> (The actual case I hit was in Devanagari, but it seems to me that the
> question applies to all scripts with non-spacing marks)
>
> Regards,
> Eric Mader
>
>
>
Received on Mon Feb 27 2012 - 13:27:37 CST
This archive was generated by hypermail 2.2.0 : Mon Feb 27 2012 - 13:27:38 CST