Re: ZWNJ And Non-spacing Marks from Mark Davis ☕ on 2012-02-27 (Unicode Mail List Archive)

From: Mark Davis ☕ <mark_at_macchiato.com>
Date: Mon, 27 Feb 2012 11:23:19 -0800

In TUS (The Unicode Standard), Chapter 3:
http://www.unicode.org/versions/Unicode6.1.0/ch03.pdf

D56 Combining character sequence: A maximal character sequence consisting
of either a base character followed by a sequence of one or more characters
where each is a combining character, zero width joiner, or zero width
non-joiner; or a sequence of one or more characters where each is a
combining character, zero width joiner, or zero width non-joiner.
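
As a rough illustration of D56 (a sketch, not text from the standard), the
following Python splits a string into combining character sequences. The
helper names is_extender, is_base, and combining_sequences are made up for
this example, and the base-character test is a simplification that assumes
"graphic character" means General Category L, N, P, S, or Zs.

```python
import unicodedata

ZWJ = "\u200d"   # ZERO WIDTH JOINER
ZWNJ = "\u200c"  # ZERO WIDTH NON-JOINER

def is_extender(ch):
    """True for combining marks (General Category M*) and for ZWJ/ZWNJ."""
    return ch in (ZWJ, ZWNJ) or unicodedata.category(ch).startswith("M")

def is_base(ch):
    """Simplified base-character test: graphic and not a combining mark."""
    cat = unicodedata.category(ch)
    return cat[0] in "LNPS" or cat == "Zs"

def combining_sequences(text):
    """Return (base_or_None, extenders) for each maximal sequence in text.

    A base character with nothing after it is not reported, because D56
    requires at least one combining character, ZWJ, or ZWNJ.
    """
    result = []
    i, n = 0, len(text)
    while i < n:
        base = None
        if is_base(text[i]):
            base = text[i]
            i += 1
        j = i
        while j < n and is_extender(text[j]):
            j += 1
        if j > i:
            result.append((base, text[i:j]))
            i = j
        elif base is None:
            i += 1  # neither base nor extender (e.g. a control character)
    return result

# a + ZWNJ + COMBINING ACUTE ACCENT is a single sequence whose base is 'a'.
print(combining_sequences("a\u200c\u0301"))
```

Under these assumptions the ZWNJ does not end the sequence: the acute still
belongs to the sequence that starts at 'a'.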

...

So 'a' is the base character for the acute. The clauses with zwj/nj are
really designed for Indic and similar scripts. The rendering isn't
specified for other cases, but where the zwj/nj are not defined to have an
effect, they *should* be ignored for normal rendering. That being said, this
is such an edge case that I don't think the 'should' is enough to jump
through hoops for.
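
To make "ignored for normal rendering" concrete, here is a minimal Python
sketch of a pre-rendering pass. It is only an illustration: the rule in
joiners_have_no_effect (joiners are inert after Latin letters) is a
hypothetical stand-in for whatever a real renderer knows about each script,
and both function names are invented for this example.

```python
import unicodedata

JOINERS = {"\u200c", "\u200d"}  # ZWNJ, ZWJ

def joiners_have_no_effect(base):
    """Hypothetical policy: treat joiners as inert after Latin letters."""
    return unicodedata.name(base, "").startswith("LATIN")

def strip_inert_joiners(text):
    """Drop ZWJ/ZWNJ wherever the policy says they have no defined effect."""
    out = []
    base = None  # most recent character that is neither joiner nor mark
    for ch in text:
        if ch in JOINERS:
            if base is not None and joiners_have_no_effect(base):
                continue  # ignore the joiner for rendering purposes
        elif not unicodedata.category(ch).startswith("M"):
            base = ch
        out.append(ch)
    return "".join(out)

latin = "a\u200c\u0301"            # a + ZWNJ + COMBINING ACUTE ACCENT
deva = "\u0915\u094d\u200c\u0937"  # KA + VIRAMA + ZWNJ + SSA

# With the joiner dropped, NFC can compose a + acute into U+00E1.
composed = unicodedata.normalize("NFC", strip_inert_joiners(latin))
print(composed == "\u00e1")

# The Devanagari sequence keeps its ZWNJ, where it is meaningful.
print(strip_inert_joiners(deva) == deva)
```

Without the stripping step, NFC leaves a + ZWNJ + acute unchanged, since the
ZWNJ sits between the base and the mark.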

------------------------------
Mark <https://plus.google.com/114199149796022210033>
*— Il meglio è l’inimico del bene —*

On Mon, Feb 27, 2012 at 09:37, Eric Mader <emader_at_icu-project.org> wrote:

> Hello,
>
> I searched the list archives and didn't find anything that addressed this
> exact issue. If I see a sequence like a + ZWNJ + ACUTE, should it be
> rendered as a followed by an acute accent over a dotted circle, or should
> it be rendered as A-ACUTE?
>
> (The actual case I hit was in Devanagari, but it seems to me that the
> question applies to all scripts with non-spacing marks)
>
> Regards,
> Eric Mader
>
Received on Mon Feb 27 2012 - 13:27:37 CST

This archive was generated by hypermail 2.2.0 : Mon Feb 27 2012 - 13:27:38 CST