Re: Diacritical marks: Single character or combined character?

From: Shriramana Sharma <samjnaa_at_gmail.com>
Date: Fri, 6 Dec 2013 04:15:07 +0530

On Fri, Dec 6, 2013 at 3:40 AM, Naz Gassiep <mrnaz_at_hotmail.com> wrote:
>
> I favour using a single method for all things, and so I am attracted to the
> idea of using combining characters for everything. However, language parsing
> tools for languages where those combined characters are used may be fooled
> when presented with U+0061 combined with U+0304 instead of the usual U+0101.

In Unicode the characters with precomposed diacritics are given
"canonical equivalences" to the corresponding sequences of base
characters followed by separate diacritics. So Unicode-compliant
parsing tools should not distinguish between the two.

-- 
Shriramana Sharma ஶ்ரீரமணஶர்மா श्रीरमणशर्मा

Received on Thu Dec 05 2013 - 16:47:00 CST

This archive was generated by hypermail 2.2.0 : Thu Dec 05 2013 - 16:47:01 CST