On Fri, Dec 6, 2013 at 3:40 AM, Naz Gassiep <mrnaz_at_hotmail.com> wrote:
>
> I favour using a single method for all things, and so I am attracted to the
> idea of using combining characters for everything. However, language parsing
> tools for languages where those combined characters are used may be fooled
> when presented with U+0061 combined with U+0304 instead of the usual U+0101.
In Unicode the characters with precomposed diacritics are given
"canonical equivalences" to the corresponding sequences of base
characters followed by separate diacritics. So Unicode-compliant
parsing tools should not distinguish between the two.
-- Shriramana Sharma ஶ்ரீரமணஶர்மா श्रीरमणशर्माReceived on Thu Dec 05 2013 - 16:47:00 CST
This archive was generated by hypermail 2.2.0 : Thu Dec 05 2013 - 16:47:01 CST