From: Frédéric Grosshans (frederic.grosshans@m4x.org)
Date: Mon Oct 25 2010 - 08:03:50 CDT
On Monday, 25 October 2010 at 14:08 +0300, Arns Udovīčė wrote:
> Hello all,
>
> On 2010-06-30 a change became effective for Samogitian
> (http://www.sil.org/iso639-3/documentation.asp?id=sgs). There is one
> letter which has many rendering problems. It is E/e with dot
> above and macron [...] You can see it in: atvīkė̄ or Vielībė̅jė. On
> Linux it is shown (in different fonts) as „ė“ with the macron between
> the dot and the letter, or „ė“ with a minus sign next to the letter (see attachment).
On my (Linux) system, it renders correctly.
> [...] Trying to use it in IT is complicated.
> What should I do to get this letter into Unicode? As I read in
> http://www.unicode.org/pending/proposals.html it is better to discuss
> it on this list first.
First, Unicode stopped encoding precomposed Latin letters with diacritics
a long time ago (Unicode 4.1?). I have the feeling this should be documented
in http://www.unicode.org/faq/char_combmark.html#9 , but I've been
unable to find a clear statement there. The main reason is that adding such
characters would break normalization stability
( http://www.unicode.org/policies/stability_policy.html#Normalization ),
causing security problems (see http://www.unicode.org/reports/tr36/).
That's why, for example, Lithuanian letters such as LATIN CAPITAL
LETTER A WITH OGONEK AND ACUTE (U+0104 U+0301) are not encoded as single
Unicode characters but as named sequences.
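To illustrate the normalization point, here is a minimal sketch in Python, using only the standard unicodedata module. The helper name describe is my own, and the exact output depends on the Unicode data bundled with your Python build, so treat it as an illustration rather than a reference.

```python
import unicodedata

def describe(text):
    """Return (code point, character name) pairs for each code point in text."""
    return [(f"U+{ord(ch):04X}", unicodedata.name(ch, "<unnamed>")) for ch in text]

# e + COMBINING DOT ABOVE + COMBINING MACRON, fully decomposed
decomposed = "e\u0307\u0304"

# NFC composes only as far as the existing precomposed repertoire allows
nfc = unicodedata.normalize("NFC", decomposed)

for code, name in describe(nfc):
    print(code, name)

# The Lithuanian example: A + COMBINING OGONEK + COMBINING ACUTE ACCENT
a_ogonek_acute = unicodedata.normalize("NFC", "A\u0328\u0301")
print([code for code, _ in describe(a_ogonek_acute)])
```

In both cases NFC stops at a two-code-point sequence (U+0117 U+0304 and U+0104 U+0301), because no single precomposed character exists. As I understand the stability policy, text that is normalized today must stay normalized, so a precomposed letter added later could not become the NFC form of these sequences.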
You may ask for *LATIN {CAPITAL,SMALL} LETTER E WITH DOT ABOVE
AND MACRON to be registered as a named sequence (see
http://www.unicode.org/faq/named_sequences.html#3 and following), but
I'm not sure it stands much of a chance.
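For what it's worth, named sequences can be queried programmatically. The sketch below assumes Python 3.3 or later, where unicodedata.lookup also accepts named sequences. The helper named_sequence is hypothetical, and the E WITH DOT ABOVE AND MACRON name is the not-yet-existing one suggested above, so I expect it to be reported as unknown.

```python
import unicodedata

def named_sequence(name):
    """Return the string for a Unicode character name or named sequence, or None if unknown."""
    try:
        return unicodedata.lookup(name)
    except KeyError:
        # The name is absent from the Unicode database bundled with this Python build
        return None

existing = named_sequence("LATIN CAPITAL LETTER A WITH OGONEK AND ACUTE")
# Hypothetical name: only resolvable if such a sequence is ever registered
proposed = named_sequence("LATIN SMALL LETTER E WITH DOT ABOVE AND MACRON")

for label, value in (("existing", existing), ("proposed", proposed)):
    if value is None:
        print(label, "-> not in this Unicode database")
    else:
        print(label, "->", " ".join(f"U+{ord(ch):04X}" for ch in value))
```

Whether a given name resolves depends on the Unicode version your Python ships with, which is why the helper returns None instead of raising.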
Frédéric
--
Frédéric Grosshans
Chargé de Recherche
Laboratoire de Photonique Quantique et Moléculaire
ENS Cachan / CNRS UMR 8437
tel: (+33)1 47 40 77 15
GSM: (+33)6 09 24 29 64
e-mail: frederic.grosshans@ens-cachan.fr