Re: Transliteration of Arabic characters into English

From: Graeme E. Coutts (gcoutts@philippholzmann.com)
Date: Fri May 12 2000 - 06:03:00 EDT


Thanks for all the info concerning my original question on Arabic
transliteration. One final question please (bear with me..I'm new at this
Unicode)...

To generate the CAPITAL Z WITH CEDILLA, I must use LATIN CAPITAL LETTER Z
(U+005A) + COMBINING CEDILLA (U+0327). That fine for file storage, but I
will need to generate a glyph for display purposes showing the combination
character. I can easily create the combined character glyph with my font
package - my question is what code can I give this glyph. Is there a special
area reserved for this purpose?

Regards,

Graeme Coutts.

----- Original Message -----
From: "Brendan Murray/DUB/Lotus" <brendan_murray@lotus.com>
To: "Unicode List" <unicode@unicode.org>
Sent: Thursday, May 11, 2000 9:34 PM
Subject: Re: Transliteration of Arabic characters into English

>
> Just to keep this in the unicode.org arena....
>
> Vladas Tumasonis <vladas.tumasonis@maf.vu.lt>wrote:
> >Brendan Murray/DUB/Lotus wrote:
> >>
> >> "Graeme E. Coutts" <gcoutts@philippholzmann.com>wrote:
> >> > I am working on a project which involves the transliteration of
Arabic
> >> Geographic Names into English text.
> >> See the ICU project - there's some Arabic/Latin transliteration already
> >> done there. It's on http://oss.software.ibm.com/icu.
> >>
> >> >However, there is one character + diacritical combination that I have
> >> failed to find a code for:
> >> In general, you can encode an accented character using the base
> character +
> >> the combining accent. For example, in the case of CAPITAL Z WITH
> CEDILLA,
> >> you can generate this using LATIN CAPITAL LETTER Z (U+005A) + COMBINING
> >> CEDILLA (U+0327).
> >>
> >> B=
> >
> > It is possible? CAPITAL Z WITH CEDILLA has not own Unicode code. So this
> > letter should be expressed by composite sequence as you wrote (and the
> > glyph should be generated in rendering). But as I know there is no yet
> > implementation of this feature, there is no text processor supporting
> > composite sequences. O may be there is some possibility?
>
> Actually, very many products already process combining characters
> correctly.You might consider upgrading whatever you're using, if you need
> support for such characters. For example, there is currently talk of
> encoding the CopyLeft symbol in Unicode. Just as an excercise, we found
> that this non-existant character could easily be displayed in Lotus 1-2-3,
> using the currently encoded characters.
>
> > It is very very important for Lithuania because Lithuanian language has
> > 35 accented letters (latin letters with acute accent, grave accent and
> > tilde) that have not UCS codes. And we can not enter, process and
> > display (print) such letters using multibyte technology.
>
> What Lithaunian character is not in Unicode? I think you'll find that
> you're mistaken - perhaps you're confusing Unicode with ASCII.
>
> >
> > Sincerely,
> > Vladas Tumasonis
> >
> > Vladas Tumasonis, associate professor
> > Department of Mathematics and Computer Science, Vilnius University
> > Naugarduko Str. 24, LT-2600 Vilnius, Lithuania
> > Ph.: +370-2-336035 Fax: +370-2-251585
> > E-mail: vladas.tumasonis@maf.vu.lt
>
>
>
>



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:21:02 EDT