From: Ed Trager (ed.trager@gmail.com)
Date: Tue Apr 21 2009 - 15:25:31 CDT
Hi, Ankur,
(1) As Shawn Steele noted, you should use Unicode (UTF-8 or UTF-16).
If this is a web application, UTF-8 is probably the best choice (it is
what Wikipedia uses). For thick clients, UTF-16 is often used, but that
depends on your OS and application framework.
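As a rough illustration of point (1), here is a minimal Python sketch that encodes a short Kazakh string both ways. The sample word is my own choice for illustration, and "utf-16-le" is just one of the UTF-16 byte orders; a given framework may expect a different one.

```python
# Illustrative sample text (an assumption, not taken from any real data set).
sample = "Қазақстан"

utf8_bytes = sample.encode("utf-8")
# An explicit byte order is used here so that no byte order mark is written.
utf16_bytes = sample.encode("utf-16-le")

print(len(sample), "characters")
print(len(utf8_bytes), "bytes as UTF-8")
print(len(utf16_bytes), "bytes as UTF-16")

# Both encodings round-trip the text without loss.
assert utf8_bytes.decode("utf-8") == sample
assert utf16_bytes.decode("utf-16-le") == sample
```

For purely Cyrillic text the two encodings come out about the same size, so the choice is mostly about what the platform and framework expect rather than about storage.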
(2) The Letter Database at the Institute of the Estonian Language is a
great resource for determining which letters are needed for specific
Latin and, in your case, Cyrillic orthographies:
http://www.eki.ee/index.html.en
http://www.eki.ee/letter/
For example, here's the query for Kazakh using Cyrillic:
http://www.eki.ee/letter/chardata.cgi?lang=_kazakh&script=cyrillic
This information can be useful in choosing the right fonts, setting up
CSS font directives, etc.
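Once you know which letters an orthography needs, a small script can list their code points so you can check them against a font's coverage. The sketch below is only an illustration: the letter list is my assumption of the Kazakh Cyrillic letters that fall outside the basic Russian alphabet (verify it against the Letter Database query above), and describe() and missing_from() are hypothetical helper names.

```python
import unicodedata

# Assumed list of Kazakh Cyrillic letters beyond the basic Russian set,
# given as code points (upper and lower case pairs).
KAZAKH_EXTRA = "".join(chr(cp) for cp in (
    0x04D8, 0x04D9,  # schwa
    0x0492, 0x0493,  # ghe with stroke
    0x049A, 0x049B,  # ka with descender
    0x04A2, 0x04A3,  # en with descender
    0x04E8, 0x04E9,  # barred o
    0x04B0, 0x04B1,  # straight u with stroke
    0x04AE, 0x04AF,  # straight u
    0x04BA, 0x04BB,  # shha
    0x0406, 0x0456,  # Byelorussian-Ukrainian i
))

def describe(chars):
    """Return (character, 'U+XXXX', Unicode name) for each character."""
    return [(ch, f"U+{ord(ch):04X}", unicodedata.name(ch)) for ch in chars]

def missing_from(chars, codec):
    """Return the characters that the named codec cannot encode."""
    missing = []
    for ch in chars:
        try:
            ch.encode(codec)
        except UnicodeEncodeError:
            missing.append(ch)
    return missing

for ch, code_point, name in describe(KAZAKH_EXTRA):
    print(ch, code_point, name)

# Unmodified code page 1251 lacks most of these letters.
print("Not in cp1251:", "".join(missing_from(KAZAKH_EXTRA, "cp1251")))
```

The same list of code points can then be compared with the character coverage of whichever fonts you are considering.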
Best - Ed
On Tue, Apr 21, 2009 at 11:28 AM, Ankur Mathur <mathur.ankur@gmail.com> wrote:
>
> Hi All,
>
> I need to implement support for Kazakh cyrillic alphabet.
> As per wikipedia ( http://en.wikipedia.org/wiki/Kazakh_alphabet ), there are two standard 8 bit character sets for this alphabet:
>
> CT PK 920-91 for DOS (a modification of code page 866)
> CT PK 1048—2002 for Windows (a modification of code page 1251)
>
> Now I couldn't find any more detail online about these two character sets.
> Please help me know, which all char sets should I support for this alphabet. And if the above char sets are the standard ones, where can I find their mapping tables (with unicode code points) ?
>
> Thanks in advance
> Ankur Mathur
This archive was generated by hypermail 2.1.5 : Tue Apr 21 2009 - 15:27:32 CDT