From: N. Ganesan (naa.ganesan@gmail.com)
Date: Sat Aug 20 2005 - 10:03:53 CDT
Richard Wordingham wrote:
>I can understand the gripes about 'level-2' v. 'level-1' implementation,
>though. [...]
>How well, though, would the new scheme work if it were allocated non-PUA
>codes in the SMP?
And, Philippe Verdy:
>Although I don't like the idea of publishing new 8-bit charset
>standards, it certainly helps when it allows reducing the number of
>cases to test and support for supporting correctly a script or language.
Tamil does not have conjuncts, and this is unique among
Indian languages. This is because of the action of puLLi,
all Tamil grammars define consonants with puLLi (as mentioned here
few times). Unicode defines abugidas with inherent /a/ as
consonants for Tamil (this is something fairly new, not attested in Tamil texts
and grammar anytime).
So, there are officially 2 bilingual 8-bit encodings:
TSCII and TAB. Both have conversion tables to Unicode Tamil
in Unicode Tech Notes. Tamil virtual university, Madras supports TAB and TAM.
Many yahoogroups etc., work in tscii,
http://www.tamil.net/tscii/charset17.gif
http://www.tscii.org/
Eg. tscii to Unicode mappings:
http://www.tscii.org/IETF/Tamil%20Char%20Names_20050526.pdf
Text conversion From TSCII 1.7 to Unicode (utn-15):
http://www.unicode.org/notes/tn15/
Interested to know about (level-1) SMP codes for
Tamil Virtual University table of Tamil letters,
http://www.infitt.org/minmanjari/issue2_2/mm-unicodetngovt.html
Naga Ganesan
This archive was generated by hypermail 2.1.5 : Sat Aug 20 2005 - 10:05:54 CDT