Unicode Processing Requirements for Tamil (was: 28th IUC paper - Tamil Unicode New)

From: Richard Wordingham (richard.wordingham@ntlworld.com)
Date: Wed Aug 31 2005 - 18:53:48 CDT

  • Next message: Anto'nio Martins-Tuva'lkin: "Re: Still OT, Re: VCR technical symbols?"

    John Hudson wrote on Monday, August 22, 2005

    > If you want something supported, you have to take it through the standards
    > process and
    > get it approved as part of Unicode or another standard that the software
    > company in
    > question is committed to supporting. If the behaviour you want to see for
    > Devanagari becomes part of Unicode's processing requirements for that
    > script, then you can expect Microsoft to support it.

    What should one do to get superscript (and ideally also subscript) digits
    supported in Tamil text? Section 9.6 Paragraph 2 of the Unicode Standard
    (from 4.0) says,
    'When representing the “missing” consonants in transcriptions of languages
    such as Sanskrit or Saurashtra, superscript European digits are often used,
    so ப² = pha, ப³ = ba, and ப⁴ = bha. The characters U+00B2, U+00B3, and
    U+2074 can be used to preserve this distinction in plain text.'

    However, combinations such as பெ⁴ௗ /bhau/ U+0BAA U+0BC6 U+2074 U+0BD7 and
    பெ₄ௗ /bhau/ U+0BAA U+0BC6 U+2084 U+0BD7 do not render properly on Windows
    XP - the dotted circle appears before the final element of the compound
    vowel.

    How would you recommend the Unicode Standard be strengthened so that
    Microsoft feels obliged to support the superscipts and subscripts in
    combination with non-conjoined follwoing vowels? I think mention of
    subscripts should be added, and it would be good to know whether subscript
    and superscript '1' are used.

    Richard.



    This archive was generated by hypermail 2.1.5 : Wed Aug 31 2005 - 18:59:08 CDT