Re: Unicode->ASCII approximate conversion

From: António Martins-Tuválkin (antonio@tuvalkin.web.pt)
Date: Tue Jan 06 2004 - 14:27:57 EST

    On 2003.12.19, 13:21, D. Starner <shalesller@writeme.com> wrote:

    > 0108 - "CH" (Esperanto)
    > 011C, 011D - "GH", "gh" (E-o)
    > 0124, 0125 - "HH", "hh" (")
    > 0134, 0135 - "JH", "jh" (")
    > 015C, 015D - "SH", "sh" (")

    Actually, most of us use "x" instead of "h" as a fallback in poor media.
    If you are terribly interested, see < http://www.esperanto.web.pt/cxapelo1.htm >.
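
    A minimal sketch of that "x" convention in Python, for illustration only.
    The name to_x_system and the table X_SYSTEM are hypothetical; the table
    covers the code points in the quoted list and also adds U+0109 and
    U+016C/U+016D, which the quoted list does not mention. Capitals map to
    "CX" etc. to mirror the all-caps style of the quoted list; "Cx" would be
    an equally plausible choice.

```python
# Hypothetical mapping from Esperanto circumflex/breve letters to the "x" fallback.
# U+0109 and U+016C/U+016D are additions that the quoted list does not mention.
X_SYSTEM = {
    "\u0108": "CX", "\u0109": "cx",  # C with circumflex
    "\u011C": "GX", "\u011D": "gx",  # G with circumflex
    "\u0124": "HX", "\u0125": "hx",  # H with circumflex
    "\u0134": "JX", "\u0135": "jx",  # J with circumflex
    "\u015C": "SX", "\u015D": "sx",  # S with circumflex
    "\u016C": "UX", "\u016D": "ux",  # U with breve
}

# str.maketrans accepts a dict of single characters to replacement strings.
_TABLE = str.maketrans(X_SYSTEM)


def to_x_system(text: str) -> str:
    """Replace each accented Esperanto letter with base letter + 'x'."""
    return text.translate(_TABLE)


print(to_x_system("ĉapelo"))  # cxapelo
```

    Unlike the "h" digraphs, "x" is not a letter of the Esperanto alphabet,
    so the mapping cannot collide with ordinary spelling.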

    > Depending on your goals, 015F & 0161 could be "sh", 0163 "ts", 017D
    > "zh", etc.

    This business of creating a sensible and general "diacritic stripping"
    or fallback scheme for 7-bit ASCII is very fishy. As we say in
    Portuguese, "Minha ama foi à caça" ("Milady went hunting") is not "Minha
    ama foi a caca" ("Milady was the sh*t")...
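
    The problem is easy to reproduce. The sketch below assumes the most
    naive scheme, decomposing to NFD and dropping every combining mark; the
    function name strip_diacritics is hypothetical, and this is not a
    recommended conversion, only a demonstration of the failure.

```python
import unicodedata


def strip_diacritics(text: str) -> str:
    """Naive fallback: decompose to NFD, then drop all combining marks."""
    decomposed = unicodedata.normalize("NFD", text)
    # unicodedata.combining() returns a nonzero class for combining marks.
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


print(strip_diacritics("Minha ama foi à caça"))  # Minha ama foi a caca
```

    The output is plain ASCII, and it is exactly the second sentence.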

    -- ____.
    António MARTINS-Tuválkin | ()|
    <antonio@tuvalkin.web.pt> |####|
    Rua Alberto Bramão, 8-1º d.to |
    PT-1700-132 LISBOA Não me invejo de quem tem |
    +351 934 821 700 carros, parelhas e montes |
    http://www.tuvalkin.web.pt/bandeira/ só me invejo de quem bebe |
    http://pagina.de/bandeiras/ a água em todas as fontes |


