Re: Unicode conversion to normal asci file

From: Doug Ewell (dewell@roadrunner.com)
Date: Thu Jun 07 2007 - 22:49:52 CDT

  • Next message: Christopher Fynn: "Re: Devanagari composing help needed"

    Anto'nio Martins-Tuva'lkin <antonio at tuvalkin dot web dot pt> wrote:

    >> I want a program that will read in a unicode file, convert it to asci
    >> so that we can upload it into our database.
    >> Example:
    >> If the input file looks like:
    >> último año de carrera
    >> It would convert it to
    >> ultimo ano de carrera
    >
    > You must mean 7-bit ASCII, not ANSI (whatever that exactly means here).
    > FWIW, in "ANSI" (considering this to be either Windows cp-1252 or ISO
    > 8859-1) it would still be "último año de carrera" as equivalents of both
    > U+00FA and U+00F1 are present in it.

    He didn't say ANSI. He said "asci", erroneously with one "i" instead of
    two.

    > Anyway, this sentence is an excellent example of why "stripping accents"
    > is not the way to go. Do you know what mean "ano" in Spanish? Do you? ;-)

    I had hoped that in 2007, there would no longer be a need for anyone to
    smash properly accented Spanish down to 7-bit ASCII, but I guess there is.

    --
    Doug Ewell  *  Fullerton, California, USA  *  RFC 4645  *  UTN #14
    http://users.adelphia.net/~dewell/
    http://www1.ietf.org/html.charters/ltru-charter.html
    http://www.alvestrand.no/mailman/listinfo/ietf-languages
    


    This archive was generated by hypermail 2.1.5 : Thu Jun 07 2007 - 23:12:28 CDT