Re: Perhaps OT: Mysterious escape sequences in UN data

From: Tom Gewecke (tom@bluesky.org)
Date: Tue Mar 31 2009 - 18:28:01 CST

  • Next message: Kenneth Whistler: "Re: Perhaps OT: Mysterious escape sequences in UN data"

    On Mar 31, 2009, at 12:58 PM, John Burger wrote:

    >
    > misleading clich\x{5ee5} that
    > Mr. Andr\x{5ee5} Pastrana Arango
    > highlighted by Mr. Rodr\x{74b2}uez
    > issued by the Espace r\x{5ee7}ublicain
    > transmitting an aide-m\x{5e66}oire issued
    >

    > I can correct some of these, but there are hundreds of different
    > ones.

    PS A possible way to convert the text:

    +find/replace \x{ by &#x

    +find/replace } by ;

    +convert html entities to unicode (UnicodeChecker does this on a Mac)

    +save as Big5 encoded

    +open as Latin-1 encoded



    This archive was generated by hypermail 2.1.5 : Tue Mar 31 2009 - 18:30:05 CST