Re: ISO 8859-11 (Thai) cross-mapping table

From: John Cowan (jcowan@reutershealth.com)
Date: Tue Oct 08 2002 - 11:01:12 EDT

  • Next message: John Hudson: "RE: InDesign"

    Marco Cimarosti scripsit:

    > All 8859 tables would be more succint.

    Well, I checked the 8859-2 mapping table, and the only contiguous ranges
    are of length 2, namely 0xA7-0xA8, 0xC1-0xC2, 0xCD-0xCE, 0xD3-0xD4,
    0xD6-0xD7, 0xDC-0xDD, 0xE1-0xE2, 0xF3-0xF4, 0xF6-0xF7, 0xFC-0xFD.
    All of these are places where Latin-1 and Latin-2 coincide.

    > Latin sections are a worse case, but they still benefit slightly, because
    > characters shared with Latin-in stay the same positions.

    This is strongly true of Latin-1, -2, -3, -4: a character appears in the
    same codepoint or not at all. Unfortunately, Latin-3 is used only for
    Esperanto and Maltese, and Latin-4 is dead. The later Latins share only
    with Latin-1.

    -- 
    A mosquito cried out in his pain,               John Cowan
    "A chemist has poisoned my brain!"              http://www.ccil.org/~cowan
            The cause of his sorrow                 http://www.reutershealth.com
            Was para-dichloro-                      jcowan@reutershealth.com
    Diphenyltrichloroethane.                                (aka DDT)
    


    This archive was generated by hypermail 2.1.5 : Tue Oct 08 2002 - 11:51:03 EDT