Re: Unicode charsets registered with IANA

From: John Cowan (jcowan@reutershealth.com)
Date: Mon Sep 30 2002 - 13:13:13 EDT

  • Next message: John Cowan: "Re: The Currency Symbol of China"

    Markus Scherer scripsit:

    > There are also some (obsolete) Unicode 1.1 charsets registered:
    >
    > UNICODE-1-1-UTF-7 (MIBenum 103) csUnicode11UTF7
    > UNICODE-1-1 (MIBenum 1010) csUnicode11 [this uses the UCS-2BE encoding scheme]

    The use of these forms means specifically that the old encoding of Hangul
    is in use. For all other Unicode 1.1 text, some UTF-* charset should be
    given.

    > This is even more obsolete:
    >
    > ISO-10646-UTF-1 (MIBenum 27) csISO10646UTF1 [Unicode 1.0? Very obsolete encoding scheme.]

    That's the predecessor of UTF-8, the one that reserves 0x80-0x9F octets.

    -- 
    John Cowan   <jcowan@reutershealth.com>   http://www.ccil.org/~cowan
    "One time I called in to the central system and started working on a big
    thick 'sed' and 'awk' heavy duty data bashing script.  One of the geologists
    came by, looked over my shoulder and said 'Oh, that happens to me too.
    Try hanging up and phoning in again.'"  --Beverly Erlebacher
    


    This archive was generated by hypermail 2.1.5 : Mon Sep 30 2002 - 14:07:49 EDT