From: John Cowan (jcowan@reutershealth.com)
Date: Mon Sep 30 2002 - 13:13:13 EDT
Markus Scherer scripsit:
> There are also some (obsolete) Unicode 1.1 charsets registered:
>
> UNICODE-1-1-UTF-7 (MIBenum 103) csUnicode11UTF7
> UNICODE-1-1 (MIBenum 1010) csUnicode11 [this uses the UCS-2BE encoding scheme]
The use of these forms means specifically that the old encoding of Hangul
is in use. For all other Unicode 1.1 text, some UTF-* charset should be
given.
> This is even more obsolete:
>
> ISO-10646-UTF-1 (MIBenum 27) csISO10646UTF1 [Unicode 1.0? Very obsolete encoding scheme.]
That's the predecessor of UTF-8, the one that reserves 0x80-0x9F octets.
-- John Cowan <jcowan@reutershealth.com> http://www.ccil.org/~cowan "One time I called in to the central system and started working on a big thick 'sed' and 'awk' heavy duty data bashing script. One of the geologists came by, looked over my shoulder and said 'Oh, that happens to me too. Try hanging up and phoning in again.'" --Beverly Erlebacher
This archive was generated by hypermail 2.1.5 : Mon Sep 30 2002 - 14:07:49 EDT