Re: JIS X 0208 mappings in Unihan.txt

From: Theo Veenker (Theo.Veenker@let.uu.nl)
Date: Thu Jun 02 2005 - 02:31:01 CDT

    Erik van der Poel wrote:
    > Theo Veenker wrote:
    >
    >> In Unihan.txt I see only the 1990 mappings for JIS X 0208.
    >> But ISO-2022-JP-[12] also use JIS X 0208-1978 and JIS X 0208-1983.
    >> Are the later dated versions of this character set supersets
    >> of the earlier versions? Does anyone know where to find the 1978
    >> and 1983 mapping tables?
    >
    >
    > I believe the official source of info for JIS X 0208 is the Japanese
    > Standards Association (JSA):
    >
    > http://www.jsa.or.jp/
    >
    > Ken Lunde explains many (all?) of the differences between the 78, 83 and
    > 90 versions in his book CJKV Information Processing. Some of this info
    > is also available online:
    >
    > http://examples.oreilly.com/cjkvinfo/AppQ/

    As a matter of fact I have a copy of the book but it didn't cross
    my mind to check it; how stupid of me.

    >
    > Unihan.txt contains normative mappings for JIS X 0208-1990 under the
    > kIRG_JSource tag. I do not know of any mapping tables for the 78 and 83
    > versions. Some (most?) implementations simply make them equivalent to
    > the 90 version. See the Background Information section of the RFC 1468:
    >
    > http://ietf.org/rfc/rfc1468.txt

    Yes, I have read this. But I understand that there is a big difference between
    the 78 and 83 versions. According to http://www.io.com/~kazushi/encoding/jis.html,
    the 83 version not only added characters but also changed or exchanged many
    existing ones (whatever that means). Characters were changed in the 90 version
    as well. If this is really true, aren't the revised character sets incompatible
    with the earlier versions? I'll have to look this up in Ken Lunde's book.
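
    To make the concern concrete: ISO-2022-JP (RFC 1468) designates the 78
    and 83 versions with different escape sequences, so a decoder can at least
    tell which version a message claims to use. The following is only a rough
    sketch of that check; the function name and the sample bytes are made up
    for illustration, and it covers only the four designations of plain
    ISO-2022-JP, not the extra ones in ISO-2022-JP-1 and ISO-2022-JP-2.

```python
# Escape sequences that designate a character set in ISO-2022-JP (RFC 1468).
DESIGNATIONS = {
    b"\x1b(B": "ASCII",
    b"\x1b(J": "JIS X 0201-1976 Roman",
    b"\x1b$@": "JIS X 0208-1978",
    b"\x1b$B": "JIS X 0208-1983",
}


def designations_used(data: bytes) -> list[str]:
    """Return the character sets designated in data, in order of first use.

    Hypothetical helper: it only inspects the three bytes starting at each
    ESC and ignores any escape sequence it does not recognise.
    """
    found = []
    i = data.find(b"\x1b")
    while i != -1:
        name = DESIGNATIONS.get(data[i:i + 3])
        if name is not None and name not in found:
            found.append(name)
        i = data.find(b"\x1b", i + 1)
    return found


# Made-up sample: the same two-byte code under both designations.
sample = b"\x1b$@0l\x1b(B and \x1b$B0l\x1b(B"
print(designations_used(sample))
```

    A decoder that treats ESC $ @ and ESC $ B as the same table would decode
    both two-byte runs in the sample identically, which is the simplification
    Erik describes above.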

    Theo


