Re: mixed-script writing systems

From: John Cowan (jcowan@reutershealth.com)
Date: Thu Nov 21 2002 - 11:11:41 EST

  • Next message: John Hudson: "RE: Anyone who can write Hindi on the Unicode List?"

    Dean Snyder scripsit:

    > What are the properties which will trigger separate Unicode encodings for
    > characters typically or always represented by identically shaped glyphs?

    Well, whyn't you say so?

    The normative ones, exactly and precisely. Casing is normative, so
    if language A claims that <squiggle> is upper case, and language B that it
    is lower case, then they must have distinct Unicode representations.
    Case *mapping* is informative, and it's perfectly all right for language
    A to claim that the lower-case form of <squiggle> is <squoggle> whereas
    language C makes it <squaggle> instead. Just another entry in SpecialCasing.

    At present, the most comprehensive list of normative vs. informative properties
    appears to be in http://www.unicode.org/Public/UNIDATA/UnicodeData.html and
    http://www.unicode.org/Public/UNIDATA/PropList.html jointly.

    -- 
    My corporate data's a mess!                     John Cowan
    It's all semi-structured, no less.              http://www.ccil.org/~cowan
        But I'll be carefree                        jcowan@reutershealth.com
        Using XSLT                                  http://www.reutershealth.com
    In an XML DBMS.
    


    This archive was generated by hypermail 2.1.5 : Thu Nov 21 2002 - 11:59:21 EST