Named sequences, was: Saudi-Arabian Copyright sign

From: Peter Kirk (peterkirk@qaya.org)
Date: Tue Sep 21 2004 - 05:58:29 CDT

  • Next message: Jörg Knappen: "RE: Saudi-Arabian Copyright sign"

    On 20/09/2004 19:21, Asmus Freytag wrote:

    > ...
    >
    > PS for named sequences:
    > See: http://www.unicode.org/reports/tr34
    > Draft Data:
    > http://www.unicode.org/Public/4.1-Update/NamedCompositeEntities-4.1.0d4.txt
    >
    > (the last part of the file name may change to NamedSequences*.txt).
    >
    The draft data is actually at
    http://www.unicode.org/Public/4.1-Update/NamedSequences-4.1.0d4.txt.

    Is the intention of these named sequences to list all sequences which
    are commonly considered to be units, although not treated as such by
    Unicode? There are certainly some in Hebrew - at least dotted shin,
    dotted sin and holam male, quite possibly all base characters with
    dagesh. Is the intention to name all sequences which actually occur as
    grapheme clusters? If so, a list of many thousands is needed for Hebrew.

    Where the sequence is supported as an alphabetic presentation form, e.g.
    FB2A, FB2B and FB4B, will there be an equivalent named sequence, or will
    the alphabetic presentation form name be used also for the sequence, or
    will there simply be no need to define a sequence? Two different names
    for the same thing could cause confusion.

    -- 
    Peter Kirk
    peter@qaya.org (personal)
    peterkirk@qaya.org (work)
    http://www.qaya.org/
    


    This archive was generated by hypermail 2.1.5 : Tue Sep 21 2004 - 10:53:01 CDT