From: Peter Kirk (peterkirk@qaya.org)
Date: Tue Sep 21 2004 - 05:58:29 CDT
On 20/09/2004 19:21, Asmus Freytag wrote:
> ...
>
> PS for named sequences:
> See: http://www.unicode.org/reports/tr34
> Draft Data:
> http://www.unicode.org/Public/4.1-Update/NamedCompositeEntities-4.1.0d4.txt
>
> (the last part of the file name may change to NamedSequences*.txt).
>
The draft data is actually at
http://www.unicode.org/Public/4.1-Update/NamedSequences-4.1.0d4.txt.
Is the intention of these named sequences to list all sequences which
are commonly considered to be units, although not treated as such by
Unicode? There are certainly some in Hebrew - at least dotted shin,
dotted sin and holam male, quite possibly all base characters with
dagesh. Is the intention to name all sequences which actually occur as
grapheme clusters? If so, a list of many thousands is needed for Hebrew.
Where the sequence is supported as an alphabetic presentation form, e.g.
FB2A, FB2B and FB4B, will there be an equivalent named sequence, or will
the alphabetic presentation form name be used also for the sequence, or
will there simply be no need to define a sequence? Two different names
for the same thing could cause confusion.
-- Peter Kirk peter@qaya.org (personal) peterkirk@qaya.org (work) http://www.qaya.org/
This archive was generated by hypermail 2.1.5 : Tue Sep 21 2004 - 10:53:01 CDT