Accumulated Feedback on PRI #287

This page is a compilation of formal public feedback received so far. See Feedback for further information on this issue, how to discuss it, and how to provide feedback.

Date/Time: Tue Apr 7 18:37:41 CDT 2015
Name: Asmus Freytag
Report Type: Error Report
Opt Subject: Proposal for UAX#24 Script Name

I propose a new section 2.9.x in UAX#24 that describes in more detail and
rigor how the script extension property is assigned.

< start of proposed section >

Section 2.9.x Script_Extensions Property Values

This section describes formal construction and constraints on the
Script_Extensions property values. In the following the Script_Extensions
property is abbreviated /scx/.

    1. Each code point has exactly one non-empty scx set associated with it.

    That is, the property value for the scx property always consists of a set. 
    The empty set is not allowed; the scx value for unassigned and non-character 
    code points is { Unknown }.

    2. The elements of the scx set consist of an unordered list of unique values 
    of the Script property (sc values).

    The scx values { Latn Grek } and { Grek Latn } are identical; for ease of 
    comparison the sets may be sorted and listed in alphabetical order.

    3. For the purposes of creating scx sets, the following sc values, and only 
    these are considered /implicit/: /Common/, /Inherited/, and /Unknown/. All
    other sc values are considered explicit.

    4. An scx set either contains a single implicit sc value or one or more explicit sc values.

    Examples:...

    5. If Script(c) is explicit, then that value must be an element of the scx set as well.

    Examples...

The number of values for the Script property grows every time a new script is
added to the Unicode Standard. Implementations are advised to allow for
growth. On the other hand, it is unlikely that any scx value would
individually list even a majority of existing scripts. It is for such cases of
widespread use of characters across scripts that the sc value /Common/ was
created.

There are no formal rules for when an sc value is added to the scx set for a
code point; whether to document that a character is used with multiple scripts
via the Script_Extensions property remains a judgment call, and is always
based on the best available information to the Unicode Technical Committee.

< end of proposed section >

I further propose that the Unicode Consortium *not* duplicate this information
in the stability policies. If considered absolutely necessary, my preferred
suggestion would be to cite the section and rules by reference and declare
those which must be formally stabilized.

My strong suspicion is that it is more important to document the script
extension property than to make formal guarantees.

One thing which should be considered is whether scripts can be delisted from
an scx set as more information becomes available. My strong inclination is to
answer that in the positive. (This would get an item 7 in the proposed
section).

Date/Time: Wed Apr 8 14:24:18 CDT 2015
Name: Asmus Freytag
Report Type: Error Report
Opt Subject: Editorial nit: UAX#24

Ed Note: This has already been fixed in the draft.

This is a minor editorial nit on UAX#24 

Table 4. Examples of East Asian Symbols with Common Script Values

==>

Table 4. Examples of East Asian Symbols with Script Value = common

This affects other table headers and surrounding text as well.

Reason: "common values" has a whole set of meanings beyond "value = common"... 
and even if it has the latter meaning it is not obvious without careful reading 
that "common" here is not the same a "commonly occurring" or "shared".