Accumulated Feedback on PRI #441

This page is a compilation of formal public feedback received so far. See Feedback for further information on this issue, how to discuss it, and how to provide feedback.

Date/Time: Tue Mar 29 00:43:30 CDT 2022
Name: Norbert Lindenberg
Report Type: Public Review Issue
Opt Subject: 441

The proposed update for UAX 29 excludes the following Kawi characters 
from having the Grapheme_Cluster_Break property value SpacingMark:

U+11F03 ( ◌𑼃 ) KAWI SIGN VISARGA
U+11F34 ( ◌𑼴 ) KAWI VOWEL SIGN AA
U+11F35 ( ◌𑼵 ) KAWI VOWEL SIGN ALTERNATE AA
U+11F41 ( ◌𑽁 ) KAWI SIGN KILLER

Being excluded from having SpacingMark means that they receive the
Grapheme_Cluster_Break property value Other. In consequence, these
characters do not combine with other characters into extended grapheme
clusters; they always form their own separate grapheme clusters.
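
A minimal sketch of that consequence, assuming a toy segmenter that applies
only the Extend part of GB9 and GB9a from UAX 29. The function name
`clusters` and the property tables are hypothetical, and U+11F12 is assumed
to be KAWI LETTER KA; any base character with the value Other would behave
the same way.

```python
def clusters(text, gcb):
    """Split text into clusters using only GB9 (x Extend) and GB9a (x SpacingMark).

    `gcb` maps characters to a Grapheme_Cluster_Break value; characters that
    are not listed are treated as Other.
    """
    out = []
    for ch in text:
        if out and gcb.get(ch, "Other") in ("Extend", "SpacingMark"):
            out[-1] += ch      # no break before Extend or SpacingMark
        else:
            out.append(ch)     # otherwise, break everywhere (GB999)
    return out


KA = chr(0x11F12)  # assumed: KAWI LETTER KA (illustrative base character)
AA = chr(0x11F34)  # KAWI VOWEL SIGN AA

proposed = {}                    # AA falls back to Other
suggested = {AA: "SpacingMark"}  # AA treated like its Javanese/Balinese counterparts

print(len(clusters(KA + AA, proposed)))   # 2
print(len(clusters(KA + AA, suggested)))  # 1
```

With the proposed values the vowel sign is split from its base; with
SpacingMark the pair stays in one cluster.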

I don't see any reason in the proposal for Kawi, L2/20-284R, or anywhere
else why that should be the case. The purpose of grapheme clusters isn't
well defined, but one case where the Unicode Standard recommends using them
is in emergency line breaking (see UAX 14, section 3, Introduction). If a
line break is introduced before a combining mark of a complex script, fonts
or rendering systems commonly insert a dotted circle as a base for that
mark, which is undesirable.

The corresponding spacing combining marks in the three most closely related
scripts, Javanese, Balinese, and Sundanese, all have the
Grapheme_Cluster_Break property value SpacingMark or (in one case, U+1B35)
Extend. I suggest that Kawi be handled the same way.

Date/Time: Fri Jun 17 09:24:45 CDT 2022
Contact: richard.gibson@gmail.com
Name: Richard Gibson
Report Type: Error Report
Opt Subject: Unicode® Standard Annex #29 UNICODE TEXT SEGMENTATION

TC39 technical group 2 would like to request an improvement
to #Word_Boundary_Rules: an example above WB6, similar to the
one above WB8.

Proposed change, from the tc39/ecma402 GitHub repository, issue 656,
issuecomment-1158026888:

-Do not break letters across certain punctuation.
+Do not break letters across certain punctuation, such as within “e.g” or “example.com”.
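
A rough sketch of why the two examples stay unbroken, assuming a heavily
simplified word breaker that implements only WB5, WB6, and WB7. Here
`str.isalpha()` stands in for AHLetter and the small `MID` set stands in for
(MidLetter | MidNumLetQ); both are approximations, not the real property
data, and the function name `words` is hypothetical.

```python
# Simplified stand-in for (MidLetter | MidNumLetQ): . : ' U+2019 U+00B7
MID = set(".:'\u2019\u00b7")


def is_letter(ch):
    """Approximate AHLetter; an empty string (no neighbour) is not a letter."""
    return ch.isalpha()


def words(text):
    """Segment text using only WB5, WB6, WB7; break everywhere else."""
    out = []
    for i, ch in enumerate(text):
        prev = text[i - 1] if i >= 1 else ""
        prev2 = text[i - 2] if i >= 2 else ""
        nxt = text[i + 1] if i + 1 < len(text) else ""
        no_break = (
            (is_letter(prev) and is_letter(ch))                          # WB5
            or (is_letter(prev) and ch in MID and is_letter(nxt))        # WB6
            or (is_letter(prev2) and prev in MID and is_letter(ch))      # WB7
        )
        if out and no_break:
            out[-1] += ch
        else:
            out.append(ch)
    return out


print(words("e.g"))          # ['e.g']
print(words("example.com"))  # ['example.com']
print(words("e.g."))         # ['e.g', '.']
```

The trailing full stop in "e.g." is split off because WB6 requires a letter
on both sides of the punctuation.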