L2/24-161

Comments on Public Review Issues
(April 3 - July 2, 2024)

The sections below contain links to permanent feedback documents for the open Public Review Issues as well as other public feedback as of  July 2, 2024, since the previous cumulative document was issued prior to UTC #179 (April 3, 2024).

Contents:

The links below go directly to open PRIs and to feedback documents for them, as of July 2, 2024.

Issue Name Feedback Link
507 Proposed Update UAX #9, Unicode Bidirectional Algorithm (feedback) No feedback at this time
506 Proposed Update UAX #15, Unicode Normalization Forms (feedback)
505 Proposed Update UAX #42, Unicode Character Database in XML (feedback) No feedback at this time
504 Proposed Update UTS #46, Unicode IDNA Compatibility Processing (feedback) No feedback at this time
503 Proposed Update UAX #24, Unicode Script Property (feedback) No feedback at this time
502 Unicode 16.0.0 Beta (feedback)
501 Proposed Update UAX #11, East Asian Width (feedback) No feedback at this time
500 Draft UAX #57, Unicode Egyptian Hieroglyph Database (Unikemet) (feedback)
499 Proposed Update UAX #41, Common References for Unicode Standard Annexes (feedback) No feedback at this time
496 Proposed Update UTS #51, Unicode Emoji (feedback)
494 Proposed Update UAX #29, Unicode Text Segmentation (feedback)
492 Proposed Update UTS #39, Unicode Security Mechanisms (feedback) No feedback at this time
491 Proposed Update UAX #31, Unicode Identifiers and Syntax (feedback)
490 Proposed Update UAX #14, Unicode Line Breaking Algorithm (feedback)
489 Proposed Update UAX #44, Unicode Character Database (feedback)
488 Proposed Update UTS #10, Unicode Collation Algorithm (feedback) No feedback at this time
487 Proposed Update UAX #53, Unicode Arabic Mark Rendering (feedback) No feedback at this time
484 Proposed Update UAX #50, Unicode Vertical Text Layout (feedback) No feedback at this time
483 Proposed Update UAX #38, Unicode Han Database (Unihan) (feedback)

The links below go to locations in this document for feedback.

Feedback routed to CJK & Unihan Working Group for evaluation [CJK]
Feedback routed to Script Encoding Working Group for evaluation [SAH]
Feedback routed to Properties & Algorithms Working Group for evaluation [PAG]
Feedback routed to Emoji Standard & Research Working Group for evaluation [ESC]
Feedback routed to Editorial Working Group for evaluation [EDC]
Other Reports

 


Feedback routed to CJK & Unihan Working Group for evaluation [CJK]

Date/Time: Tue Apr 16 18:21:27 CDT 2024
ReportID: ID20240416182127
Name: Paul Masson
Report Type: Error Report
Opt Subject: kMandarin for U+941F 鐟


This character appears to have an alternate pronunciation tì which is not reflected in the database.


Feedback routed to Script Encoding Working Group for evaluation [SAH/SEW]

(None at this time.)


Feedback routed to Properties & Algorithms Working Group for evaluation [PAG]

(None at this time.)


Feedback routed to Emoji Standard & Research Working Group for evaluation [ESC/ESR]

(None at this time.)


Feedback routed to Editorial Working Group for evaluation [EDC]

Date/Time: Mon Apr 29 08:45:22 CDT 2024
ReportID: ID20240429084522
Name: Wuzzy Wuzzard
Report Type: Error Report
Opt Subject: Core Specification 15.0

I think I found an error in the Core Specification 15.0.

Link: https://www.unicode.org/versions/Unicode15.0.0/UnicodeStandard-15.0.pdf

On page 51 (real page number, not PDF page), it says:

    Plane 15 and Plane 16 are allocated, in their entirety, for private use. Those two planes con-
    tain a total of 131,068 characters, to supplement the 6,400 private-use characters located in
    the BMP.
    All other planes are reserved; there are no characters assigned in them. The last two code
    positions of all planes are permanently set aside as noncharacters. (See Section 2.13, Special
    Characters).

This seems to be a contradiction. If Planes 15 and 16 are *entirely* for private use, then the
last two code positions cannot be noncharacters. If the last two code positions are
noncharacters, then Planes 15 and 16 cannot be *entirely* for private use.

The reason why I think this is a contradiction because of another section. Here is an explanation
of "noncharacter" on page 938:

    In effect, noncharacters can be thought of as application-internal private-use code points.
    Unlike the private-use characters discussed in Section 23.5, Private-Use Characters, which
    are assigned characters and which are intended for use in open interchange, subject to
    interpretation by private agreement, noncharacters are permanently reserved (unassigned)
    and have no interpretation whatsoever outside of their possible application-internal pri-
    vate uses.

If this definition is what was actually intended by the spec, I conclude that all
codepoints it defines as "noncharacters" can never be considered "for private use"
at the same time because "for private use" implies interchangability, while "noncharacter"
means "not for interchange"/"internal-only".

Therefore, the statement "Plane 15 and Plane 16 are allocated, in their
entirety, for private use." must be false. The word "entirety" is the problem here.

Other Reports

(None at this time.)