Approved Minutes of UTC Meeting 165
Mountain View, CA — October 5 and 7, 2020
Hosted virtually on Zoom
UTC #165 Agenda
Revision date: August 3, 2021
Monday October 5, 2020
Meeting opened at 9:30 am.
10 Full members, 3 Institutional, 2 Supporting.
Full Members in regular attendance: 7
Institutional Members in regular attendance: 1
Supporting Members in regular attendance: 1
Quorum: 5
6 members represented: Apple, Adobe, Google, Facebook, Microsoft, UCB.
A.1 Membership review, proxies, and meeting quorum
A.3 Approval of minutes of prior meeting [L2/20-172]
[165-C1] Consensus: Approve the minutes of UTC #164 as documented in L2/20-172.
A.5 Action item review [L2/SD2]
A.5.1 Recently closed action items [L2/20-238]
Oral review by Ken Whistler.
Roll Call adjustment: Netflix now represented.
7 members represented: Apple, Adobe, Google, Facebook, Microsoft, UCB, Netflix.
A.6 Calendar review [Calendar]
Discussion. UTC #166 will be January 19 and 21, 2021. UTC #167 will also be virtual, April 27 and 29, 2021. UTC #168 will also likely be virtual, July 27 and 29, 2021. UTC #169 will be October 5 and 7, 2021; but will have a room reserved "in case".
[165-C2] Consensus: There will be no further emoji repertoire (characters and sequences) for Unicode version 14.0 after the January 2021 UTC meeting.
A.7 Liaison reports [ISO, IRG, IETF/ICANN, INFITT, SEI, Mongolian, ICU, CLDR, TC37/SC2]
SEI liaison report (L2/20-254), Deborah Anderson.
TC37/SC2 liaison report (L2/20-273), Peter Constable.
[165-A1] Action Item for Ken Whistler, Peter Constable: Follow up with TC37/SC2 regarding the upcoming ad-hoc meeting and provide a liaison statement.
IRG liaison report, Ken Lunde. Will be covered later.
Mongolian ad-hoc report, Lisa Moore. Meetings bi-weekly; progress being made.
CLDR-TC liaison, oral report, Mark Davis.
ICU-TC liaison, oral report, Markus Scherer.
Short break until 11:00.
C.1 Unihan Ad Hoc Recommendations for UTC #165 Meeting [Lunde, Jenkins, et al, L2/20-235]
[165-C3] Consensus: Make changes to USourceData.txt and to the Unihan database based on feedback from Ken Lunde [Fri Aug 31 08:29:11 CDT 2020], based on document L2/20-239, for Unicode Version 14.0.
[165-A2a] Action Item for John Jenkins: Update records in USourceData.txt and Unihan database based on instructions on page 2 of document L2/20-239 and Unihan-UTC165-R01 in document L2/20-235, for Unicode Version 14.0.
[165-A2b] Action Item for John Jenkins: Prepare a proposal to horizontally-extend U+289B1 𨦱 to add UK-02829 as a new source reference and submit to the UTC and IRG, based on document L2/20-239 and Unihan-UTC165-R01 in document L2/20-235, for Unicode Version 14.0.
[165-C4] Consensus: Accept six new U-Source ideographs as UTC-03228 through UTC-03233 with a UAX #45 status value of N, based on document L2/20-206 and Unihan-UTC165-R02 in document L2/20-235, for Unicode Version 14.0.
[165-A2c] Action Item for John Jenkins: Add six new records to USourceData.txt and their representative glyphs to USourceGlyphs.pdf, based on document L2/20-206 and Unihan-UTC165-R02 in document L2/20-235, for Unicode Version 14.0.
[165-C5] Consensus: Add the first residual stroke field, Field 9, and its description to UAX #45 for Unicode Version 14.0, based on document L2/20-229 and Unihan-UTC165-R03 in document L2/20-235.
[165-A3a] Action Item for John Jenkins: Editorial Committee: Update the text of UAX #45 to include the first residual stroke field, Field 9 and its description, based on document L2/20-229 and Unihan-UTC165-R03 in document L2/20-235, for Unicode Version 14.0.
[165-A3b] Action Item for John Jenkins: Add Field 9 to USourceData.txt, based on document L2/20-229 and Unihan-UTC165-R03 in document L2/20-235, for Unicode Version 14.0.
[165-C6] Consensus: Make changes to the Unihan database based on feedback from Jim Breen [Fri Jul 17 20:18:55 CDT 2020 (updated 2020-09-01)], based on document L2/20-239, for Unicode Version 14.0.
[165-A4] Action Item for John Jenkins: Update the Unihan database to add or change the records, based on document L2/20-239 and Unihan-UTC165-R04 in document L2/20-235, for Unicode Version 14.0.
[165-C7] Consensus: Make changes to the Unihan database based on feedback from Jaemin Chung [Thu Sep 3 12:30:38 CDT 2020], based on document L2/20-239, for Unicode Version 14.0.
[165-A5] Action Item for Michel Suignard, John Jenkins: Make changes to the Unihan database based on document L2/20-239, and Unihan-UTC165-R05 in document L2/20-235, for Unicode Version 14.0.
[165-A6] Action Item for Ken Whistler: Update NamesList.txt to add U+20092 as a related CJK Unified Ideograph to U+2EA7, based on document L2/20-239 and Unihan-UTC165-R06 in document L2/20-235, for Unicode Version 14.0.
[165-C8] Consensus: Make changes to the Unihan database based on feedback from Jaemin Chung [Thu Sep 17 18:20:16 CDT 2020], based on document L2/20-239, for Unicode Version 14.0.
[165-A7] Action Item for Michel Suignard, John Jenkins: Make changes to the Unihan database based on document L2/20-239, and Unihan-UTC165-R07 in document L2/20-235, for Unicode Version 14.0.
[165-C9] Consensus: Accept the five urgently needed characters proposed in L2/20-203 and L2/20-204 with code points U+9FFD through U+9FFF, U+2A6DE, and U+2A6DF, along with the representative glyph change for MC-00137 proposed in L2/20-205, and add to a proposed update of UAX #38 for Unicode Version 14.0 to add the new kIRG_GSource and kIRG_MSource source prefixes.
[165-A8] Action Item for John Jenkins, Editorial Committee: Update UAX #38 to add the new kIRG_GSource and kIRG_MSource source prefixes, along with their syntax and descriptions, based on documents L2/20-203 and L2/20-204, and Unihan-UTC165-R09 in document L2/20-235, for Unicode Version 14.0.
[165-A9] Action Item for Ken Whistler: Update the pipeline to add the five urgently needed characters with code points U+9FFD through U+9FFF, U+2A6DE, and U+2A6DF. See document L2/20-235.
[165-A10] Action Item for Ken Lunde: Request that Macao SAR adjust the representative glyph for MC-00137 per L2/20-205, and provide an updated font to Michel Suignard.
[165-A11] Action Item for John Jenkins, Michel Suignard: Add or change Unihan database records, based on documents L2/20-203 and L2/20-204, and Unihan-UTC165-R09 in document L2/20-235, for Unicode Version 14.0.
[165-A12] Action Item for Michel Suignard, Editorial Committee: Review the description of the URO layout of the Macao source in section 24.2 of the core spec for Unicode version 14.0.
Short break until 12:15.
[165-C10] Consensus: Disunify U+722B and encode a new CJK Unified Ideograph at the end of the Extension C block at code point U+2B735 with a kIRG_VSource property value of V0-3D5B, for Unicode Version 14.0.
[165-A13] Action Item for Michel Suignard, John Jenkins: Update and add Unihan database records based on document L2/20-210 and Unihan-UTC165-R10 in document L2/20-235, for Unicode Version 14.0.
[165-A14] Action Item for Lee Collins: Provide a font to Michel Suignard for updates based on document L2/20-210 and Unihan-UTC165-R10 in document L2/20-235.
[165-A15] Action Item for Ken Whistler: Update the pipeline to add U+2B735. See Unihan-UTC165-R10 in document L2/20-235.
[165-C11] Consensus: Add or change Unihan database records, except for the duplicate record for U+2941B and the proposed alternate kTotalStrokes property value for U+4040, along with the new description of the “V4” prefix in UAX #38, based on document L2/20-230 and Unihan-UTC165-R11 in document L2/20-235, for Unicode Version 14.0.
[165-A16] Action Item for John Jenkins, Editorial Committee: Change the description of the UAX #38 kIRG_VSource property's “V4” prefix to: Kho Chữ Hán Nôm Mã Hoá (Hán Nôm Coded Character Repertoire), Hà Nội, 2007.
[165-A17] Action Item for Lee Collins: Provide to Michel Suignard an updated font that also includes the 20 representative glyph changes, based on document L2/20-230 and Unihan-UTC165-R11 in document L2/20-235, for Unicode Version 14.0.
[165-A18] Action Item for Lee Collins: Propose to the IRG a new UCV (Unifiable Component Variations) for the 決 and 决 components.
[165-A19] Action Item for Michel Suignard, John Jenkins: Update, remove, and add Unihan database records based on document L2/20-230 and Unihan-UTC165-R11 in document L2/20-235, for Unicode Version 14.0.
[165-C12] Consensus: Make changes and additions to the kXHC1983 property, based on document L2/20-231 and Unihan-UTC165-R12 in document L2/20-235, for Unicode Version 14.0.
[165-A20] Action Item for Peter Edberg: Ask the CLDR-TC to check the proposed kMandarin property value for U+2B413 𫐓 then report back to the UTC. See document L2/20-231.
[165-A21] Action Item for John Jenkins: Update and add Unihan database records based on document L2/20-231 and Unihan-UTC165-R12 in document L2/20-235, for Unicode Version 14.0.
[165-A22] Action Item for Ken Lunde: Relay the UTC feedback on kanbun to the author of L2/20-232. See document L2/20-235 section 5.
C.2 PRI #421 Proposed Update UAX #38 Unicode Unihan Database
C.2.1 Feedback on PRI #421 [L2/20-239]
[165-A23] Action Item for John Jenkins, Editorial Committee: Review feedback in PRI #421 from Eduardo Marín Silva [Sun Aug 9 01:10:36 CDT 2020] on possible update of the text of UAX #38.
Break for lunch 13:25 - 14:00.
B.1 Recommendations to UTC #165 October 2020 on Script Proposals [Anderson, L2/20-250]
(Section 2, Todhri)
[165-A24] Action Item for Deborah Anderson, Roozbeh Pournader: Write a document that clearly explains the pros and cons of different approaches to Todhri.
(Section 1, Latin)
[165-C13] Consensus: UTC rescinds approval of the 38 Latin characters listed in SAH-UTC165-R1, document L2/20-250, while waiting for a new proposal that consolidates all the Latin letters with revised codepoints.
[165-A25] Action Item for Ken Whistler: Update the pipeline with changes per above consensus 165-C13.
(Section 3, Vithkuqi)
[165-A26] Action Item for Deborah Anderson: Provide feedback to the author of L2/20-187R that the UTC does not support encoding newly invented modern characters without evidence of usage in text.
(Section 4, UCAS)
[165-C14] Consensus: The UTC accepts 16 Unified Canadian Aboriginal Syllabics characters as specified in SAH-UTC165-R4 of document L2/20-250, in a new Unified Canadian Aboriginal Syllabics Extended-A block (U+11AB0..U+11ABF) for encoding in a future version of the standard. See also document L2/20-255.
[165-A27] Action Item for Liang Hai: Provide a font to Michel Suignard for printing 16 new UCAS symbols. See document L2/20-255.
[165-A28] Action Item for Ken Whistler: Update the pipeline to include 16 UCAS symbols. See document L2/20-255.
Short break until 15:15.
(Section 5, Arabic)
[165-C15] Consensus: Accept three Arabic characters with properties as given in L2/20-245 for encoding in a future version of the standard:
U+061D ARABIC END OF TEXT MARK
U+0890 ARABIC POUND MARK ABOVE
U+0891 ARABIC PIASTRE MARK ABOVE
[165-A29] Action Item for Ken Whistler: Update the pipeline to include three new Arabic characters:
U+061D ARABIC END OF TEXT MARK
U+0890 ARABIC POUND MARK ABOVE
U+0891 ARABIC PIASTRE MARK ABOVE
[165-A30] Action Item for Roozbeh Pournader, Lorna Evans: Provide a font to Michel Suignard for printing three Arabic characters with properties as given in L2/20-245.
[165-A31] Action Item for Ken Whistler, Editorial Committee: Change the spelling "Uighur" to "Uyghur" in the names list annotations, to bring them in line with the current spelling conventions in the Core Specification. For 14.0. Reference: Section 5b of L2/20-250.
[165-A32] Action Item for Roozbeh Pournader: Respond to the author of Eastern Arabic Fractions feedback in L2/20-239 about the vulgar fractions and 'Egyptian' two. Reference: Section 5c of L2/20-250.
[165-C16] Consensus: Move the 98 approved characters for Cypro-Minoan at U+12700..U+12761 and its attendant Cypro-Minoan block (U+12700..U+1276F) to U+12F90..U+12FF1 in a Cypro-Minoan block whose range extends from U+12F90..U+12FFF. Reference: Section 7 of L2/20-250.
[165-A33] Action Item for Ken Whistler: Update the pipeline to move Cypro-Minoan characters. See document L2/20-250, Section 7.
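Note: the ranges in consensus 165-C16 can be cross-checked with a little arithmetic. The sketch below is illustrative only; the helper name `span` and the variable names are hypothetical, and the code point values are copied from the consensus text above.

```python
def span(first, last):
    """Number of code points in the inclusive range first..last."""
    return last - first + 1

# Ranges as stated in consensus 165-C16 (names are illustrative only).
old_chars = (0x12700, 0x12761)   # previously approved character range
old_block = (0x12700, 0x1276F)   # previous Cypro-Minoan block
new_chars = (0x12F90, 0x12FF1)   # relocated character range
new_block = (0x12F90, 0x12FFF)   # relocated Cypro-Minoan block

# Constant shift applied to every character by the move.
offset = new_chars[0] - old_chars[0]

print(span(*old_chars), span(*new_chars))   # character counts before and after
print(span(*old_block), span(*new_block))   # block sizes before and after
print(hex(offset))                          # relocation offset
```

Both character ranges should come out to the same count as the 98 characters named in the consensus, and the block size should be unchanged by the move.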
[165-C17] Consensus: The UTC accepts seven Ahom characters for encoding in a future version of the standard, with properties as given in L2/20-258, and extends the current Ahom block one column so the block is from U+11700..U+1174F.
U+11740 AHOM LETTER CA
U+11741 AHOM LETTER TTA
U+11742 AHOM LETTER TTHA
U+11743 AHOM LETTER DDA
U+11744 AHOM LETTER DDHA
U+11745 AHOM LETTER NNA
U+11746 AHOM LETTER LLA
[165-A34] Action Item for Ken Whistler: Update the pipeline to include seven new Ahom letters. See consensus 165-C17.
[165-A35] Action Item for Deborah Anderson: Provide a font to Michel Suignard for printing seven new Ahom letters. See consensus 165-C17.
[165-A36] Action Item for Ken Whistler: Ask the roadmap committee to extend the Ahom block range.
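Note: a minimal sketch, assuming the usual 16-code-point chart column, of how the seven Ahom code points in consensus 165-C17 relate to the extended block range. The variable names are hypothetical; the code points and names are copied from the consensus text.

```python
# Values copied from consensus 165-C17; variable names are illustrative only.
BLOCK_FIRST, BLOCK_LAST = 0x11700, 0x1174F
COLUMN = 16  # assumption: one code chart column holds 16 code points

new_letters = {
    0x11740: "AHOM LETTER CA",
    0x11741: "AHOM LETTER TTA",
    0x11742: "AHOM LETTER TTHA",
    0x11743: "AHOM LETTER DDA",
    0x11744: "AHOM LETTER DDHA",
    0x11745: "AHOM LETTER NNA",
    0x11746: "AHOM LETTER LLA",
}

block_size = BLOCK_LAST - BLOCK_FIRST + 1
columns = block_size // COLUMN

# Every new letter should fall inside the block, and inside its final column.
all_in_block = all(BLOCK_FIRST <= cp <= BLOCK_LAST for cp in new_letters)
last_column_start = BLOCK_LAST - COLUMN + 1
all_in_last_column = all(cp >= last_column_start for cp in new_letters)

print(block_size, columns, all_in_block, all_in_last_column)
```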
[165-C18] Consensus: The UTC accepts U+1715 TAGALOG SIGN PAMUDPOD for encoding in a future version of the standard, as documented in L2/20-272.
[165-A37] Action Item for Mark Davis: Add U+1715 TAGALOG SIGN PAMUDPOD to the list of confusables, see L2/20-272.
[165-A38] Action Item for Ken Whistler: Update the pipeline to include U+1715 TAGALOG SIGN PAMUDPOD. See document L2/20-272.
[165-C19] Consensus: The UTC accepts a formal name alias of type "correction" for U+AA6E MYANMAR LETTER KHAMTI HHA, for Unicode version 14.0. The formal name alias will be: MYANMAR LETTER KHAMTI LLA. See document L2/20-263.
[165-A39] Action Item for Ken Whistler: Update NameAliases.txt for Unicode 14.0. See L2/20-263 and L2/20-250.
[165-C20] Consensus: The UTC accepts U+20C0 SOM SIGN for encoding in Unicode version 14.0. See documents L2/20-261 and L2/20-250.
[165-A40] Action Item for Ken Whistler: Update the pipeline to include U+20C0 SOM SIGN. See documents L2/20-261 and L2/20-250.
UTC adjourned for the day at 16:15.
Wednesday October 7, 2020
Meeting opened at 9:30 am.
7 members represented: Apple, Adobe, Google, Facebook, Netflix, Microsoft, UCB.
D.1 UTC #165 properties feedback & recommendations [Scherer, et al, L2/20-240]
[165-A41] Action Item for Mark Davis: Forward L2/20-240 item F1 to CLDR for discussion: Handling of quotation marks in line breaking needs language-specific tailoring.
[165-A42] Action Item for Mark Davis, Editorial Committee: Prepare a proposed update of UAX #31 to clarify when & why ZWJ/ZWNJ should be ignored vs. when not. See L2/20-240 item F4. For Unicode version 14.
[165-A43] Action Item for Rick McGowan, Editorial Committee: Post a PRI for the proposed update of UAX #31 to close December 31, 2020 for Unicode version 14.
[165-A44] Action Item for Asmus Freytag, Michel Suignard: Provide a document proposing an option in UAX #31 to prohibit ZWJ/ZWNJ altogether, for identifier security.
[165-A45] Action Item for Mark Davis, Editorial Committee: In UAX #31 more clearly and consistently refer to CLDR for UnicodeSet syntax, according to L2/20-240 item F5. For Unicode version 14.
[165-A46] Action Item for Mark Davis, Editorial Committee: In UAX #31 prefix the sentence: "The Identifier characters are always a superset of the ID_Start characters" with "by definition", for Unicode version 14.
[165-A47] Action Item for Mark Davis: In security/.../IdentifierType.txt, for U+1B6B..U+1B73 add Identifier_Type=Technical as proposed in L2/20-240 item F6, unless other UTC action items about Identifier_Type classifications contradict this. For Unicode 14.
[165-A48] Action Item for Markus Scherer, Editorial Committee: Update UTS #46 to validate ACE label edge cases, see L2/20-240 item F7. For Unicode 14.
[165-A49] Action Item for Roozbeh Pournader: Re document L2/20-240 item F8, investigate what the right Indic shaping properties should be for certain Vedic characters. See also related AI 164-A63. For Unicode 14.
Short break until 10:45.
[165-A50] Action Item for Rick McGowan: Contact Henri Sivonen re L2/20-202 and refer to L2/20-240, section D2.
D.2 Open Sourcing the Last Resort Font
[165-C21] Consensus: Make the Last Resort Font Github repository public, after updating the ReadMe file appropriately.
[165-A51] Action Item for Ken Lunde, Rick McGowan: Update the ReadMe.md file in Github for the Last Resort Font to include the following in addition to other updates: "This font may be updated for future versions of the standard as time and resources permit."
[165-A52] Action Item for Rick McGowan: Update the Last Resort Font page to refer to the appropriate Github repository.
Short break.
F.1 Editorial Committee Report and Recommendations for UTC #165 Meeting [Whistler, L2/20-241]
[165-A53] Action Item for Ken Whistler, Editorial Committee: Clarify the names list annotation regarding punctus elevatus (U+2E4E) for Unicode 14.0. Ref. David Corbett, July 21, in L2/20-239. [Tue Jul 21 13:06:27 CDT 2020]
[165-A54] Action Item for Liang Hai, Editorial Committee: In Section 12.9, Malayalam, of the core specification, provide clarification about the attestations of candrakkala (U+0D4D) in some irregular forms. For Unicode 14.0. Ref. Ajith, July 29, in L2/20-239. [Wed Jul 29 23:33:52 CDT 2020]
[165-A55] Action Item for Liang Hai, Editorial Committee: In Section 12.9, Malayalam, of the core specification, provide an explanation of the rationale for use of decomposed sequences in two-part vowels in the examples in Table 12-41. For Unicode 14.0. Ref. Ajith, July 30, in L2/20-239. [Thu Jul 30 01:19:40 CDT 2020]
[165-A56] Action Item for Peter Constable, Editorial Committee: Prepare proposed update text for UTS #39 for Version 14.0, incorporating textual suggestions noted in L2/20-239. Ref. Peter Constable, July 30. [Thu Jul 30 15:56:14 CDT 2020, Thu Jul 30 16:27:43 CDT 2020]
[165-A57] Action Item for Ken Whistler, Editorial Committee: Prepare proposed update text for UAX #31 for Version 14.0, incorporating specific textual suggestions noted in L2/20-239. Ref. Peter Constable, July 30. [Thu Jul 30 16:47:52 CDT 2020, Thu Jul 30 17:11:37 CDT 2020]
Lunch break 12:05 - 14:00.
E.1 Recommendations for Emoji, Unicode 14.0 [Daniel/ESC, L2/20-242]
Long discussion.
[165-C22] Consensus: Remove seven provisional emoji candidates based on section IV of L2/20-242R2.
[165-A58] Action Item for Mark Davis, Ned Holbrook: Remove seven provisional emoji candidates based on section IV of L2/20-242R2.
[165-C23] Consensus: Accept thirty-seven draft candidate atomic characters with codepoints and 75 sequences based on document L2/20-242R2.
[165-A59] Action Item for Mark Davis, Ned Holbrook: Update the emoji charts with the thirty-seven draft candidates approved by UTC. See Consensus 165-C23 above and document L2/20-242R2.
[165-A60] Action Item for Ken Whistler: Update the pipeline to include thirty-seven emoji candidates. See Consensus 165-C23 above and document L2/20-242R2.
E.2 Comments on Emoji 13.1 and 14.0 Candidates [Buff, L2/20-200]
E.2.1 ESC comments on 2020 Q3 feedback [ESC, L2/20-227]
[165-A61] Action Item for Rick McGowan: Point Charlotte Buff to the ESC response document L2/20-227.
UTC adjourned for the week at 16:30.
L2 continued after a short break.
Members Represented
Full Member | 10/05/20 | 10/07/20 |
1. Adobe | yes | yes |
2. Apple Inc. | yes | yes |
3. Facebook | yes | yes |
4. Google, Inc. | yes | yes |
5. IBM Corporation | | |
6. Microsoft Corporation | yes | yes |
7. Netflix | yes | yes |
8. SAP AG | | |
9. Sultanate of Oman, MARA | | |
Institutional Member | | |
1. Bangladesh, MSICT | | |
2. India, MICT | | |
3. Tamil Nadu, TVA | | |
4. UCB | yes | yes |
Supporting Member | | |
1. Emojipedia | yes | |
2. Monotype Imaging Corp | | |
Associate Member | | |
1. Emojination | yes | yes |
2. SIL | yes | yes |
UTC Attendance
Person | Representing |
Deborah Anderson | U.C. Berkeley |
Fesseha Atlaw | self |
Dragan Besevic | Netflix |
Frederick Brennan | self |
Jeremy Burge | Emojipedia |
Chris Chapman | Adobe |
Mia Cinelli | self |
Lee Collins | Netflix |
Peter Constable | self |
Craig Cummings | Amazon |
Jennifer Daniel | |
Mark Davis | |
Peter Edberg | Apple |
Behnam Esfahbod | self |
Lorna Evans | SIL |
Loïz Fily (BZH) | Office of Breton Language |
Rich Gillam | Apple |
Andrew Glass | Microsoft |
Joshua Hadley | Adobe |
Liang Hai | Unicode |
Ned Holbrook | Apple |
John Jenkins | Apple |
Kevin Keystone | self |
Jan Kučera | self |
Jennifer 8. Lee | Emojination |
Kristi Lee | Microsoft |
Ken Lunde | Unicode |
Rick McGowan | Unicode |
Lisa Moore | Unicode |
Timo Nijssen | self |
Marcel Pauluk | self |
Roozbeh Pournader | |
Murray Sargent | Microsoft |
Markus Scherer | |
Jiali Sheng | Microsoft |
Michel Suignard | Unicode |
Tex Texin | self |
Ken Whistler | Unicode |
Shawn Xu | Netflix |
Daniel Yacob | self |
Ben Yang | Panlex |
Members not in regular attendance: Tamil Nadu, Oracle, SAP, Sultanate of Oman
Quorum: 5