Hello,
This discrepancy was addressed during the release process. Please refer to the published Version 9.0 of UAX #29 and the UCD files.
Regards,
L.
-----Original Message-----
From: Unicode [mailto:unicode-bounces_at_unicode.org] On Behalf Of Daniel Bünzli
Sent: Tuesday, June 21, 2016 2:20 PM
To: Unicode_at_unicode.org
Subject: UAX29 9.0.0 Grapheme cluster spec & test discrepancy
Hello,
It seems there's a discrepancy between the tests and the spec for grapheme clusters. In
http://www.unicode.org/Public/9.0.0/ucd/auxiliary/GraphemeBreakTest.txt
we have:
÷ 261D × 0308 × 1F3FB ÷
# ÷ [0.2] WHITE UP POINTING INDEX (E_Base) # × [9.0] COMBINING DIAERESIS (Extend) # × [10.0] EMOJI MODIFIER FITZPATRICK TYPE-1-2 (E_Modifier) ÷ [0.3]
which is
http://www.unicode.org/Public/9.0.0/ucd/auxiliary/GraphemeBreakTest.html#r10.0
but the spec doesn't talk about interleaved Extend*:
http://www.unicode.org/reports/tr29/proposed.html#GB10
Following the spec, it seems this would instead be:
÷ 261D × 0308 ÷ 1F3FB ÷
Which one is right?
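To make the two readings concrete, here is a minimal sketch in Python. It is not a UAX #29 implementation: the property table is a hand-written assumption covering only the three code points in the test line, the names `GCB`, `segment` and `_gb10_applies` are hypothetical, and only GB1, GB2, GB9, GB10 and the default break rule are modelled. The `allow_extend` flag switches between the reading where GB10 requires E_Base to sit directly before E_Modifier and the reading the test file implies, where Extend characters may intervene.

```python
# Toy Grapheme_Cluster_Break values, limited to the code points in the test line.
GCB = {
    0x261D: "E_Base",       # WHITE UP POINTING INDEX
    0x0308: "Extend",       # COMBINING DIAERESIS
    0x1F3FB: "E_Modifier",  # EMOJI MODIFIER FITZPATRICK TYPE-1-2
}


def _gb10_applies(props, i, allow_extend):
    """True if the character at index i (or, when allow_extend is set, the
    character found by skipping back over Extend) is an E_Base."""
    j = i
    if allow_extend:
        while j > 0 and props[j] == "Extend":
            j -= 1
    return props[j] == "E_Base"


def segment(cps, allow_extend):
    """Return a GraphemeBreakTest.txt-style line for the code points in cps."""
    props = [GCB[cp] for cp in cps]
    out = ["÷"]  # GB1: break at start of text
    for i, cp in enumerate(cps):
        out.append(f"{cp:04X}")
        if i + 1 == len(cps):
            out.append("÷")  # GB2: break at end of text
            break
        nxt = props[i + 1]
        if nxt == "Extend":
            out.append("×")  # GB9: no break before Extend
        elif nxt == "E_Modifier" and _gb10_applies(props, i, allow_extend):
            out.append("×")  # GB10 under the chosen reading
        else:
            out.append("÷")  # otherwise break
    return " ".join(out)


line = [0x261D, 0x0308, 0x1F3FB]
print(segment(line, allow_extend=True))   # reading implied by the test file
print(segment(line, allow_extend=False))  # reading without interleaved Extend
```

With `allow_extend=True` the sketch prints `÷ 261D × 0308 × 1F3FB ÷`, matching the test file; with `allow_extend=False` it prints `÷ 261D × 0308 ÷ 1F3FB ÷`.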
Best,
Daniel
Received on Tue Jun 21 2016 - 19:34:05 CDT