UAX29 9.0.0 Grapheme cluster spec & test discrepancy

From: Daniel Bünzli <daniel.buenzli_at_erratique.ch>
Date: Tue, 21 Jun 2016 22:19:31 +0100

Hello,

It seems there's a discrepancy between the tests and the spec for grapheme clusters. In

 http://www.unicode.org/Public/9.0.0/ucd/auxiliary/GraphemeBreakTest.txt

we have:

÷ 261D × 0308 × 1F3FB ÷
# ÷ [0.2] WHITE UP POINTING INDEX (E_Base)
# × [9.0] COMBINING DIAERESIS (Extend)
# × [10.0] EMOJI MODIFIER FITZPATRICK TYPE-1-2 (E_Modifier) ÷ [0.3]

which is

 http://www.unicode.org/Public/9.0.0/ucd/auxiliary/GraphemeBreakTest.html#r10.0

but rule GB10 in the spec doesn't mention interleaved Extend*:

 http://www.unicode.org/reports/tr29/proposed.html#GB10

Following the spec as written, it seems this should be:

÷ 261D × 0308 ÷ 1F3FB ÷
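
To make the two readings concrete, here is a minimal sketch, not a full UAX29 implementation. It only knows the three code points above, and the names (PROPS, segment, base_before, allow_extend) are made up for illustration. With allow_extend=False, GB10 is read as E_Base directly followed by E_Modifier, which is my reading of the spec; with allow_extend=True, Extend characters may sit in between, which is an assumption that reproduces the test line.

```python
# Grapheme_Cluster_Break values for the three code points in the test line
# (hypothetical lookup table, taken from the comments in the test file).
PROPS = {0x261D: "E_Base", 0x0308: "Extend", 0x1F3FB: "E_Modifier"}


def base_before(cps, i, allow_extend):
    """True if position i counts as an E_Base for the purposes of GB10."""
    j = i
    if allow_extend:
        # Assumed reading: skip back over any interleaved Extend characters.
        while j > 0 and PROPS[cps[j]] == "Extend":
            j -= 1
    return PROPS[cps[j]] == "E_Base"


def segment(cps, allow_extend):
    """Render cps in GraphemeBreakTest.txt notation (÷ break, × no break)."""
    out = ["÷"]  # GB1: break at start of text
    for i, cp in enumerate(cps):
        out.append(f"{cp:04X}")
        if i + 1 == len(cps):
            out.append("÷")  # GB2: break at end of text
            break
        right = PROPS[cps[i + 1]]
        if right == "Extend":
            out.append("×")  # GB9: no break before Extend
        elif right == "E_Modifier" and base_before(cps, i, allow_extend):
            out.append("×")  # GB10: no break between base and modifier
        else:
            out.append("÷")  # GB999: break everywhere else
    return " ".join(out)


line = [0x261D, 0x0308, 0x1F3FB]
print(segment(line, allow_extend=False))
print(segment(line, allow_extend=True))
```

The first call gives the segmentation I derive from the spec, the second gives the one in the test file.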

Which one is right?

Best,

Daniel
Received on Tue Jun 21 2016 - 16:20:39 CDT
