From: Mike (mike-list@pobox.com)
Date: Mon Oct 01 2007 - 07:32:37 CST
>> I'm not sure I agree that you want to look for default grapheme
>> cluster boundaries inside a character class.
>
> Yes this is best to look into default grapheme clusters within a character
> class so that the embedded regexp encoded using NFC or NFD are treated
> equivalently.
You ignored the problem with this that I brought up. If you had
a character class consisting of U+1100 and U+1101, both Hangul L
jamos, they would combine into a single grapheme cluster, equiv-
alent to [\q{\u1100\u1101}], instead of [\u1100\u1101].
Mike
This archive was generated by hypermail 2.1.5 : Mon Oct 01 2007 - 07:39:13 CST