Re: Call for feedback on UTS #18: Unicode Regular Expressions

From: Mark Davis ☕️ via Unicode <unicode_at_unicode.org>
Date: Thu, 2 Jan 2020 20:22:00 +0100

The line just above that is:

Name matching rules follow Matching Rules
<http://unicode.org/reports/tr44/#Matching_Rules> from [UAX44#UAX44-LM2
<http://www.unicode.org/reports/tr18/proposed.html#UAX44>].

The deletion was based on feedback that the deleted text was a recap of the
above line, but a recap that didn't have precisely the same description.
It's best to point to the exact description, and have that be in one place.

Mark

On Thu, Jan 2, 2020 at 6:40 PM Karl Williamson via Unicode <
unicode_at_unicode.org> wrote:

> One thing I noticed in reviewing this is the removal of text about loose
> matching of the name property. But I didn't see an explanation for this
> removal. Please point me to the explanation, or tell me what it is.
>
> Specifically these lines were removed:
>
> As with other property values, names should use a loose match,
> disregarding case, spaces and hyphen (the underbar character "_" cannot
> occur in Unicode character names). An implementation may also choose to
> allow namespaces, where some prefix like "LATIN LETTER" is set globally
> and used if there is no match otherwise.
>
> There are, however, three instances that require special-casing with
> loose matching, where an extra test shall be made for the presence or
> absence of a hyphen.
>
> U+0F68 TIBETAN LETTER A and
> U+0F60 TIBETAN LETTER -A
> U+0FB8 TIBETAN SUBJOINED LETTER A and
> U+0FB0 TIBETAN SUBJOINED LETTER -A
> U+116C HANGUL JUNGSEONG OE and
> U+1180 HANGUL JUNGSEONG O-E
>
>
>
Received on Thu Jan 02 2020 - 13:22:35 CST

This archive was generated by hypermail 2.2.0 : Thu Jan 02 2020 - 13:22:36 CST