Re: North Indic Fractions etc.

From: Philippe Verdy ([email protected])
Date: Tue Jun 07 2011 - 00:19:17 CDT

Next message: Philippe Verdy: "Re: Slots for Cyrillic Accented Vowels"

Previous message: fantasai: "Re: North Indic Fractions etc."
In reply to: fantasai: "North Indic Fractions etc."
Next in thread: Frédéric Grosshans: "Aegean numbers [was : Re: North Indic Fractions etc.]"

While experimenting again with the online UnicodeSet tool, I just found
what looks like a new bug in the handling of text/word boundaries when
property values are matched by /regexps/. See:

(1) http://unicode.org/cldr/utility/list-unicodeset.jsp?a=\p{Block:/(?i)\blatin/}

OK, this returns data about all code points in a block whose name
contains a word starting with "latin" (ignoring case). Note that \b
(word boundary) is honored at the _beginning_ of a word (here this
returns all Latin blocks).

(2) http://unicode.org/cldr/utility/list-unicodeset.jsp?a=\p{Block:/(?i)\blatin$/}

BAD, this should return a non-empty subset; all of the above concerns
blocks whose names are TERMINATED by the word "latin", so this should be
equivalent, but it is not.

(3) http://unicode.org/cldr/utility/list-unicodeset.jsp?a=\p{Block:/(?i)\blatin\b/}

BAD, the result is empty: I just want the list of code points in all
blocks whose names contain the word "latin".

What am I doing wrong? I thought this was correct according to the help
page, which explains the supported text/word boundaries (^, $, and \b):
http://cldr.unicode.org/unicode-utilities/list-unicodeset
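
For comparison, here is a minimal sketch of how the three patterns above
behave in Python's `re` module, used only as a reference regex engine. The
block list is a small hand-picked sample, and `matching_blocks` is a
hypothetical helper, not part of the UnicodeSet tool. It says nothing about
how the tool itself is implemented. The underscored variant is included
only as an assumption worth testing, since "_" counts as a word character
for \b and I do not know which form of the name the tool matches against.

```python
import re

# A few real Unicode block names, written with spaces (hand-picked sample).
BLOCKS = [
    "Basic Latin",
    "Latin-1 Supplement",
    "Latin Extended-A",
    "Latin Extended Additional",
    "Greek and Coptic",
    "Cyrillic",
]

# Assumption: the same names in an underscored form. "_" is a word
# character, so there is no \b between "_" and "L" in "Basic_Latin".
UNDERSCORED = [name.replace(" ", "_") for name in BLOCKS]

# The three patterns from cases (1), (2) and (3).
PATTERNS = [r"(?i)\blatin", r"(?i)\blatin$", r"(?i)\blatin\b"]


def matching_blocks(pattern, names):
    """Return the names in which `pattern` is found anywhere (re.search)."""
    regex = re.compile(pattern)
    return [name for name in names if regex.search(name)]


for pattern in PATTERNS:
    print(pattern)
    print("  spaced:     ", matching_blocks(pattern, BLOCKS))
    print("  underscored:", matching_blocks(pattern, UNDERSCORED))
```

With spaced names, pattern (2) finds only "Basic Latin" and pattern (3)
finds every Latin block in the sample, which is the behavior I expected
from the tool. With underscored names, (2) finds nothing and (3) finds only
"Latin-1_Supplement", so underscores alone would not fully explain an empty
result for (3).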

--Philippe.

2011/6/6 fantasai <[email protected]>:
> Is there a reason why the North Indic fractions and Aegean numbers and
> measures are not assigned to any scripts in the ScriptExtensions.txt file?
>
> I don't really know what list of scripts they should belong to, but they
> don't seem very "Common".
>
> http://unicode.org/cldr/utility/list-unicodeset.jsp?a=[%3ABlock%3DAegean_Numbers%3A][\uA830-\uA837\uA839]&g=
>
> ~fantasai
>
>


This archive was generated by hypermail 2.1.5 : Tue Jun 07 2011 - 00:25:40 CDT