From: Aleksander Morgado (aleksander@es.gnu.org)
Date: Sat Apr 12 2008 - 16:46:37 CDT
Hi all,
Some months ago I filled an error report in the Unicode website related 
to the 'More_Above' context specification for casing. This error was in 
Unicode 5.0, and the error report was as follows:
"I believe there is an error in the description of the 'More_Above' 
context specification for casing (Table 3-14, page 124). According to 
the regular expression provided, the wording of the description should 
say "C is followed by a character of combining class 230 (Above) with no 
intervening character of combining class 0". The last part of the 
sentence provided in the standard ("or 230 (Above)") should be removed."
I guess people in Unicode analyzed the error report, and now this is 
what we have in the errata list (http://www.unicode.org/errata/):
"On p. 124 of The Unicode Standard, Version 5.0, there is an error in 
the Regular Expressions column for "More_Above", in the third row of 
Table 3-14, Context Specification for Casing. The corrected regular 
expression should be: [^\p{ccc=230}\p{ccc=0}]* [\p{ccc=230}] "
IMHO the problem remains in the wording of the 'Description' column in 
table 3-14, and not in the regular expression (the old one was: 
[^\p{ccc=0}]* [\p{ccc=230}] ). The idea is that if the character C must 
be followed by a character of combining class 230, we shouldn't check 
for intervening characters of combining class 230.
What do you think?
-Aleksander Morgado
This archive was generated by hypermail 2.1.5 : Sat Apr 12 2008 - 16:49:58 CDT