From: Mike (mike-list@pobox.com)
Date: Wed Sep 05 2007 - 12:07:15 CDT
> To my surprise, I cannot find a mention of the First> Last> notation
> in http://www.unicode.org/Public/UNIDATA/UCD.html, which only mentions
> the "X..Y" form to describe a range.
>
> Yet, First> Last> is used in UnicodeData.txt instead of "X..Y":
>
> 3400;<CJK Ideograph Extension A, First>;Lo;0;L;;;;;N;;;;;
> 4DB5;<CJK Ideograph Extension A, Last>;Lo;0;L;;;;;N;;;;;
>
> So, where is it documented?
I can't recall finding it documented anywhere, but the reason a
range is not used is because UnicodeData.txt does not support
ranges. I believe that this is for backward compatibility
reasons -- the format of UnicodeData.txt has been frozen to
avoid breaking existing software that parses the file.
My parser ignores lines matching /(First|Last)>;/.
Mike
This archive was generated by hypermail 2.1.5 : Wed Sep 05 2007 - 12:10:49 CDT