L2/09-179
Subject: Proposed Bidi IssuesDate: 2009-05-01 From: Mark Davis To: UTC
FYI: I posted an online demo of the bidi
algorithm at
http://unicode.org/cldr/utility/bidi.jsp.
This demo can be used to see which rules
are invoked at which points, and the
resulting reordering.
The following reflects the consensus among the bidi@unicode.org participants on these three open issues in the UBA, with some additional editorial changes. X9: http://www.unicode.org/reports/tr9/tr9-20.html#X9
Option 3. (new) Clarify that all
rules apply to current types; add BN to
the list of types in X6.
Note: Having any characters of class BN survive rule X9 is simply an unintended consequence of not having drafted rule X6 carefully enough. It's more consistent if these special characters either survive all or none. At this point, we appear to have rough consensus on this option. Suggested Text Changes: (For clarification, see Editorial changes below.)
X6. For all
types besides RLE, LRE, RLO, LRO,
and PDF:
=>
X6. For all
types besides RLE, LRE, RLO, LRO,
PDF, and BN:
N1. http://www.unicode.org/reports/tr9/tr9-20.html#N1Option 1. This seems to be what a majority of current implementations do. While it is not the best technical solution if we had a blank slate, we are constrained by compatibility issues. At this point, we appear to have rough consensus on this option. Suggested Text Changes: R N R → R R R L N L → L L L R N AN → R R AN AN N R → AN R R R N EN → R R EN EN N R → EN R R=> L N L → L L L R N R → R R R R N AN → R R AN R N EN → R R EN AN N R → AN R R AN N AN → AN R AN AN N EN → AN R EN EN N R → EN R R EN N AN → EN R AN EN N EN → EN R EN HL6. http://www.unicode.org/reports/tr9/tr9-20.html#HL6We were not able to reach a consensus on this issue. Here are some of the options that have been proposed:
Editorial1. In editing, I noticed that we don't explicitly associate the Unicode property names, such as Bidi_Class, with the older terms such as BD1.
BD1. The
bidirectional characters types are
values assigned to each Unicode
character, including unassigned
characters.
We should use the formal property names listed below, and explicitly equate the older terms to these. I suggest the exact wording be left to the Editorial Subcommittee.
We discussed this in the bidi subcommittee, and the following reflects some of the discussion, but we didn't come to consensus on the text.
Note: I'm a bit concerned about drifting too far from the structure in
|
_displayNameOrEmail_ - _time_ - Remove
_text_