Unicode Utilities: BIDI (UBA)


Shows how the UBA (Unicode Bidirectional Algorithm) processes a sample string. The UBA determines the display order of bidirectional text, which includes all Arabic and Hebrew text in Unicode. For more information, see bidi info.

Not yet updated for the changes in Unicode 6.3!

Sample: http://a.b.C.D.com/E/F/g/h

Paragraph 1

Base Level 0 = LTR (explicit)

Source

Memory Position  Character  Bidi Class  Rules Applied  Resulting Level
0                h          L           -              L0
1                t          L           -              L0
2                t          L           -              L0
3                p          L           -              L0
4                :          CS          W6→ON, N1→L    L0
5                /          CS          W6→ON, N1→L    L0
6                /          CS          W6→ON, N1→L    L0
7                a          L           -              L0
8                .          CS          W6→ON, N1→L    L0
9                b          L           -              L0
10               .          CS          W6→ON, N2→L    L0
11               C          R           -              L1
12               .          CS          W6→ON, N1→R    L1
13               D          R           -              L1
14               .          CS          W6→ON, N2→L    L0
15               c          L           -              L0
16               o          L           -              L0
17               m          L           -              L0
18               /          CS          W6→ON, N2→L    L0
19               E          R           -              L1
20               /          CS          W6→ON, N1→R    L1
21               F          R           -              L1
22               /          CS          W6→ON, N2→L    L0
23               g          L           -              L0
24               /          CS          W6→ON, N1→L    L0
25               h          L           -              L0
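
The resolution shown in the Source table can be approximated with a short Python sketch. It is deliberately incomplete: it assumes the text contains only the classes L, R and CS, with no numbers and no explicit embedding codes, so W6 always turns CS into ON and only N1, N2, I1 and I2 come into play. The function name resolve_levels is hypothetical and is not part of the tool.

```python
def resolve_levels(classes, base_level=0):
    """Simplified level resolution for text made of L, R and CS only."""
    embedding = "L" if base_level % 2 == 0 else "R"
    # W6: with no adjacent numbers, a separator (CS) becomes other neutral (ON).
    types = ["ON" if c == "CS" else c for c in classes]
    resolved = list(types)
    n = len(types)
    i = 0
    while i < n:
        if types[i] != "ON":
            i += 1
            continue
        j = i
        while j < n and types[j] == "ON":
            j += 1
        # Strong type on each side; the paragraph edges count as the
        # embedding direction (a simplification of sor/eor).
        before = types[i - 1] if i > 0 else embedding
        after = types[j] if j < n else embedding
        # N1: same direction on both sides wins; N2: otherwise use embedding.
        fill = before if before == after else embedding
        resolved[i:j] = [fill] * (j - i)
        i = j
    # I1 (even level): R goes up one level. I2 (odd level): L goes up one level.
    raised = "R" if base_level % 2 == 0 else "L"
    return [base_level + (1 if t == raised else 0) for t in resolved]


sample_classes = ("L L L L CS CS CS L CS L CS R CS R CS "
                  "L L L CS R CS R CS L CS L").split()
print(resolve_levels(sample_classes))
```

For the sample classes this yields level 0 everywhere except memory positions 11 to 13 and 19 to 21, which resolve to level 1, in line with the Resulting Level values above.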

Reordered

Display Position 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25
Memory Position 0 1 2 3 4 5 6 7 8 9 10 13 12 11 14 15 16 17 18 21 20 19 22 23 24 25
Character h t t p : / / a . b . D . C . c o m / F / E / g / h
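
The mapping from memory position to display position follows the UBA reordering rule L2: starting at the highest level and working down to the lowest odd level, each contiguous run of characters at that level or higher is reversed. The following Python sketch illustrates that rule under the assumption that the resolved levels are already known (here they are copied from the sample); the function name reorder is hypothetical.

```python
def reorder(levels):
    """Return the memory positions in display order (UBA rule L2)."""
    order = list(range(len(levels)))
    odd_levels = [lv for lv in levels if lv % 2 == 1]
    if not odd_levels:
        return order  # purely left-to-right: nothing to reverse
    for level in range(max(levels), min(odd_levels) - 1, -1):
        i = 0
        while i < len(order):
            if levels[order[i]] < level:
                i += 1
                continue
            j = i
            while j < len(order) and levels[order[j]] >= level:
                j += 1
            # Reverse the contiguous run at this level or higher.
            order[i:j] = order[i:j][::-1]
            i = j
    return order


text = "http://a.b.C.D.com/E/F/g/h"
levels = [0] * 11 + [1] * 3 + [0] * 5 + [1] * 3 + [0] * 4
display_order = reorder(levels)
print("".join(text[p] for p in display_order))  # prints http://a.b.D.C.com/F/E/g/h
```

Only the two level-1 runs (memory positions 11 to 13 and 19 to 21) are reversed, which is why C and D, and E and F, swap places on display.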

ASCII Hack

For testing the UBA with only ASCII characters, the following property values are used (< stands for RLM and > stands for LRM):

Bidi Class  Characters
L           > a b c d e f g h i j k l m n o p q r s t u v w x y z
LRE         ]
LRO         }
R           < A B C D E F G H I J K L M
AL          N O P Q R S T U V W X Y Z
RLE         [
RLO         {
PDF         |
EN          0 1 2 3 4
ES          + -
ET          # $ %
AN          5 6 7 8 9
CS          , . / :
NSM         "
BN          _
B           «<control-000A>» «<control-000D>»
S           «<control-0009>» «<control-000B>»
WS          «<control-000C>» «SPACE»
ON          ! & ' ( ) * ; = ? @ \ ^ ` ~
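
As an illustration, the ASCII hack assignments above can be written as a lookup table. This is a minimal Python sketch, not the tool's own code; the names ASCII_HACK, CLASS_OF, ascii_hack_class and classify are hypothetical. Characters outside the table raise a KeyError.

```python
# Bidi class assigned to each ASCII character under the ASCII hack.
ASCII_HACK = {
    "L": ">abcdefghijklmnopqrstuvwxyz",   # '>' stands for LRM
    "LRE": "]",
    "LRO": "}",
    "R": "<ABCDEFGHIJKLM",                # '<' stands for RLM
    "AL": "NOPQRSTUVWXYZ",
    "RLE": "[",
    "RLO": "{",
    "PDF": "|",
    "EN": "01234",
    "ES": "+-",
    "ET": "#$%",
    "AN": "56789",
    "CS": ",./:",
    "NSM": '"',
    "BN": "_",
    "B": "\n\r",        # U+000A, U+000D
    "S": "\t\x0b",      # U+0009, U+000B
    "WS": "\x0c ",      # U+000C, SPACE
    "ON": "!&'()*;=?@\\^`~",
}

# Invert the table: one entry per character.
CLASS_OF = {ch: cls for cls, chars in ASCII_HACK.items() for ch in chars}


def ascii_hack_class(ch):
    """Return the ASCII-hack bidi class of a single character."""
    return CLASS_OF[ch]


def classify(text):
    """Return the list of ASCII-hack bidi classes for a string."""
    return [ascii_hack_class(ch) for ch in text]


print(classify("a.B"))  # prints ['L', 'CS', 'R']
```

Applied to the sample string http://a.b.C.D.com/E/F/g/h, classify returns the same sequence as the Bidi Class row of the Source table.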

Fonts and Display. If you don't have a good set of Unicode fonts (and a modern browser), you may not be able to read some of the characters. Suggested fonts that you can add for coverage: Unicode Fonts for Ancient Scripts; Noto Fonts site; Large, multi-script Unicode fonts. See also: Unicode Display Problems.

Version 3.7; ICU version: 54.0.1.0; Unicode version: 7.0.0.0