Unicode Utilities: BIDI (UBA)


Shows processing by the UBA (Unicode Bidi Algorithm), which is used to display all Unicode Arabic and Hebrew text. For more information, see bidi info.

Not yet updated for the changes in Unicode 6.3!

Sample: http://a.b.C.D.com/E/F/g/h

Paragraph 1

Base Level: 1 = RTL (explicit)

Source

Memory Position  Character  Bidi Class  Rules Applied     Resulting Level
0                h          L           -                 2
1                t          L           -                 2
2                t          L           -                 2
3                p          L           -                 2
4                :          CS          W6 → ON, N1 → L   2
5                /          CS          W6 → ON, N1 → L   2
6                /          CS          W6 → ON, N1 → L   2
7                a          L           -                 2
8                .          CS          W6 → ON, N1 → L   2
9                b          L           -                 2
10               .          CS          W6 → ON, N2 → R   1
11               C          R           -                 1
12               .          CS          W6 → ON, N1 → R   1
13               D          R           -                 1
14               .          CS          W6 → ON, N2 → R   1
15               c          L           -                 2
16               o          L           -                 2
17               m          L           -                 2
18               /          CS          W6 → ON, N2 → R   1
19               E          R           -                 1
20               /          CS          W6 → ON, N1 → R   1
21               F          R           -                 1
22               /          CS          W6 → ON, N2 → R   1
23               g          L           -                 2
24               /          CS          W6 → ON, N1 → L   2
25               h          L           -                 2
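
The rules and levels listed for this sample can be reproduced with a short sketch. This is not the tool's implementation. It assumes the input contains only the classes L, R, and CS (so W4 never applies and every CS falls through to W6), it treats the paragraph as a single level run, and the names resolve_levels and SAMPLE_CLASSES are hypothetical.

```python
# Bidi classes of the sample, copied from the Source table.
SAMPLE_CLASSES = (
    "L L L L CS CS CS L CS L CS R CS R CS L L L CS R CS R CS L CS L".split()
)


def resolve_levels(classes, base_level):
    """Resolve embedding levels for a paragraph made only of L, R and CS.

    Assumption: no digits or explicit embeddings occur, so only rules
    W6, N1, N2, I1 and I2 of the UBA are needed.
    """
    embedding = "R" if base_level % 2 else "L"

    # W6: a separator that was not absorbed by a number becomes ON.
    types = ["ON" if c == "CS" else c for c in classes]

    resolved = list(types)
    i, n = 0, len(types)
    while i < n:
        if types[i] != "ON":
            i += 1
            continue
        j = i
        while j < n and types[j] == "ON":
            j += 1
        # The paragraph edges act as strong characters of the embedding direction.
        before = types[i - 1] if i > 0 else embedding
        after = types[j] if j < n else embedding
        # N1: neutrals between two strong characters of the same direction
        # take that direction. N2: otherwise they take the embedding direction.
        direction = before if before == after else embedding
        for k in range(i, j):
            resolved[k] = direction
        i = j

    # I1 (even base level): R goes up one level.
    # I2 (odd base level): L goes up one level.
    raised = "L" if base_level % 2 else "R"
    return [base_level + (1 if t == raised else 0) for t in resolved]


if __name__ == "__main__":
    print(resolve_levels(SAMPLE_CLASSES, base_level=1))
```

With base level 1, the Latin letters and the separators between them land on level 2, while the R characters and every separator touching one stay on level 1.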

Reordered

Display Position   0  1  2  3  4  5  6  7  8  9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25
Memory Position   23 24 25 22 21 20 19 18 15 16 17 14 13 12 11 10  0  1  2  3  4  5  6  7  8  9
Character          g  /  h  /  F  /  E  /  c  o  m  .  D  .  C  .  h  t  t  p  :  /  /  a  .  b
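
The reordering shown here follows rule L2 of the UBA: from the highest level down to the lowest odd level, reverse every contiguous run of characters at that level or higher. Below is a minimal sketch under stated assumptions: the levels are the ones resolved for this sample, rule L1 (resetting trailing whitespace) is ignored, and the names reorder, TEXT, and LEVELS are hypothetical.

```python
TEXT = "http://a.b.C.D.com/E/F/g/h"
# Resolved levels for the sample, indexed by memory position.
LEVELS = [2] * 10 + [1] * 5 + [2] * 3 + [1] * 5 + [2] * 3


def reorder(levels):
    """Return the memory positions in display order (UBA rule L2 only)."""
    order = list(range(len(levels)))
    odd_levels = [lv for lv in levels if lv % 2 == 1]
    if not odd_levels:
        return order  # no odd level on the line: nothing is reversed

    for level in range(max(levels), min(odd_levels) - 1, -1):
        i = 0
        while i < len(order):
            if levels[order[i]] < level:
                i += 1
                continue
            # Find the end of the run at this level or higher, then reverse it.
            j = i
            while j < len(order) and levels[order[j]] >= level:
                j += 1
            order[i:j] = order[i:j][::-1]
            i = j
    return order


if __name__ == "__main__":
    display_order = reorder(LEVELS)
    print(display_order)
    print("".join(TEXT[i] for i in display_order))  # g/h/F/E/com.D.C.http://a.b
```

The level-2 runs are reversed first and the whole line is reversed second, so the Latin runs end up in their original left-to-right order while their sequence on the line is right to left.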

ASCII Hack

For testing the UBA with only ASCII characters, the following property values are used (< and > stand in for RLM and LRM, respectively):

Class  Characters
L      > a b c d e f g h i j k l m n o p q r s t u v w x y z
LRE    ]
LRO    }
R      < A B C D E F G H I J K L M
AL     N O P Q R S T U V W X Y Z
RLE    [
RLO    {
PDF    |
EN     0 1 2 3 4
ES     + -
ET     # $ %
AN     5 6 7 8 9
CS     , . / :
NSM    "
BN     _
B      «<control-000A>» «<control-000D>»
S      «<control-0009>» «<control-000B>»
WS     «<control-000C>» «SPACE»
ON     ! & ' ( ) * ; = ? @ \ ^ ` ~
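
The ASCII mapping listed here can be expressed as a lookup table, which is handy for building test inputs. This is a sketch only: the assignments are copied from the list above, while the names ASCII_BIDI_CLASS, ascii_bidi_class, and classify are hypothetical and not part of the tool.

```python
import string

ASCII_BIDI_CLASS = {}


def _assign(bidi_class, chars):
    """Record the test-only Bidi class for each character in chars."""
    for ch in chars:
        ASCII_BIDI_CLASS[ch] = bidi_class


_assign("L", ">" + string.ascii_lowercase)  # ">" stands in for LRM
_assign("LRE", "]")
_assign("LRO", "}")
_assign("R", "<" + "ABCDEFGHIJKLM")         # "<" stands in for RLM
_assign("AL", "NOPQRSTUVWXYZ")
_assign("RLE", "[")
_assign("RLO", "{")
_assign("PDF", "|")
_assign("EN", "01234")
_assign("ES", "+-")
_assign("ET", "#$%")
_assign("AN", "56789")
_assign("CS", ",./:")
_assign("NSM", '"')
_assign("BN", "_")
_assign("B", "\n\r")        # U+000A, U+000D
_assign("S", "\t\x0b")      # U+0009, U+000B
_assign("WS", "\x0c ")      # U+000C, SPACE
_assign("ON", "!&'()*;=?@\\^`~")


def ascii_bidi_class(ch):
    """Return the test-only Bidi class for one ASCII character."""
    return ASCII_BIDI_CLASS[ch]


def classify(text):
    """Return the list of test-only Bidi classes for an ASCII string."""
    return [ascii_bidi_class(ch) for ch in text]


if __name__ == "__main__":
    print(" ".join(classify("http://a.b.C.D.com/E/F/g/h")))
```

These are not the real Unicode property values: in actual Unicode data every ASCII letter has Bidi class L, so the mapping only makes sense inside a test harness.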

Fonts and Display. If you don't have a good set of Unicode fonts (and a modern browser), you may not be able to read some of the characters. Suggested fonts that add coverage include Unicode Fonts for Ancient Scripts, the Noto Fonts site, and Large, multi-script Unicode fonts. See also: Unicode Display Problems.

Version 3.7; ICU version: 56.0.1.0; Unicode version: 8.0.0.0