Unicode Utilities: Confusables

Properties use ICU for Unicode V11.0; the beta properties support Unicode V12.0β. For more information, see Unicode Utilities Beta.

help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | bidi-c | idna | languageid

Input With this demo, you can supply an Input string and see the combinations that are confusable with it, using data collected by the Unicode consortium. You can also try different restrictions, using characters valid in different approaches to international domain names. For more info, see Data below.
  

Confusable Characters

6 б 𑣕 𝟔 𝟞 𝟨 𝟲 𝟼           
0036043113EE2CD2118D51D7D41D7DE1D7E81D7F21D7FC
DIGIT SIXCYRILLIC SMALL LETTER BECHEROKEE LETTER WVCOPTIC CAPITAL LETTER OLD COPTIC HEIWARANG CITI SMALL LETTER ATMATHEMATICAL BOLD DIGIT SIXMATHEMATICAL DOUBLE-STRUCK DIGIT SIXMATHEMATICAL SANS-SERIF DIGIT SIXMATHEMATICAL SANS-SERIF BOLD DIGIT SIXMATHEMATICAL MONOSPACE DIGIT SIX
r г 𝐫 𝑟 𝒓 𝓇 𝓻 𝔯 𝕣 𝖗 𝗋 𝗿 𝘳 𝙧 𝚛
007204331D262C85AB47AB48AB811D42B1D45F1D4931D4C71D4FB1D52F1D5631D5971D5CB1D5FF1D6331D6671D69B
LATIN SMALL LETTER RCYRILLIC SMALL LETTER GHEGREEK LETTER SMALL CAPITAL GAMMACOPTIC SMALL LETTER GAMMALATIN SMALL LETTER R WITHOUT HANDLELATIN SMALL LETTER DOUBLE RCHEROKEE SMALL LETTER HUMATHEMATICAL BOLD SMALL RMATHEMATICAL ITALIC SMALL RMATHEMATICAL BOLD ITALIC SMALL RMATHEMATICAL SCRIPT SMALL RMATHEMATICAL BOLD SCRIPT SMALL RMATHEMATICAL FRAKTUR SMALL RMATHEMATICAL DOUBLE-STRUCK SMALL RMATHEMATICAL BOLD FRAKTUR SMALL RMATHEMATICAL SANS-SERIF SMALL RMATHEMATICAL SANS-SERIF BOLD SMALL RMATHEMATICAL SANS-SERIF ITALIC SMALL RMATHEMATICAL SANS-SERIF BOLD ITALIC SMALL RMATHEMATICAL MONOSPACE SMALL R

Total raw values: 200

Confusable Results

б𝔯 бꮁ б𝒓 б𝘳 бⲅ бᴦ бꭇ бꭈ б𝓇 б𝙧 б𝐫 б𝗋 б𝑟 б𝗿 бr бг б𝕣 б𝖗 б𝓻 б𝚛 𝟞𝔯 𝟞ꮁ 𝟞𝒓 𝟞𝘳 𝟞ⲅ 𝟞ᴦ 𝟞ꭇ 𝟞ꭈ 𝟞𝓇 𝟞𝙧 𝟞𝐫 𝟞𝗋 𝟞𝑟 𝟞𝗿 𝟞r 𝟞г 𝟞𝕣 𝟞𝖗 𝟞𝓻 𝟞𝚛 Ⳓ𝔯 Ⳓꮁ Ⳓ𝒓 Ⳓ𝘳 Ⳓⲅ Ⳓᴦ Ⳓꭇ Ⳓꭈ Ⳓ𝓇 Ⳓ𝙧 Ⳓ𝐫 Ⳓ𝗋 Ⳓ𝑟 Ⳓ𝗿 Ⳓr Ⳓг Ⳓ𝕣 Ⳓ𝖗 Ⳓ𝓻 Ⳓ𝚛 𝟔𝔯 𝟔ꮁ 𝟔𝒓 𝟔𝘳 𝟔ⲅ 𝟔ᴦ 𝟔ꭇ 𝟔ꭈ 𝟔𝓇 𝟔𝙧 𝟔𝐫 𝟔𝗋 𝟔𝑟 𝟔𝗿 𝟔r 𝟔г 𝟔𝕣 𝟔𝖗 𝟔𝓻 𝟔𝚛 𑣕𝔯 𑣕ꮁ 𑣕𝒓 𑣕𝘳 𑣕ⲅ 𑣕ᴦ 𑣕ꭇ 𑣕ꭈ 𑣕𝓇 𑣕𝙧 𑣕𝐫 𑣕𝗋 𑣕𝑟 𑣕𝗿 𑣕r 𑣕г 𑣕𝕣 𑣕𝖗 𑣕𝓻 𑣕𝚛 6𝔯 6ꮁ 6𝒓 6𝘳 6ⲅ 6ᴦ 6ꭇ 6ꭈ 6𝓇 6𝙧 6𝐫 6𝗋 6𝑟 6𝗿 6r 6г 6𝕣 6𝖗 6𝓻 6𝚛 𝟲𝔯 𝟲ꮁ 𝟲𝒓 𝟲𝘳 𝟲ⲅ 𝟲ᴦ 𝟲ꭇ 𝟲ꭈ 𝟲𝓇 𝟲𝙧 𝟲𝐫 𝟲𝗋 𝟲𝑟 𝟲𝗿 𝟲r 𝟲г 𝟲𝕣 𝟲𝖗 𝟲𝓻 𝟲𝚛 𝟨𝔯 𝟨ꮁ 𝟨𝒓 𝟨𝘳 𝟨ⲅ 𝟨ᴦ 𝟨ꭇ 𝟨ꭈ 𝟨𝓇 𝟨𝙧 𝟨𝐫 𝟨𝗋 𝟨𝑟 𝟨𝗿 𝟨r 𝟨г 𝟨𝕣 𝟨𝖗 𝟨𝓻 𝟨𝚛 𝟼𝔯 𝟼ꮁ 𝟼𝒓 𝟼𝘳 𝟼ⲅ 𝟼ᴦ 𝟼ꭇ 𝟼ꭈ 𝟼𝓇 𝟼𝙧 𝟼𝐫 𝟼𝗋 𝟼𝑟 𝟼𝗿 𝟼r 𝟼г 𝟼𝕣 𝟼𝖗 𝟼𝓻 𝟼𝚛 Ꮾ𝔯 Ꮾꮁ Ꮾ𝒓 Ꮾ𝘳 Ꮾⲅ Ꮾᴦ Ꮾꭇ Ꮾꭈ Ꮾ𝓇 Ꮾ𝙧 Ꮾ𝐫 Ꮾ𝗋 Ꮾ𝑟 Ꮾ𝗿 Ꮾr Ꮾг Ꮾ𝕣 Ꮾ𝖗 Ꮾ𝓻 Ꮾ𝚛

Total filtered values: 200


Data

Confusable characters are those that may be confused with others (in some common UI fonts), such as the Latin letter "o" and the Greek letter omicron "ο". Fonts make a difference: for example, the Hebrew character "ס" looks confusingly similar to "o" in some fonts (such as Arial Hebrew), but not in others. See also unaccented Latin Characters..

The data for confusables and restrictions is from UTS39. You can suggest additions or changes to the Unicode data for future versions of that standard.

For more information on the use of the data, see proposed updates Unicode Security Mechanisms and Unicode Security Considerations.

The restrictions are purely on a character level. For a more detailed view, see idna.

Caveats

The Unicode data is designed for testing, not enumerating, so not all combinations are generated in this demo; In particular, where a character is confusable with a sequence, not all combinations are generated.



Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Noto Fonts site, Unicode Fonts for Ancient Scripts, Large, multi-script Unicode fonts. See also: Unicode Display Problems.

Version 3.9; ICU version: 63.1; Unicode version: 11.0; Unicodeβ version: 12.0;