Unicode Utilities: Confusables

help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | bidi-c | idna | languageid

Input With this demo, you can supply an Input string and see the combinations that are confusable with it, using data collected by the Unicode consortium. You can also try different restrictions, using characters valid in different approaches to international domain names. For more info, see Data below.
  

Confusable Characters

A Α А 𐊠 𖽀 𝐀 𝐴 𝑨 𝒜 𝓐 𝔄 𝔸 𝕬 𝖠 𝗔 𝘈 𝘼 𝙰 𝚨 𝛢 𝜜 𝝖 𝞐
00410391041013AA15C5A4EE102A016F401D4001D4341D4681D49C1D4D01D5041D5381D56C1D5A01D5D41D6081D63C1D6701D6A81D6E21D71C1D7561D790FF21
LATIN CAPITAL LETTER AGREEK CAPITAL LETTER ALPHACYRILLIC CAPITAL LETTER ACHEROKEE LETTER GOCANADIAN SYLLABICS CARRIER GHOLISU LETTER ACARIAN LETTER AMIAO LETTER ZZYAMATHEMATICAL BOLD CAPITAL AMATHEMATICAL ITALIC CAPITAL AMATHEMATICAL BOLD ITALIC CAPITAL AMATHEMATICAL SCRIPT CAPITAL AMATHEMATICAL BOLD SCRIPT CAPITAL AMATHEMATICAL FRAKTUR CAPITAL AMATHEMATICAL DOUBLE-STRUCK CAPITAL AMATHEMATICAL BOLD FRAKTUR CAPITAL AMATHEMATICAL SANS-SERIF CAPITAL AMATHEMATICAL SANS-SERIF BOLD CAPITAL AMATHEMATICAL SANS-SERIF ITALIC CAPITAL AMATHEMATICAL SANS-SERIF BOLD ITALIC CAPITAL AMATHEMATICAL MONOSPACE CAPITAL AMATHEMATICAL BOLD CAPITAL ALPHAMATHEMATICAL ITALIC CAPITAL ALPHAMATHEMATICAL BOLD ITALIC CAPITAL ALPHAMATHEMATICAL SANS-SERIF BOLD CAPITAL ALPHAMATHEMATICAL SANS-SERIF BOLD ITALIC CAPITAL ALPHAFULLWIDTH LATIN CAPITAL LETTER A

Total raw values: 27

Confusable Results

𝙰 A 𝐴 ᗅ 𝑨 𝚨 𝕬 𝖠 А Α 𝛢 𝝖 𝒜 𝜜 𖽀 𝓐 𝞐 A 𝗔 𝘈 Ꭺ ꓮ 𝐀 𝔄 𝔸 𐊠 𝘼

Total filtered values: 27


Data

Confusable characters are those that may be confused with others (in some common UI fonts), such as the Latin letter "o" and the Greek letter omicron "ο". Fonts make a difference: for example, the Hebrew character "ס" looks confusingly similar to "o" in some fonts (such as Arial Hebrew), but not in others. See also unaccented Latin Characters..

The data for confusables and restrictions is from UTS39. You can suggest additions or changes to the Unicode data for future versions of that standard.

For more information on the use of the data, see proposed updates Unicode Security Mechanisms and Unicode Security Considerations.

The restrictions are purely on a character level. For a more detailed view, see idna.

Caveats

The Unicode data is designed for testing, not enumerating, so not all combinations are generated in this demo; In particular, where a character is confusable with a sequence, not all combinations are generated.



Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Noto Fonts site, Unicode Fonts for Ancient Scripts, Large, multi-script Unicode fonts. See also: Unicode Display Problems.

Version 3.9; ICU version: 63.1; Unicode version: 12.0;