help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | idna | languageid
|LATIN SMALL LETTER S||CYRILLIC SMALL LETTER DZE|
|LATIN SMALL LETTER C||CYRILLIC SMALL LETTER ES|
|LATIN SMALL LETTER O||CYRILLIC SMALL LETTER O|
|LATIN SMALL LETTER P||CYRILLIC SMALL LETTER ER|
|LATIN SMALL LETTER E||CYRILLIC SMALL LETTER IE||CYRILLIC SMALL LETTER ABKHASIAN CHE|
Total raw values: 48
Total filtered values: 3
Confusable characters are those that may be confused with others (in some common UI fonts), such as the Latin letter "o" and the Greek letter omicron "ο". Fonts make a difference: for example, the Hebrew character "ס" looks confusingly similar to "o" in some fonts (such as Arial Hebrew), but not in others. See also unaccented Latin Characters..
The data for confusables and restrictions is from UTS39. You can suggest additions or changes to the Unicode data for future versions of that standard.
For more information on the use of the data, see proposed updates Unicode Security Mechanisms and Unicode Security Considerations.
The restrictions are purely on a character level. For a more detailed view, see idna.
The Unicode data is designed for testing, not enumerating, so not all combinations are generated in this demo; In particular, where a character is confusable with a sequence, not all combinations are generated.
Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Unicode Fonts for Ancient Scripts, Noto Fonts site, Large, multi-script Unicode fonts. See also: Unicode Display Problems.
Version 3.7; ICU version: 126.96.36.199; Unicode version: 188.8.131.52