Unicode Utilities: Confusables

Properties use ICU for Unicode V11.0; the beta properties support Unicode V12.0β. For more information, see Unicode Utilities Beta.


With this demo, you can supply an input string and see the combinations that are confusable with it, using data collected by the Unicode Consortium. You can also try different restrictions, using characters valid in different approaches to internationalized domain names. For more information, see Data below.
  

Confusable Characters

\ 𝈏 𝈻                                                              
005C  REVERSE SOLIDUS
2216  SET MINUS
27CD  MATHEMATICAL FALLING DIAGONAL
29F5  REVERSE SOLIDUS OPERATOR
29F9  BIG REVERSE SOLIDUS
2F02  KANGXI RADICAL DOT
31D4  CJK STROKE D
4E36  CJK UNIFIED IDEOGRAPH-4E36
1D20F  GREEK VOCAL NOTATION SYMBOL-16
1D23B  GREEK INSTRUMENTAL NOTATION SYMBOL-48
FE68  SMALL REVERSE SOLIDUS
FF3C  FULLWIDTH REVERSE SOLIDUS
/ 丿 𝈺                                                           
002F  SOLIDUS
1735  PHILIPPINE SINGLE PUNCTUATION
2041  CARET INSERTION POINT
2044  FRACTION SLASH
2215  DIVISION SLASH
2571  BOX DRAWINGS LIGHT DIAGONAL UPPER RIGHT TO LOWER LEFT
27CB  MATHEMATICAL RISING DIAGONAL
29F8  BIG SOLIDUS
2CC6  COPTIC CAPITAL LETTER OLD COPTIC ESH
2F03  KANGXI RADICAL SLASH
3033  VERTICAL KANA REPEAT MARK UPPER HALF
30CE  KATAKANA LETTER NO
31D3  CJK STROKE SP
4E3F  CJK UNIFIED IDEOGRAPH-4E3F
1D23A  GREEK INSTRUMENTAL NOTATION SYMBOL-47
: ː ˸ ։ ׃ ܃ ܄                                                        
003A  COLON
02D0  MODIFIER LETTER TRIANGULAR COLON
02F8  MODIFIER LETTER RAISED COLON
0589  ARMENIAN FULL STOP
05C3  HEBREW PUNCTUATION SOF PASUQ
0703  SYRIAC SUPRALINEAR COLON
0704  SYRIAC SUBLINEAR COLON
0903  DEVANAGARI SIGN VISARGA
0A83  GUJARATI SIGN VISARGA
16EC  RUNIC MULTIPLE PUNCTUATION
1803  MONGOLIAN FULL STOP
1809  MONGOLIAN MANCHU FULL STOP
205A  TWO DOT PUNCTUATION
2236  RATIO
A4FD  LISU LETTER TONE MYA JEU
A789  MODIFIER LETTER COLON
FE30  PRESENTATION FORM FOR VERTICAL TWO DOT LEADER
FF1A  FULLWIDTH COLON
* ٭ 𐌟                                                                     
002A  ASTERISK
066D  ARABIC FIVE POINTED STAR
204E  LOW ASTERISK
2217  ASTERISK OPERATOR
1031F  OLD ITALIC LETTER ESS
? Ɂ ʔ                                                                    
003F  QUESTION MARK
0241  LATIN CAPITAL LETTER GLOTTAL STOP
0294  LATIN LETTER GLOTTAL STOP
097D  DEVANAGARI LETTER GLOTTAL STOP
13AE  CHEROKEE LETTER HE
A6EB  BAMUM LETTER NTUU
1 I l | Ɩ ǀ Ι І Ӏ ׀ ו ן ا ١ ۱ ߊ 𐊊 𐌉 𐌠 𖼨 𝐈 𝐥 𝐼 𝑙 𝑰 𝒍 𝓁 𝓘 𝓵 𝔩 𝕀 𝕝 𝕴 𝖑 𝖨 𝗅 𝗜 𝗹 𝘐 𝘭 𝙄 𝙡 𝙸 𝚕 𝚰 𝛪 𝜤 𝝞 𝞘 𝟏 𝟙 𝟣 𝟭 𝟷 𞣇 𞸀 𞺀
0031  DIGIT ONE
0049  LATIN CAPITAL LETTER I
006C  LATIN SMALL LETTER L
007C  VERTICAL LINE
0196  LATIN CAPITAL LETTER IOTA
01C0  LATIN LETTER DENTAL CLICK
0399  GREEK CAPITAL LETTER IOTA
0406  CYRILLIC CAPITAL LETTER BYELORUSSIAN-UKRAINIAN I
04C0  CYRILLIC LETTER PALOCHKA
05C0  HEBREW PUNCTUATION PASEQ
05D5  HEBREW LETTER VAV
05DF  HEBREW LETTER FINAL NUN
0627  ARABIC LETTER ALEF
0661  ARABIC-INDIC DIGIT ONE
06F1  EXTENDED ARABIC-INDIC DIGIT ONE
07CA  NKO LETTER A
16C1  RUNIC LETTER ISAZ IS ISS I
2110  SCRIPT CAPITAL I
2111  BLACK-LETTER CAPITAL I
2113  SCRIPT SMALL L
2160  ROMAN NUMERAL ONE
217C  SMALL ROMAN NUMERAL FIFTY
2223  DIVIDES
23FD  POWER ON SYMBOL
2C92  COPTIC CAPITAL LETTER IAUDA
2D4F  TIFINAGH LETTER YAN
A4F2  LISU LETTER I
1028A  LYCIAN LETTER J
10309  OLD ITALIC LETTER I
10320  OLD ITALIC NUMERAL ONE
16F28  MIAO LETTER GHA
1D408  MATHEMATICAL BOLD CAPITAL I
1D425  MATHEMATICAL BOLD SMALL L
1D43C  MATHEMATICAL ITALIC CAPITAL I
1D459  MATHEMATICAL ITALIC SMALL L
1D470  MATHEMATICAL BOLD ITALIC CAPITAL I
1D48D  MATHEMATICAL BOLD ITALIC SMALL L
1D4C1  MATHEMATICAL SCRIPT SMALL L
1D4D8  MATHEMATICAL BOLD SCRIPT CAPITAL I
1D4F5  MATHEMATICAL BOLD SCRIPT SMALL L
1D529  MATHEMATICAL FRAKTUR SMALL L
1D540  MATHEMATICAL DOUBLE-STRUCK CAPITAL I
1D55D  MATHEMATICAL DOUBLE-STRUCK SMALL L
1D574  MATHEMATICAL BOLD FRAKTUR CAPITAL I
1D591  MATHEMATICAL BOLD FRAKTUR SMALL L
1D5A8  MATHEMATICAL SANS-SERIF CAPITAL I
1D5C5  MATHEMATICAL SANS-SERIF SMALL L
1D5DC  MATHEMATICAL SANS-SERIF BOLD CAPITAL I
1D5F9  MATHEMATICAL SANS-SERIF BOLD SMALL L
1D610  MATHEMATICAL SANS-SERIF ITALIC CAPITAL I
1D62D  MATHEMATICAL SANS-SERIF ITALIC SMALL L
1D644  MATHEMATICAL SANS-SERIF BOLD ITALIC CAPITAL I
1D661  MATHEMATICAL SANS-SERIF BOLD ITALIC SMALL L
1D678  MATHEMATICAL MONOSPACE CAPITAL I
1D695  MATHEMATICAL MONOSPACE SMALL L
1D6B0  MATHEMATICAL BOLD CAPITAL IOTA
1D6EA  MATHEMATICAL ITALIC CAPITAL IOTA
1D724  MATHEMATICAL BOLD ITALIC CAPITAL IOTA
1D75E  MATHEMATICAL SANS-SERIF BOLD CAPITAL IOTA
1D798  MATHEMATICAL SANS-SERIF BOLD ITALIC CAPITAL IOTA
1D7CF  MATHEMATICAL BOLD DIGIT ONE
1D7D9  MATHEMATICAL DOUBLE-STRUCK DIGIT ONE
1D7E3  MATHEMATICAL SANS-SERIF DIGIT ONE
1D7ED  MATHEMATICAL SANS-SERIF BOLD DIGIT ONE
1D7F7  MATHEMATICAL MONOSPACE DIGIT ONE
1E8C7  MENDE KIKAKUI DIGIT ONE
1EE00  ARABIC MATHEMATICAL ALEF
1EE80  ARABIC MATHEMATICAL LOOPED ALEF
FE8D  ARABIC LETTER ALEF ISOLATED FORM
FE8E  ARABIC LETTER ALEF FINAL FORM
FF29  FULLWIDTH LATIN CAPITAL LETTER I
FF4C  FULLWIDTH LATIN SMALL LETTER L
FFE8  HALFWIDTH FORMS LIGHT VERTICAL
< ˂ 𝈶                                                                   
003C  LESS-THAN SIGN
02C2  MODIFIER LETTER LEFT ARROWHEAD
1438  CANADIAN SYLLABICS PA
16B2  RUNIC LETTER KAUNA
2039  SINGLE LEFT-POINTING ANGLE QUOTATION MARK
276E  HEAVY LEFT-POINTING ANGLE QUOTATION MARK ORNAMENT
1D236  GREEK INSTRUMENTAL NOTATION SYMBOL-40
> ˃ 𖼿 𝈷                                                                   
003E  GREATER-THAN SIGN
02C3  MODIFIER LETTER RIGHT ARROWHEAD
1433  CANADIAN SYLLABICS PO
203A  SINGLE RIGHT-POINTING ANGLE QUOTATION MARK
276F  HEAVY RIGHT-POINTING ANGLE QUOTATION MARK ORNAMENT
16F3F  MIAO LETTER ARCHAIC ZZA
1D237  GREEK INSTRUMENTAL NOTATION SYMBOL-42
" '' ʺ ˝ ˮ ˶ ײ ״                                                          
0022  QUOTATION MARK
0027,0027  APOSTROPHE + APOSTROPHE
02BA  MODIFIER LETTER DOUBLE PRIME
02DD  DOUBLE ACUTE ACCENT
02EE  MODIFIER LETTER DOUBLE APOSTROPHE
02F6  MODIFIER LETTER MIDDLE DOUBLE ACUTE ACCENT
05F2  HEBREW LIGATURE YIDDISH DOUBLE YOD
05F4  HEBREW PUNCTUATION GERSHAYIM
1CD3  VEDIC SIGN NIHSHVASA
201C  LEFT DOUBLE QUOTATION MARK
201D  RIGHT DOUBLE QUOTATION MARK
201F  DOUBLE HIGH-REVERSED-9 QUOTATION MARK
2033  DOUBLE PRIME
2036  REVERSED DOUBLE PRIME
3003  DITTO MARK
FF02  FULLWIDTH QUOTATION MARK

Total raw values: 5,562,950,400

Too many raw items to process.
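
The total reported above matches the product of the number of alternatives listed for each input character. The page does not state how the figure is computed, so the following is only a sketch under that assumption. The counts were taken from the code point rows above, and the dictionary keys are simply the first character shown in each row.

```python
from math import prod

# Number of alternatives listed above for each input character,
# counted from the code point rows (the character itself included).
# Keys are the first character of each row, used here only as labels.
set_sizes = {
    "\\": 12,
    "/": 15,
    ":": 18,
    "*": 5,
    "?": 6,
    "1": 73,
    "<": 7,
    ">": 7,
    '"': 16,
}

# Every position can be swapped independently, so the counts multiply.
total = prod(set_sizes.values())
print(f"{total:,}")  # 5,562,950,400
```

Because each position varies independently, a nine-character input already produces billions of candidate strings, which is why the demo declines to list them.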


Data

Confusable characters are those that may be confused with others (in some common UI fonts), such as the Latin letter "o" and the Greek letter omicron "ο". Fonts make a difference: for example, the Hebrew character "ס" looks confusingly similar to "o" in some fonts (such as Arial Hebrew), but not in others. See also unaccented Latin Characters.
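
The three characters mentioned above look alike in some fonts but are distinct code points. A small sketch using Python's standard `unicodedata` module makes the difference visible; the helper name `describe` is invented for this illustration.

```python
import unicodedata

def describe(ch: str) -> str:
    """Return the code point and Unicode name of a single character."""
    return f"U+{ord(ch):04X} {unicodedata.name(ch)}"

# Latin o, Greek omicron, Hebrew samekh
for ch in ["o", "\u03bf", "\u05e1"]:
    print(describe(ch))
```

Comparing code points or names in this way shows that two strings differ even when they render identically on screen.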

The data for confusables and restrictions is from UTS39. You can suggest additions or changes to the Unicode data for future versions of that standard.

For more information on the use of the data, see the proposed updates to Unicode Security Mechanisms and Unicode Security Considerations.

The restrictions apply purely at the character level. For a more detailed view, see idna.

Caveats

The Unicode data is designed for testing, not enumerating, so this demo does not generate every combination. In particular, where a character is confusable with a sequence, not all combinations are generated.
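
To illustrate what "testing, not enumerating" can look like in practice, the sketch below compares two strings by reducing each to a skeleton, in the spirit of the skeleton operation described in UTS39 (normalize to NFD, map each character to a prototype, normalize again). The `PROTOTYPE` table is a tiny hand-picked subset of the rows on this page, and mapping each character to the first character of its row is an assumption made for illustration; the real data file may choose different prototypes and covers far more characters.

```python
import unicodedata

# Illustrative subset only, NOT the full UTS39 data.
# Each character maps to the first character of its row on this page,
# used here as a stand-in prototype (an assumption of this sketch).
PROTOTYPE = {
    "\u2216": "\\",  # SET MINUS
    "\uFF3C": "\\",  # FULLWIDTH REVERSE SOLIDUS
    "\u2044": "/",   # FRACTION SLASH
    "\u2215": "/",   # DIVISION SLASH
    "\u2236": ":",   # RATIO
    "\uFF1A": ":",   # FULLWIDTH COLON
    "\u2217": "*",   # ASTERISK OPERATOR
}

def skeleton(text: str) -> str:
    """Normalize to NFD, replace mapped characters, normalize again."""
    decomposed = unicodedata.normalize("NFD", text)
    mapped = "".join(PROTOTYPE.get(ch, ch) for ch in decomposed)
    return unicodedata.normalize("NFD", mapped)

def confusable(a: str, b: str) -> bool:
    """Two strings are treated as confusable if their skeletons match."""
    return skeleton(a) == skeleton(b)

print(confusable("a/b", "a\u2215b"))  # True
print(confusable("a/b", "a\\b"))      # False
```

A test like this costs one pass over each string, whereas enumerating every confusable variant grows multiplicatively with the length of the input.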



Fonts and Display. If you don't have a good set of Unicode fonts (and a modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Noto Fonts site, Unicode Fonts for Ancient Scripts, Large, multi-script Unicode fonts. See also: Unicode Display Problems.

Version 3.9; ICU version: 63.1; Unicode version: 11.0; Unicodeβ version: 12.0;