Unicode Utilities: Character Property Index

Warning: Testing version with properties from ICU (Unicode 9.0), Unicode 10.0β, and emoji 6.0β.

help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | idna | languageid

CategoryDatatypeSourcePropertyValues
BidirectionalBinaryUCDBidi_ControlNo (N),
Yes (Y)
Bidi_ControlβNo (N),
Yes (Y)
Bidi_MirroredNo (N),
Yes (Y)
Bidi_MirroredβNo (N),
Yes (Y)
EnumeratedBidi_ClassShow Values
Bidi_ClassβShow Values
Bidi_Paired_Bracket_TypeClose (C),
None (N),
Open (O)
Bidi_Paired_Bracket_TypeβClose (c),
None (n),
Open (o)
StringBidi_Mirroring_GlyphShow Values
Bidi_Mirroring_GlyphβShow Values
Bidi_Paired_BracketShow Values
Bidi_Paired_BracketβShow Values
CaseBinaryICUCase_SensitiveNo (N),
Yes (Y)
UCDCase_IgnorableNo (N),
Yes (Y)
Case_IgnorableβNo (N),
Yes (Y)
CasedNo (N),
Yes (Y)
CasedβNo (N),
Yes (Y)
Changes_When_CasefoldedNo (N),
Yes (Y)
Changes_When_CasefoldedβNo (N),
Yes (Y)
Changes_When_CasemappedNo (N),
Yes (Y)
Changes_When_CasemappedβNo (N),
Yes (Y)
Changes_When_LowercasedNo (N),
Yes (Y)
Changes_When_LowercasedβNo (N),
Yes (Y)
Changes_When_TitlecasedNo (N),
Yes (Y)
Changes_When_TitlecasedβNo (N),
Yes (Y)
Changes_When_UppercasedNo (N),
Yes (Y)
Changes_When_UppercasedβNo (N),
Yes (Y)
LowercaseNo (N),
Yes (Y)
LowercaseβNo (N),
Yes (Y)
Soft_DottedNo (N),
Yes (Y)
Soft_DottedβNo (N),
Yes (Y)
UppercaseNo (N),
Yes (Y)
UppercaseβNo (N),
Yes (Y)
UnicodeisCasedNo (N),
Yes (Y)
isCasefoldedNo (N),
Yes (Y)
isLowercaseNo (N),
Yes (Y)
isTitlecaseNo (N),
Yes (Y)
isUppercaseNo (N),
Yes (Y)
StringUCDCase_FoldingShow Values
Case_FoldingβShow Values
Lowercase_MappingShow Values
Lowercase_MappingβShow Values
Simple_Case_FoldingShow Values
Simple_Case_FoldingβShow Values
Simple_Lowercase_MappingShow Values
Simple_Lowercase_MappingβShow Values
Simple_Titlecase_MappingShow Values
Simple_Titlecase_MappingβShow Values
Simple_Uppercase_MappingShow Values
Simple_Uppercase_MappingβShow Values
Titlecase_MappingShow Values
Titlecase_MappingβShow Values
Uppercase_MappingShow Values
Uppercase_MappingβShow Values
UnicodetoCasefoldShow Values
toLowercaseShow Values
toTitlecaseShow Values
toUppercaseShow Values
CJKBinaryUCDIDS_Binary_OperatorNo (N),
Yes (Y)
IDS_Binary_OperatorβNo (N),
Yes (Y)
IDS_Trinary_OperatorNo (N),
Yes (Y)
IDS_Trinary_OperatorβNo (N),
Yes (Y)
IdeographicNo (N),
Yes (Y)
IdeographicβNo (N),
Yes (Y)
RadicalNo (N),
Yes (Y)
RadicalβNo (N),
Yes (Y)
Unified_IdeographNo (N),
Yes (Y)
Unified_IdeographβNo (N),
Yes (Y)
EnumeratedX-DemoHanTypeHan, Hans, Hant,
na
StringUCDCJK_RadicalβShow Values
kSimplifiedVariantβShow Values
kTraditionalVariantβShow Values
EmojiBinaryUTSEmojiNo (N),
Yes (Y)
Emoji_ComponentβNo (No),
Yes (Yes)
Emoji_Flag_SequenceβNo (No),
Yes (Yes)
Emoji_Keycap_SequenceβNo (No),
Yes (Yes)
Emoji_ModifierNo (N),
Yes (Y)
Emoji_Modifier_BaseNo (N),
Yes (Y)
Emoji_Modifier_BaseβNo (No),
Yes (Yes)
Emoji_Modifier_SequenceβNo (No),
Yes (Yes)
Emoji_ModifierβNo (No),
Yes (Yes)
Emoji_PresentationNo (N),
Yes (Y)
Emoji_PresentationβNo (No),
Yes (Yes)
Emoji_Tag_SequenceβNo (No),
Yes (Yes)
Emoji_Zwj_SequenceβNo (No),
Yes (Yes)
EmojiβNo (No),
Yes (Yes)
EnumeratedUCDRegional_IndicatorβNo (N),
Yes (Y)
GeneralBinaryUCDAlphabeticNo (N),
Yes (Y)
AlphabeticβNo (N),
Yes (Y)
Default_Ignorable_Code_PointNo (N),
Yes (Y)
Default_Ignorable_Code_PointβNo (N),
Yes (Y)
DeprecatedNo (N),
Yes (Y)
DeprecatedβNo (N),
Yes (Y)
Logical_Order_ExceptionNo (N),
Yes (Y)
Logical_Order_ExceptionβNo (N),
Yes (Y)
Noncharacter_Code_PointNo (N),
Yes (Y)
Noncharacter_Code_PointβNo (N),
Yes (Y)
Variation_SelectorNo (N),
Yes (Y)
Variation_SelectorβNo (N),
Yes (Y)
White_SpaceNo (N),
Yes (Y)
White_SpaceβNo (N),
Yes (Y)
CatalogAgeShow Values
AgeβShow Values
BlockShow Values
BlockβShow Values
ScriptShow Values
ScriptβShow Values
EnumeratedGeneral_CategoryShow Values
General_CategoryβShow Values
Hangul_Syllable_TypeLeading_Jamo (L), LV_Syllable (LV), LVT_Syllable (LVT),
Not_Applicable (NA),
Trailing_Jamo (T),
Vowel_Jamo (V)
Hangul_Syllable_TypeβLeading_Jamo (L), LV_Syllable (LV), LVT_Syllable (LVT),
Not_Applicable (NA),
Trailing_Jamo (T),
Vowel_Jamo (V)
Name_AliasβShow Values
Named_Sequences_Provβ
Named_SequencesβShow Values
StringNameslistsubheadA. Man and his occupations, A. The human body and clothing, Aa. Unclassified, Abbreviation mark, Abbreviations involving liter symbols, Abkhaz letters, Accent marks, Accidental, Accommodation symbol, Addition for German typography, Addition for Kashmiri, Addition for Sanskrit, Addition for Torwali, Addition for UPA, Additional archaic letters for Bactrian, Additional diacritical mark for symbols, Additional historic vowel, Additional letter for Ossetian, Additional letters for Mingrelian and Svan, Additional letters for Ossetian and Abkhaz, Additional marks for UPA, Additional nasal vowels, Additional vowels for Sanskrit, Additions, Additions based on 1989 IPA, Additions for Africanist linguistics, Additions for Assamese, Additions for Burushaski, Additions for early Persian, Additions for early Persian and Azerbaijani, Additions for Greek, Additions for IPA, Additions for Khowar, Additions for Lithuanian dialectology, Additions for Livonian, Additions for Nivkh, Additions for Romanian, Additions for Sencoten, Additions for Shan Pali, Additions for Shona, Additions for Sinology, Additions for Slovenian and Croatian, Additions for the Uralic Phonetic Alphabet, Additions for Uighur, Afona or Ypostaseis (Mutes or Hypostases), African letters for clicks, Africanist tone letters, Agogika (Conduits), Aiton symbols and letters, Aleut letters, Alloioseis (Differentiators), Alternate consonant forms for Chinese, Alternate forms of punctuation, Alternate letters, Alternate vowel signs, Ampersand and ligature et ornaments, Analytics, Ancient Greek acrophonic numerals, Ancient Greek instrumental notation, Ancient Greek papyrological numbers, Ancient Greek symbols, Ancient Greek textual symbols, Ancient Greek vocalic notation, Ancient Near-Eastern linguistic symbol, Ancient Roman epigraphic letters, Angles, Animal faces, Animal symbols, Annotation marks, Anusvaras, APL, Apparatus, Arabic letter for Berber, Arabic letters for African languages, Arabic letters for Arwi, Arabic letters for Bravanese, Arabic letters for European and Central Asian languages, Arabic letters for Warsh orthography, Arabic pedagogical symbols, Arabic-Indic digits, Arc consonants, Archaic baseline tone marks, Archaic letters, Archaic letters for Ewe, Archaic letters for Volapük, Archaic lowercase syllable, Archaic punctuation, Archaic Roman numerals, Archaic uppercase syllable, Ardhavisarga, Argies (Retards), ARIB compatibility ideographs, Armenian ligatures, Arrow tails, Arrowheads, Arrows, Arrows combined with operators, Arrows combined with relations, Arrows with bent tips, Arrows with equilateral triangle arrowheads, Arrows with modifications, Arrows with triangle arrowheads, Articulation, ASCII digits, ASCII punctuation and symbols, Astrological digits, Astrological signs, Astronomical symbol, Attached affixes, Audio and video symbols, Augmentation dot, Azerbaijani letters,
B. Animals, B. Woman and her occupations, Back of card, Ballot symbols, Bamboo suit tiles, Bars, Based on GB 2312, Based on ISO 8859-6, Based on ISO 8859-8, Baseline tone marks, Basic consonants, Basic glyphs for Arabic language contextual forms, Basic letters, Basic nasal vowels, Basic Russian alphabet, Basic syllables, Beams and slurs, Beverage and food symbols, Beverage symbols, Biblical editorial symbol, Black arrow, Black circles, Black curved arrows, Black diamonds, Black lozenges, Black squares, Block elements, Body shapes, Bohairic Coptic letters, Bold digits, Bold Fraktur symbols, Bold Greek symbols, Bold italic Greek symbols, Bold italic symbols, Bold script symbols, Bold symbols, Bowtie symbols, Bracket pieces, Brackets, Brackets with ticks, Braille patterns, Breathing mark for the Samaveda, Bubble symbols, Building and map symbols,
C. Anthropomorphic deities, C. Plants, C0 controls, C1 controls, Candrabindu, Candrabindu and candra ornaments, Cantillation marks, Cantillation marks (svara) for the Samaveda, Capital letters, Capital letters (Khutsuri), Cat faces, Caucasian linguistics, Ceilings and floors, Celebration symbols, Centred geometric shapes, Cereals and plants, Character cell arcs, Character cell diagonals, Character suit tiles, Characters found through Phase A, Characters found through Phase B, Characters found through Phase C, Characters found through Phase D, Characters found through Phase E, Characters found through Phase F, Chemistry symbol, Chess symbols, Chillu letters, Chinantec tone marks, Chinook punctuation, Chukchi letters, Chuvash letters, Circle, Circle suit tiles, Circle symbols, Circle vowels, Circled arrows, Circled Hangul letters, Circled Hangul syllable, Circled Hangul syllables, Circled ideographs, Circled ideographs from ARIB STD B24, Circled italic Latin letters, Circled Katakana, Circled Korean words, Circled Latin letter sequences, Circled Latin letters, Circled numbers, Circled numbers on black squares from ARIB STD B24, Circled sans-serif digits, Circles, CJK angle brackets, CJK brackets, CJK corner brackets, CJK radicals supplement, CJK strokes, CJK symbols, CJK symbols and punctuation, Claudian letters, Clef, Clock face symbols, Clothing and accessories, Clubs or wands, Cluster-final consonants, Cluster-initial consonant, Cluster-initial letters, Cluster-specific consonant forms, Codas, Combining diacritical marks, Combining diacritical marks for symbols, Combining half marks, Combining half marks below, Combining letters, Combining maddah and hamza, Combining marks, Combining marks for Old Cyrillic, Combining number bases, Combining numeric signs, Combining stroke, Combining tone marks, Comic style symbols, Communication symbols, Complex signs, Complex signs with vase shapes, Compound vowels, Compressed arrows, Computer symbols, Conjunct letters, Conjunction and length marks, Consonant, Consonant addition for Tibetan, Consonant additions for Sanskrit, Consonant for Addu dialect, Consonant letters, Consonant modifiers, Consonant onsets, Consonant shifters, Consonant signs, Consonants and consonant signs too many values to show
UCDNameShow Values
NameβShow Values
Script_ExtensionsShow Values
Script_ExtensionsβShow Values
IdentifiersBinaryUCDID_ContinueNo (N),
Yes (Y)
ID_ContinueβNo (N),
Yes (Y)
ID_StartNo (N),
Yes (Y)
ID_StartβNo (N),
Yes (Y)
Pattern_SyntaxNo (N),
Yes (Y)
Pattern_SyntaxβNo (N),
Yes (Y)
Pattern_White_SpaceNo (N),
Yes (Y)
Pattern_White_SpaceβNo (N),
Yes (Y)
XID_ContinueNo (N),
Yes (Y)
XID_ContinueβNo (N),
Yes (Y)
XID_StartNo (N),
Yes (Y)
XID_StartβNo (N),
Yes (Y)
IDNAEnumeratedUTSIdn_2008βna (na),
NV8 (nv8),
XV8 (xv8)
Idn_Statusβdeviation (dv), disallowed (da), disallowed_STD3_mapped (ds3m), disallowed_STD3_valid (ds3v),
ignored (i),
mapped (m),
valid (v)
idna2003deviation, disallowed,
ignored,
mapped,
valid
idna2008CONTEXTJ, CONTEXTO,
DISALLOWED,
PVALID,
UNASSIGNED
idna2008cdeviation, disallowed,
ignored,
mapped,
valid
uts46deviation, disallowed,
ignored,
mapped,
valid
StringIdn_MappingβShow Values
toIdna2003Show Values
toUts46nShow Values
toUts46tShow Values
MiscellaneousBinaryUCDDashNo (N),
Yes (Y)
DashβNo (N),
Yes (Y)
DiacriticNo (N),
Yes (Y)
DiacriticβNo (N),
Yes (Y)
ExtenderNo (N),
Yes (Y)
ExtenderβNo (N),
Yes (Y)
Grapheme_BaseNo (N),
Yes (Y)
Grapheme_ExtendNo (N),
Yes (Y)
Grapheme_LinkNo (N),
Yes (Y)
HyphenNo (N),
Yes (Y)
MathNo (N),
Yes (Y)
MathβNo (N),
Yes (Y)
Quotation_MarkNo (N),
Yes (Y)
Quotation_MarkβNo (N),
Yes (Y)
Sentence_TerminalNo (N),
Yes (Y)
Sentence_TerminalβNo (N),
Yes (Y)
Terminal_PunctuationNo (N),
Yes (Y)
Terminal_PunctuationβNo (N),
Yes (Y)
EnumeratedIndic_Positional_CategoryβShow Values
Indic_Syllabic_CategoryβShow Values
MiscellaneousISO_CommentShow Values
Unicode_1_NameShow Values
NormalizationBinaryICUNFC_InertNo (N),
Yes (Y)
NFD_InertNo (N),
Yes (Y)
NFKC_InertNo (N),
Yes (Y)
NFKD_InertNo (N),
Yes (Y)
isNFMNo,
Yes
UCDChanges_When_NFKC_CasefoldedNo (N),
Yes (Y)
Changes_When_NFKC_CasefoldedβNo (N),
Yes (Y)
Full_Composition_ExclusionNo (N),
Yes (Y)
UnicodeisNFCNo,
Yes
isNFDNo,
Yes
isNFKCNo,
Yes
isNFKDNo,
Yes
EnumeratedICULead_Canonical_Combining_ClassShow Values
Trail_Canonical_Combining_ClassShow Values
UCDCanonical_Combining_ClassShow Values
Canonical_Combining_ClassβShow Values
Decomposition_TypeShow Values
Decomposition_TypeβShow Values
NFC_Quick_CheckMaybe (M),
No (N),
Yes (Y)
NFC_Quick_CheckβMaybe (M),
No (N),
Yes (Y)
NFD_Quick_CheckNo (N),
Yes (Y)
NFD_Quick_CheckβNo (N),
Yes (Y)
NFKC_Quick_CheckMaybe (M),
No (N),
Yes (Y)
NFKC_Quick_CheckβMaybe (M),
No (N),
Yes (Y)
NFKD_Quick_CheckNo (N),
Yes (Y)
NFKD_Quick_CheckβNo (N),
Yes (Y)
StringICUtoNFMShow Values
UCDNFKC_CasefoldShow Values
NFKC_CasefoldβShow Values
UnicodetoNFCShow Values
toNFDShow Values
toNFKCShow Values
toNFKDShow Values
NumericBinaryUCDASCII_Hex_DigitNo (N),
Yes (Y)
ASCII_Hex_DigitβNo (N),
Yes (Y)
Hex_DigitNo (N),
Yes (Y)
Hex_DigitβNo (N),
Yes (Y)
EnumeratedNumeric_TypeDecimal (De), Digit (Di),
None (None), Numeric (Nu)
Numeric_TypeβDecimal (De), Digit (Di),
None (None), Numeric (Nu)
kAccountingNumericβShow Values
kOtherNumericβShow Values
kPrimaryNumericβShow Values
NumericNumeric_ValueShow Values
Numeric_ValueβShow Values
RegexBinaryUTSANYNo,
Yes
ASCIINo,
Yes
alnumNo (N),
Yes (Y)
blankNo (N),
Yes (Y)
bmpNo,
Yes
graphNo (N),
Yes (Y)
printNo (N),
Yes (Y)
xdigitNo (N),
Yes (Y)
SecurityEnumeratedUTSConfusable_MAβShow Values
Identifier_StatusβAllowed (a),
Restricted (r)
Identifier_TypeβShow Values
Shaping and RenderingBinaryICUSegment_StarterNo (N),
Yes (Y)
UCDJoin_ControlNo (N),
Yes (Y)
Join_ControlβNo (N),
Yes (Y)
EnumeratedEast_Asian_WidthAmbiguous (A),
Fullwidth (F),
Halfwidth (H),
Narrow (Na), Neutral (N),
Wide (W)
East_Asian_WidthβAmbiguous (A),
Fullwidth (F),
Halfwidth (H),
Narrow (Na), Neutral (N),
Wide (W)
Grapheme_Cluster_BreakShow Values
Grapheme_Cluster_BreakβShow Values
Joining_GroupShow Values
Joining_GroupβShow Values
Joining_TypeDual_Joining (D),
Join_Causing (C),
Left_Joining (L),
Non_Joining (U),
Right_Joining (R),
Transparent (T)
Joining_TypeβDual_Joining (D),
Join_Causing (C),
Left_Joining (L),
Non_Joining (U),
Right_Joining (R),
Transparent (T)
Line_BreakShow Values
Line_BreakβShow Values
Prepended_Concatenation_MarkβNo (N),
Yes (Y)
Sentence_BreakShow Values
Sentence_BreakβShow Values
Standardized_VariantβShow Values
Vertical_OrientationβRotated (R),
Transformed_Rotated (Tr), Transformed_Upright (Tu),
Upright (U)
Word_BreakShow Values
Word_BreakβShow Values
UCABinaryUTSucaShow Values
uca2Show Values
uca2.5Show Values
uca3Show Values
Z-OtherOtherOtherExtended_PictographicβOther

Key

The Categories are from UCD Table 8. Property Summary Table, with some extended categories: Emoji, IDNA, Regex, Security, and UCA.

The Datatypes are from UCD Table 5. Property Type Key.

The Sources are:


Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Noto Fonts site, Unicode Fonts for Ancient Scripts, Large, multi-script Unicode fonts. See also: Unicode Display Problems.

Version 3.8; ICU version: 59.1.0.0; Unicode version: 9.0.0.0