Unicode Utilities: Character Property Index

help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | idna | languageid

CategoryDatatypeSourcePropertyValues
BidirectionalBinaryUCDBidi_ControlNo (N),
Yes (Y)
Bidi_MirroredNo (N),
Yes (Y)
EnumeratedBidi_ClassShow Values
Bidi_Paired_Bracket_TypeClose (C),
None (N),
Open (O)
StringBidi_Mirroring_GlyphShow Values
Bidi_Paired_BracketShow Values
CaseBinaryICUCase_SensitiveNo (N),
Yes (Y)
UCDCase_IgnorableNo (N),
Yes (Y)
CasedNo (N),
Yes (Y)
Changes_When_CasefoldedNo (N),
Yes (Y)
Changes_When_CasemappedNo (N),
Yes (Y)
Changes_When_LowercasedNo (N),
Yes (Y)
Changes_When_TitlecasedNo (N),
Yes (Y)
Changes_When_UppercasedNo (N),
Yes (Y)
LowercaseNo (N),
Yes (Y)
Soft_DottedNo (N),
Yes (Y)
UppercaseNo (N),
Yes (Y)
UnicodeisCasedNo (N),
Yes (Y)
isCasefoldedNo (N),
Yes (Y)
isLowercaseNo (N),
Yes (Y)
isTitlecaseNo (N),
Yes (Y)
isUppercaseNo (N),
Yes (Y)
StringUCDCase_FoldingShow Values
Lowercase_MappingShow Values
Simple_Case_FoldingShow Values
Simple_Lowercase_MappingShow Values
Simple_Titlecase_MappingShow Values
Simple_Uppercase_MappingShow Values
Titlecase_MappingShow Values
Uppercase_MappingShow Values
UnicodetoCasefoldShow Values
toLowercaseShow Values
toTitlecaseShow Values
toUppercaseShow Values
CJKBinaryUCDIDS_Binary_OperatorNo (N),
Yes (Y)
IDS_Trinary_OperatorNo (N),
Yes (Y)
IdeographicNo (N),
Yes (Y)
RadicalNo (N),
Yes (Y)
Unified_IdeographNo (N),
Yes (Y)
EnumeratedX-DemoHanTypeHan, Hans, Hant,
na
StringUCDCJK_RadicalShow Values
kSimplifiedVariantShow Values
kTraditionalVariantShow Values
EmojiBinaryUTSEmojiNo (N),
Yes (Y)
Emoji_ComponentNo (N),
Yes (Y)
Emoji_Flag_SequenceNo (No),
Yes (Yes)
Emoji_Keycap_SequenceNo (No),
Yes (Yes)
Emoji_ModifierNo (N),
Yes (Y)
Emoji_Modifier_BaseNo (N),
Yes (Y)
Emoji_Modifier_SequenceNo (No),
Yes (Yes)
Emoji_PresentationNo (N),
Yes (Y)
Emoji_Tag_SequenceNo (No),
Yes (Yes)
Emoji_Zwj_SequenceNo (No),
Yes (Yes)
EnumeratedUCDRegional_IndicatorNo (N),
Yes (Y)
GeneralBinaryUCDAlphabeticNo (N),
Yes (Y)
Default_Ignorable_Code_PointNo (N),
Yes (Y)
DeprecatedNo (N),
Yes (Y)
Logical_Order_ExceptionNo (N),
Yes (Y)
Noncharacter_Code_PointNo (N),
Yes (Y)
Variation_SelectorNo (N),
Yes (Y)
White_SpaceNo (N),
Yes (Y)
CatalogAgeShow Values
BlockShow Values
ScriptShow Values
EnumeratedGeneral_CategoryShow Values
Hangul_Syllable_TypeLeading_Jamo (L), LV_Syllable (LV), LVT_Syllable (LVT),
Not_Applicable (NA),
Trailing_Jamo (T),
Vowel_Jamo (V)
Name_AliasShow Values
Named_SequencesShow Values
Named_Sequences_Prov
StringNameslistsubheadShow Values
UCDNameShow Values
Script_ExtensionsShow Values
IdentifiersBinaryUCDID_ContinueNo (N),
Yes (Y)
ID_StartNo (N),
Yes (Y)
Pattern_SyntaxNo (N),
Yes (Y)
Pattern_White_SpaceNo (N),
Yes (Y)
XID_ContinueNo (N),
Yes (Y)
XID_StartNo (N),
Yes (Y)
IDNAEnumeratedUTSIdn_2008na (na),
NV8 (nv8),
XV8 (xv8)
Idn_Statusdeviation (dv), disallowed (da), disallowed_STD3_mapped (ds3m), disallowed_STD3_valid (ds3v),
ignored (i),
mapped (m),
valid (v)
idna2003deviation, disallowed,
ignored,
mapped,
valid
idna2008CONTEXTJ, CONTEXTO,
DISALLOWED,
PVALID,
UNASSIGNED
idna2008cdeviation, disallowed,
ignored,
mapped,
valid
uts46deviation, disallowed,
ignored,
mapped,
valid
StringIdn_MappingShow Values
toIdna2003Show Values
toUts46nShow Values
toUts46tShow Values
MiscellaneousBinaryUCDDashNo (N),
Yes (Y)
DiacriticNo (N),
Yes (Y)
ExtenderNo (N),
Yes (Y)
Grapheme_BaseNo (N),
Yes (Y)
Grapheme_ExtendNo (N),
Yes (Y)
Grapheme_LinkNo (N),
Yes (Y)
HyphenNo (N),
Yes (Y)
MathNo (N),
Yes (Y)
Quotation_MarkNo (N),
Yes (Y)
Sentence_TerminalNo (N),
Yes (Y)
Terminal_PunctuationNo (N),
Yes (Y)
EnumeratedIndic_Positional_CategoryShow Values
Indic_Syllabic_CategoryShow Values
MiscellaneousISO_CommentShow Values
Unicode_1_NameShow Values
NormalizationBinaryICUNFC_InertNo (N),
Yes (Y)
NFD_InertNo (N),
Yes (Y)
NFKC_InertNo (N),
Yes (Y)
NFKD_InertNo (N),
Yes (Y)
isNFMNo,
Yes
UCDChanges_When_NFKC_CasefoldedNo (N),
Yes (Y)
Full_Composition_ExclusionNo (N),
Yes (Y)
UnicodeisNFCNo,
Yes
isNFDNo,
Yes
isNFKCNo,
Yes
isNFKDNo,
Yes
EnumeratedICULead_Canonical_Combining_ClassShow Values
Trail_Canonical_Combining_ClassShow Values
UCDCanonical_Combining_ClassShow Values
Decomposition_TypeShow Values
NFC_Quick_CheckMaybe (M),
No (N),
Yes (Y)
NFD_Quick_CheckNo (N),
Yes (Y)
NFKC_Quick_CheckMaybe (M),
No (N),
Yes (Y)
NFKD_Quick_CheckNo (N),
Yes (Y)
StringICUtoNFMShow Values
UCDNFKC_CasefoldShow Values
UnicodetoNFCShow Values
toNFDShow Values
toNFKCShow Values
toNFKDShow Values
NumericBinaryUCDASCII_Hex_DigitNo (N),
Yes (Y)
Hex_DigitNo (N),
Yes (Y)
EnumeratedNumeric_TypeDecimal (De), Digit (Di),
None (None), Numeric (Nu)
kAccountingNumericShow Values
kOtherNumericShow Values
kPrimaryNumericShow Values
NumericNumeric_ValueShow Values
RegexBinaryUTSANYNo,
Yes
ASCIINo,
Yes
alnumNo (N),
Yes (Y)
blankNo (N),
Yes (Y)
bmpNo,
Yes
graphNo (N),
Yes (Y)
printNo (N),
Yes (Y)
xdigitNo (N),
Yes (Y)
SecurityEnumeratedUTSConfusable_MAShow Values
Identifier_StatusAllowed (a),
Restricted (r)
Identifier_TypeShow Values
Shaping and RenderingBinaryICUSegment_StarterNo (N),
Yes (Y)
UCDJoin_ControlNo (N),
Yes (Y)
EnumeratedEast_Asian_WidthAmbiguous (A),
Fullwidth (F),
Halfwidth (H),
Narrow (Na), Neutral (N),
Wide (W)
Grapheme_Cluster_BreakShow Values
Joining_GroupShow Values
Joining_TypeDual_Joining (D),
Join_Causing (C),
Left_Joining (L),
Non_Joining (U),
Right_Joining (R),
Transparent (T)
Line_BreakShow Values
Prepended_Concatenation_MarkNo (N),
Yes (Y)
Sentence_BreakShow Values
Standardized_Variantalternate form (isolate final) (alternate form (isolate final)), alternate form (isolate) (alternate form (isolate)),
CJK COMPATIBILITY IDEOGRAPH-# (CJK COMPATIBILITY IDEOGRAPH-#),
dotted form (dotted form),
following the slant of the lower leg (following the slant of the lower leg), fourth form (final) (fourth form (final)), fourth form (medial) (fourth form (medial)),
phags-pa letter reversed shaping e (phags-pa letter reversed shaping e), phags-pa letter reversed shaping ha (phags-pa letter reversed shaping ha), phags-pa letter reversed shaping i (phags-pa letter reversed shaping i), phags-pa letter reversed shaping small a (phags-pa letter reversed shaping small a), phags-pa letter reversed shaping subjoined ya (phags-pa letter reversed shaping subjoined ya), phags-pa letter reversed shaping u (phags-pa letter reversed shaping u),
second form (second form), second form (final) (second form (final)), second form (initial final) (second form (initial final)), second form (initial medial final) (second form (initial medial final)), second form (initial medial) (second form (initial medial)), second form (isolate final) (second form (isolate final)), second form (isolate initial medial) (second form (isolate initial medial)), second form (isolate medial final) (second form (isolate medial final)), second form (isolate medial) (second form (isolate medial)), second form (medial final) (second form (medial final)), second form (medial) (second form (medial)), short diagonal stroke form (short diagonal stroke form),
tall variant with narrow foot (tall variant with narrow foot), third form (final) (third form (final)), third form (medial final) (third form (medial final)), third form (medial) (third form (medial)),
with equal sign touching the circle (with equal sign touching the circle), with serifs (with serifs), with similar following the slant of the upper leg (with similar following the slant of the upper leg), with slanted equal (with slanted equal), with stroke through bottom members (with stroke through bottom members), with vertical stroke (with vertical stroke), with white rim (with white rim),
zero with long diagonal stroke overlay form (zero with long diagonal stroke overlay form)
Vertical_OrientationRotated (R),
Transformed_Rotated (Tr), Transformed_Upright (Tu),
Upright (U)
Word_BreakShow Values
UCABinaryUTSucaShow Values
uca2Show Values
uca2.5Show Values
uca3Show Values
Z-OtherOtherOtherExtended_PictographicOther

Key

The Categories are from UCD Table 8. Property Summary Table, with some extended categories: Emoji, IDNA, Regex, Security, and UCA.

The Datatypes are from UCD Table 5. Property Type Key.

The Sources are:


Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Noto Fonts site, Unicode Fonts for Ancient Scripts, Large, multi-script Unicode fonts. See also: Unicode Display Problems.

Version 3.8; ICU version: 60.0.1.0; Unicode version: 10.0.0.0