L2/03-140

Re: Property Containment Relationships
From: Mark Davis
Date: 2003-04-29

While researching the POSIX property issues, I ran into the not-very-intuitive fact that the set of characters with the Lowercase property is not a subset of the set of characters with the Alphabetic property. I built a quick & dirty program to generate a listing of the containment relationships among other sets corresponding to Unicode Character Properties, just in case that would be useful for spotting other cases of "surprises".

For simplicity, this skips the default value for each property.



ASCII_Hex_Digit=True
IS CONTAINED IN: East_Asian_Width=Narrow, Grapheme_Base=True, Hex_Digit=True, ID_Continue=True, XID_Continue=True

Alphabetic=True
CONTAINS: Decomposition_Type=Final, Decomposition_Type=Initial, Decomposition_Type=Medial, General_Category=Letter_Number, General_Category=Lowercase_Letter, General_Category=Modifier_Letter, General_Category=Other_Letter, General_Category=Titlecase_Letter, General_Category=Uppercase_Letter, Hangul_Syllable_Type=LVT_Syllable, Hangul_Syllable_Type=LV_Syllable, Hangul_Syllable_Type=Leading_Jamo, Hangul_Syllable_Type=Trailing_Jamo, Hangul_Syllable_Type=Vowel_Jamo, Ideographic=True, Joining_Type=Dual_Joining, Joining_Type=Right_Joining, Logical_Order_Exception=True, Script=Arabic, Script=Armenian, Script=Bopomofo, Script=Buhid, Script=Canadian_Aboriginal, Script=Cherokee, Script=Cypriot, Script=Deseret, Script=Georgian, Script=Gothic, Script=Greek, Script=Hangul, Script=Hebrew, Script=Hiragana, Script=Katakana, Script=Latin, Script=Linear_B, Script=Ogham, Script=Old_Italic, Script=Runic, Script=Shavian, Script=Tagbanwa, Script=Tai_Le, Script=Thaana, Script=Ugaritic, Soft_Dotted=True, Unified_Ideograph=True
IS CONTAINED IN: ID_Continue=True

Bidi_Class=Arabic_Letter
CONTAINS: Decomposition_Type=Final, Decomposition_Type=Initial, Decomposition_Type=Isolated, Decomposition_Type=Medial, Joining_Type=Dual_Joining, Joining_Type=Right_Joining, Script=Arabic

Bidi_Class=Arabic_Number
IS CONTAINED IN: Grapheme_Base=True

Bidi_Class=Boundary_Neutral
CONTAINS: Join_Control=True, Line_Break=Word_Joiner, Line_Break=ZWSpace
IS CONTAINED IN: Default_Ignorable_Code_Point=True

Bidi_Class=Common_Separator
IS CONTAINED IN: Grapheme_Base=True

Bidi_Class=European_Number
IS CONTAINED IN: Grapheme_Base=True

Bidi_Class=European_Separator
CONTAINS: Line_Break=Break_Symbols
IS CONTAINED IN: General_Category=Other_Punctuation, Grapheme_Base=True, Math=True

Bidi_Class=European_Terminator
CONTAINS: Line_Break=Hyphen
IS CONTAINED IN: Grapheme_Base=True

Bidi_Class=Left_To_Right_Embedding
IS CONTAINED IN: Bidi_Control=True, Default_Ignorable_Code_Point=True, General_Category=Format, Joining_Type=Transparent, Line_Break=Combining_Mark

Bidi_Class=Left_To_Right_Override
IS CONTAINED IN: Bidi_Control=True, Default_Ignorable_Code_Point=True, General_Category=Format, Joining_Type=Transparent, Line_Break=Combining_Mark

Bidi_Class=Nonspacing_Mark
CONTAINS: General_Category=Enclosing_Mark, Grapheme_Link=True
IS CONTAINED IN: Line_Break=Combining_Mark

Bidi_Class=Other_Neutral
CONTAINS: Bidi_Mirrored=True, Decomposition_Type=Fraction, General_Category=Close_Punctuation, General_Category=Connector_Punctuation, General_Category=Final_Punctuation, General_Category=Initial_Punctuation, General_Category=Modifier_Symbol, General_Category=Open_Punctuation, IDS_Binary_Operator=True, IDS_Trinary_Operator=True, Line_Break=Break_Before, Line_Break=Break_Both, Line_Break=Contingent_Break, Line_Break=Exclamation, Line_Break=Inseperable, Line_Break=Open_Punctuation, Line_Break=Quotation, Quotation_Mark=True, Radical=True, Script=Braille

Bidi_Class=Paragraph_Separator
CONTAINS: General_Category=Paragraph_Separator, Line_Break=Carriage_Return, Line_Break=Line_Feed, Line_Break=Next_Line

Bidi_Class=Pop_Directional_Format
IS CONTAINED IN: Bidi_Control=True, Default_Ignorable_Code_Point=True, General_Category=Format, Joining_Type=Transparent, Line_Break=Combining_Mark

Bidi_Class=Right_To_Left
CONTAINS: Script=Cypriot, Script=Hebrew

Bidi_Class=Right_To_Left_Embedding
IS CONTAINED IN: Bidi_Control=True, Default_Ignorable_Code_Point=True, General_Category=Format, Joining_Type=Transparent, Line_Break=Combining_Mark

Bidi_Class=Right_To_Left_Override
IS CONTAINED IN: Bidi_Control=True, Default_Ignorable_Code_Point=True, General_Category=Format, Joining_Type=Transparent, Line_Break=Combining_Mark

Bidi_Class=Segment_Separator
IS CONTAINED IN: General_Category=Control

Bidi_Class=White_Space
CONTAINS: General_Category=Line_Separator, Line_Break=Space
IS CONTAINED IN: White_Space=True

Bidi_Control=True
CONTAINS: Bidi_Class=Left_To_Right_Embedding, Bidi_Class=Left_To_Right_Override, Bidi_Class=Pop_Directional_Format, Bidi_Class=Right_To_Left_Embedding, Bidi_Class=Right_To_Left_Override
IS CONTAINED IN: Default_Ignorable_Code_Point=True, General_Category=Format, Joining_Type=Transparent, Line_Break=Combining_Mark

Bidi_Mirrored=True
IS CONTAINED IN: Bidi_Class=Other_Neutral, Grapheme_Base=True

Composition_Exclusion=True
IS CONTAINED IN: Decomposition_Type=Canonical, Expands_On_NFC=True, Expands_On_NFD=True, Expands_On_NFKC=True, Expands_On_NFKD=True, Full_Composition_Exclusion=True

Dash=True
CONTAINS: Line_Break=Break_Both, Line_Break=Hyphen
IS CONTAINED IN: Grapheme_Base=True

Decomposition_Type=Canonical
CONTAINS: Composition_Exclusion=True, Expands_On_NFC=True, Expands_On_NFD=True, Full_Composition_Exclusion=True, Hangul_Syllable_Type=LVT_Syllable, Hangul_Syllable_Type=LV_Syllable

Decomposition_Type=Circle
IS CONTAINED IN: Grapheme_Base=True

Decomposition_Type=Compat
CONTAINS: Line_Break=Inseperable

Decomposition_Type=Final
IS CONTAINED IN: Alphabetic=True, Bidi_Class=Arabic_Letter, General_Category=Other_Letter, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Alphabetic, Script=Arabic, XID_Continue=True, XID_Start=True

Decomposition_Type=Font
IS CONTAINED IN: Grapheme_Base=True

Decomposition_Type=Fraction
IS CONTAINED IN: Bidi_Class=Other_Neutral, Expands_On_NFKC=True, Expands_On_NFKD=True, General_Category=Other_Number, Grapheme_Base=True, Numeric_Type=Numeric

Decomposition_Type=Initial
IS CONTAINED IN: Alphabetic=True, Bidi_Class=Arabic_Letter, General_Category=Other_Letter, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Alphabetic, Script=Arabic, XID_Continue=True, XID_Start=True

Decomposition_Type=Isolated
IS CONTAINED IN: Bidi_Class=Arabic_Letter, Grapheme_Base=True

Decomposition_Type=Medial
IS CONTAINED IN: Alphabetic=True, Bidi_Class=Arabic_Letter, General_Category=Other_Letter, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Alphabetic, Script=Arabic, XID_Continue=True, XID_Start=True

Decomposition_Type=Narrow
IS CONTAINED IN: East_Asian_Width=Halfwidth, Grapheme_Base=True

Decomposition_Type=No_Break
IS CONTAINED IN: Grapheme_Base=True, Line_Break=Glue

Decomposition_Type=Small
IS CONTAINED IN: East_Asian_Width=Wide, Grapheme_Base=True

Decomposition_Type=Square
IS CONTAINED IN: East_Asian_Width=Wide, Expands_On_NFKC=True, Expands_On_NFKD=True, General_Category=Other_Symbol, Grapheme_Base=True, Line_Break=Ideographic

Decomposition_Type=Sub
IS CONTAINED IN: Grapheme_Base=True

Decomposition_Type=Super
IS CONTAINED IN: Grapheme_Base=True

Decomposition_Type=Vertical
IS CONTAINED IN: East_Asian_Width=Wide, Grapheme_Base=True

Decomposition_Type=Wide
EQUALS: East_Asian_Width=Fullwidth
IS CONTAINED IN: Grapheme_Base=True

Default_Ignorable_Code_Point=True
CONTAINS: Bidi_Class=Boundary_Neutral, Bidi_Class=Left_To_Right_Embedding, Bidi_Class=Left_To_Right_Override, Bidi_Class=Pop_Directional_Format, Bidi_Class=Right_To_Left_Embedding, Bidi_Class=Right_To_Left_Override, Bidi_Control=True, General_Category=Format, General_Category=Surrogate, Join_Control=True, Line_Break=Surrogate, Line_Break=Word_Joiner, Line_Break=ZWSpace

Diacritic=True
CONTAINS: General_Category=Modifier_Symbol

East_Asian_Width=Ambiguous
CONTAINS: General_Category=Private_Use, Line_Break=Break_Both, Line_Break=Inseperable

East_Asian_Width=Fullwidth
EQUALS: Decomposition_Type=Wide
IS CONTAINED IN: Grapheme_Base=True

East_Asian_Width=Halfwidth
CONTAINS: Decomposition_Type=Narrow
IS CONTAINED IN: Grapheme_Base=True

East_Asian_Width=Narrow
CONTAINS: ASCII_Hex_Digit=True, Line_Break=Break_Symbols, Line_Break=Hyphen, Line_Break=Space
IS CONTAINED IN: Grapheme_Base=True

East_Asian_Width=Wide
CONTAINS: Decomposition_Type=Small, Decomposition_Type=Square, Decomposition_Type=Vertical, Hangul_Syllable_Type=LVT_Syllable, Hangul_Syllable_Type=LV_Syllable, Hangul_Syllable_Type=Leading_Jamo, IDS_Binary_Operator=True, IDS_Trinary_Operator=True, Ideographic=True, Radical=True, Script=Bopomofo, Script=Han, Script=Hiragana, Script=Yi, Unified_Ideograph=True

Expands_On_NFC=True
CONTAINS: Composition_Exclusion=True
IS CONTAINED IN: Decomposition_Type=Canonical, Expands_On_NFD=True, Expands_On_NFKC=True, Expands_On_NFKD=True, Full_Composition_Exclusion=True

Expands_On_NFD=True
CONTAINS: Composition_Exclusion=True, Expands_On_NFC=True, Hangul_Syllable_Type=LVT_Syllable, Hangul_Syllable_Type=LV_Syllable
IS CONTAINED IN: Decomposition_Type=Canonical, Expands_On_NFKD=True

Expands_On_NFKC=True
CONTAINS: Composition_Exclusion=True, Decomposition_Type=Fraction, Decomposition_Type=Square, Expands_On_NFC=True
IS CONTAINED IN: Expands_On_NFKD=True

Expands_On_NFKD=True
CONTAINS: Composition_Exclusion=True, Decomposition_Type=Fraction, Decomposition_Type=Square, Expands_On_NFC=True, Expands_On_NFD=True, Expands_On_NFKC=True, General_Category=Titlecase_Letter, Hangul_Syllable_Type=LVT_Syllable, Hangul_Syllable_Type=LV_Syllable

Extender=True
IS CONTAINED IN: Grapheme_Base=True, XID_Continue=True

Full_Composition_Exclusion=True
CONTAINS: Composition_Exclusion=True, Expands_On_NFC=True
IS CONTAINED IN: Decomposition_Type=Canonical

General_Category=Close_Punctuation
IS CONTAINED IN: Bidi_Class=Other_Neutral, Grapheme_Base=True, Line_Break=Close_Punctuation

General_Category=Connector_Punctuation
IS CONTAINED IN: Bidi_Class=Other_Neutral, Grapheme_Base=True, ID_Continue=True, XID_Continue=True

General_Category=Control
CONTAINS: Bidi_Class=Segment_Separator, Line_Break=Carriage_Return, Line_Break=Line_Feed, Line_Break=Next_Line

General_Category=Currency_Symbol
IS CONTAINED IN: Grapheme_Base=True

General_Category=Dash_Punctuation
CONTAINS: Line_Break=Break_Both, Line_Break=Hyphen
IS CONTAINED IN: Grapheme_Base=True

General_Category=Decimal_Number
EQUALS: Numeric_Type=Decimal
CONTAINS: Line_Break=Numeric
IS CONTAINED IN: Grapheme_Base=True, ID_Continue=True, XID_Continue=True

General_Category=Enclosing_Mark
IS CONTAINED IN: Bidi_Class=Nonspacing_Mark, Grapheme_Extend=True, Line_Break=Combining_Mark, Script=Inherited

General_Category=Final_Punctuation
IS CONTAINED IN: Bidi_Class=Other_Neutral, Grapheme_Base=True, Line_Break=Quotation, Quotation_Mark=True

General_Category=Format
CONTAINS: Bidi_Class=Left_To_Right_Embedding, Bidi_Class=Left_To_Right_Override, Bidi_Class=Pop_Directional_Format, Bidi_Class=Right_To_Left_Embedding, Bidi_Class=Right_To_Left_Override, Bidi_Control=True, Join_Control=True, Line_Break=Word_Joiner
IS CONTAINED IN: Default_Ignorable_Code_Point=True

General_Category=Initial_Punctuation
IS CONTAINED IN: Bidi_Class=Other_Neutral, Grapheme_Base=True, Line_Break=Quotation, Quotation_Mark=True

General_Category=Letter_Number
IS CONTAINED IN: Alphabetic=True, Grapheme_Base=True, ID_Continue=True, ID_Start=True, XID_Continue=True, XID_Start=True

General_Category=Line_Separator
IS CONTAINED IN: Bidi_Class=White_Space, Line_Break=Mandatory_Break, White_Space=True

General_Category=Lowercase_Letter
CONTAINS: Soft_Dotted=True
IS CONTAINED IN: Alphabetic=True, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Lowercase=True, XID_Continue=True, XID_Start=True

General_Category=Math_Symbol
IS CONTAINED IN: Grapheme_Base=True, Math=True

General_Category=Modifier_Letter
IS CONTAINED IN: Alphabetic=True, Grapheme_Base=True, ID_Continue=True, ID_Start=True

General_Category=Modifier_Symbol
IS CONTAINED IN: Bidi_Class=Other_Neutral, Diacritic=True, Grapheme_Base=True

General_Category=Nonspacing_Mark
CONTAINS: Grapheme_Link=True
IS CONTAINED IN: Grapheme_Extend=True, ID_Continue=True, Joining_Type=Transparent, Line_Break=Combining_Mark, XID_Continue=True

General_Category=Open_Punctuation
EQUALS: Line_Break=Open_Punctuation
IS CONTAINED IN: Bidi_Class=Other_Neutral, Grapheme_Base=True

General_Category=Other_Letter
CONTAINS: Decomposition_Type=Final, Decomposition_Type=Initial, Decomposition_Type=Medial, Hangul_Syllable_Type=LVT_Syllable, Hangul_Syllable_Type=LV_Syllable, Hangul_Syllable_Type=Leading_Jamo, Hangul_Syllable_Type=Trailing_Jamo, Hangul_Syllable_Type=Vowel_Jamo, Joining_Type=Dual_Joining, Joining_Type=Right_Joining, Logical_Order_Exception=True, Script=Bopomofo, Script=Canadian_Aboriginal, Script=Cherokee, Script=Cypriot, Script=Hangul, Script=Hebrew, Script=Linear_B, Script=Ogham, Script=Old_Italic, Script=Shavian, Script=Tai_Le, Script=Ugaritic, Unified_Ideograph=True
IS CONTAINED IN: Alphabetic=True, Grapheme_Base=True, ID_Continue=True, ID_Start=True

General_Category=Other_Number
CONTAINS: Decomposition_Type=Fraction, Numeric_Type=Digit
IS CONTAINED IN: Grapheme_Base=True

General_Category=Other_Punctuation
CONTAINS: Bidi_Class=European_Separator, Line_Break=Break_Symbols, Line_Break=Inseperable, Terminal_Punctuation=True
IS CONTAINED IN: Grapheme_Base=True

General_Category=Other_Symbol
CONTAINS: Decomposition_Type=Square, IDS_Binary_Operator=True, IDS_Trinary_Operator=True, Line_Break=Contingent_Break, Radical=True, Script=Braille
IS CONTAINED IN: Grapheme_Base=True

General_Category=Paragraph_Separator
IS CONTAINED IN: Bidi_Class=Paragraph_Separator, Line_Break=Mandatory_Break, White_Space=True

General_Category=Private_Use
IS CONTAINED IN: East_Asian_Width=Ambiguous

General_Category=Space_Separator
CONTAINS: Line_Break=Space, Line_Break=ZWSpace
IS CONTAINED IN: Grapheme_Base=True

General_Category=Spacing_Mark
IS CONTAINED IN: ID_Continue=True, Line_Break=Combining_Mark, XID_Continue=True

General_Category=Surrogate
EQUALS: Line_Break=Surrogate
IS CONTAINED IN: Default_Ignorable_Code_Point=True

General_Category=Titlecase_Letter
IS CONTAINED IN: Alphabetic=True, Expands_On_NFKD=True, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Alphabetic, XID_Continue=True, XID_Start=True

General_Category=Uppercase_Letter
IS CONTAINED IN: Alphabetic=True, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Uppercase=True, XID_Continue=True, XID_Start=True

Grapheme_Base=True
CONTAINS: ASCII_Hex_Digit=True, Bidi_Class=Arabic_Number, Bidi_Class=Common_Separator, Bidi_Class=European_Number, Bidi_Class=European_Separator, Bidi_Class=European_Terminator, Bidi_Mirrored=True, Dash=True, Decomposition_Type=Circle, Decomposition_Type=Final, Decomposition_Type=Font, Decomposition_Type=Fraction, Decomposition_Type=Initial, Decomposition_Type=Isolated, Decomposition_Type=Medial, Decomposition_Type=Narrow, Decomposition_Type=No_Break, Decomposition_Type=Small, Decomposition_Type=Square, Decomposition_Type=Sub, Decomposition_Type=Super, Decomposition_Type=Vertical, Decomposition_Type=Wide, East_Asian_Width=Fullwidth, East_Asian_Width=Halfwidth, East_Asian_Width=Narrow, Extender=True, General_Category=Close_Punctuation, General_Category=Connector_Punctuation, General_Category=Currency_Symbol, General_Category=Dash_Punctuation, General_Category=Decimal_Number, General_Category=Final_Punctuation, General_Category=Initial_Punctuation, General_Category=Letter_Number, General_Category=Lowercase_Letter, General_Category=Math_Symbol, General_Category=Modifier_Letter, General_Category=Modifier_Symbol, General_Category=Open_Punctuation, General_Category=Other_Letter, General_Category=Other_Number, General_Category=Other_Punctuation, General_Category=Other_Symbol, General_Category=Space_Separator, General_Category=Titlecase_Letter, General_Category=Uppercase_Letter, Hangul_Syllable_Type=LVT_Syllable, Hangul_Syllable_Type=LV_Syllable, Hangul_Syllable_Type=Leading_Jamo, Hangul_Syllable_Type=Trailing_Jamo, Hangul_Syllable_Type=Vowel_Jamo, Hex_Digit=True, IDS_Binary_Operator=True, IDS_Trinary_Operator=True, ID_Start=True, Ideographic=True, Joining_Type=Dual_Joining, Joining_Type=Right_Joining, Line_Break=Ambiguous, Line_Break=Break_Before, Line_Break=Break_Both, Line_Break=Break_Symbols, Line_Break=Close_Punctuation, Line_Break=Contingent_Break, Line_Break=Exclamation, Line_Break=Glue, Line_Break=Hyphen, Line_Break=Ideographic, Line_Break=Infix_Numeric, Line_Break=Inseperable, Line_Break=Nonstarter, Line_Break=Numeric, Line_Break=Open_Punctuation, Line_Break=Postfix_Numeric, Line_Break=Prefix_Numeric, Line_Break=Quotation, Line_Break=Space, Line_Break=ZWSpace, Logical_Order_Exception=True, Numeric_Type=Decimal, Numeric_Type=Digit, Numeric_Type=Numeric, Quotation_Mark=True, Radical=True, Script=Arabic, Script=Armenian, Script=Bopomofo, Script=Braille, Script=Canadian_Aboriginal, Script=Cherokee, Script=Cypriot, Script=Deseret, Script=Ethiopic, Script=Georgian, Script=Gothic, Script=Greek, Script=Han, Script=Hangul, Script=Hebrew, Script=Hiragana, Script=Katakana, Script=Latin, Script=Linear_B, Script=Ogham, Script=Old_Italic, Script=Osmanya, Script=Runic, Script=Shavian, Script=Tai_Le, Script=Ugaritic, Script=Yi, Soft_Dotted=True, Terminal_Punctuation=True, Unified_Ideograph=True, Uppercase=True, XID_Start=True

Grapheme_Extend=True
CONTAINS: General_Category=Enclosing_Mark, General_Category=Nonspacing_Mark, Grapheme_Link=True
IS CONTAINED IN: Line_Break=Combining_Mark

Grapheme_Link=True
IS CONTAINED IN: Bidi_Class=Nonspacing_Mark, General_Category=Nonspacing_Mark, Grapheme_Extend=True, ID_Continue=True, Joining_Type=Transparent, Line_Break=Combining_Mark, XID_Continue=True

Hangul_Syllable_Type=LVT_Syllable
IS CONTAINED IN: Alphabetic=True, Decomposition_Type=Canonical, East_Asian_Width=Wide, Expands_On_NFD=True, Expands_On_NFKD=True, General_Category=Other_Letter, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Ideographic, Script=Hangul, XID_Continue=True, XID_Start=True

Hangul_Syllable_Type=LV_Syllable
IS CONTAINED IN: Alphabetic=True, Decomposition_Type=Canonical, East_Asian_Width=Wide, Expands_On_NFD=True, Expands_On_NFKD=True, General_Category=Other_Letter, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Ideographic, Script=Hangul, XID_Continue=True, XID_Start=True

Hangul_Syllable_Type=Leading_Jamo
IS CONTAINED IN: Alphabetic=True, East_Asian_Width=Wide, General_Category=Other_Letter, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Ideographic, Script=Hangul, XID_Continue=True, XID_Start=True

Hangul_Syllable_Type=Trailing_Jamo
IS CONTAINED IN: Alphabetic=True, General_Category=Other_Letter, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Ideographic, Script=Hangul, XID_Continue=True, XID_Start=True

Hangul_Syllable_Type=Vowel_Jamo
IS CONTAINED IN: Alphabetic=True, General_Category=Other_Letter, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Ideographic, Script=Hangul, XID_Continue=True, XID_Start=True

Hex_Digit=True
CONTAINS: ASCII_Hex_Digit=True
IS CONTAINED IN: Grapheme_Base=True, ID_Continue=True, XID_Continue=True

Hyphen=True
CONTAINS: Line_Break=Hyphen

IDS_Binary_Operator=True
IS CONTAINED IN: Bidi_Class=Other_Neutral, East_Asian_Width=Wide, General_Category=Other_Symbol, Grapheme_Base=True, Line_Break=Ideographic

IDS_Trinary_Operator=True
IS CONTAINED IN: Bidi_Class=Other_Neutral, East_Asian_Width=Wide, General_Category=Other_Symbol, Grapheme_Base=True, Line_Break=Ideographic

ID_Continue=True
CONTAINS: ASCII_Hex_Digit=True, Alphabetic=True, Decomposition_Type=Final, Decomposition_Type=Initial, Decomposition_Type=Medial, General_Category=Connector_Punctuation, General_Category=Decimal_Number, General_Category=Letter_Number, General_Category=Lowercase_Letter, General_Category=Modifier_Letter, General_Category=Nonspacing_Mark, General_Category=Other_Letter, General_Category=Spacing_Mark, General_Category=Titlecase_Letter, General_Category=Uppercase_Letter, Grapheme_Link=True, Hangul_Syllable_Type=LVT_Syllable, Hangul_Syllable_Type=LV_Syllable, Hangul_Syllable_Type=Leading_Jamo, Hangul_Syllable_Type=Trailing_Jamo, Hangul_Syllable_Type=Vowel_Jamo, Hex_Digit=True, ID_Start=True, Ideographic=True, Joining_Type=Dual_Joining, Joining_Type=Right_Joining, Line_Break=Numeric, Logical_Order_Exception=True, Numeric_Type=Decimal, Script=Arabic, Script=Armenian, Script=Bengali, Script=Bopomofo, Script=Buhid, Script=Canadian_Aboriginal, Script=Cherokee, Script=Cypriot, Script=Cyrillic, Script=Deseret, Script=Devanagari, Script=Georgian, Script=Gothic, Script=Greek, Script=Gujarati, Script=Gurmukhi, Script=Hangul, Script=Hanunoo, Script=Hebrew, Script=Hiragana, Script=Kannada, Script=Katakana, Script=Lao, Script=Latin, Script=Limbu, Script=Linear_B, Script=Malayalam, Script=Mongolian, Script=Myanmar, Script=Ogham, Script=Old_Italic, Script=Oriya, Script=Osmanya, Script=Runic, Script=Shavian, Script=Sinhala, Script=Syriac, Script=Tagalog, Script=Tagbanwa, Script=Tai_Le, Script=Telugu, Script=Thaana, Script=Thai, Script=Ugaritic, Soft_Dotted=True, Unified_Ideograph=True, XID_Start=True

ID_Start=True
CONTAINS: Decomposition_Type=Final, Decomposition_Type=Initial, Decomposition_Type=Medial, General_Category=Letter_Number, General_Category=Lowercase_Letter, General_Category=Modifier_Letter, General_Category=Other_Letter, General_Category=Titlecase_Letter, General_Category=Uppercase_Letter, Hangul_Syllable_Type=LVT_Syllable, Hangul_Syllable_Type=LV_Syllable, Hangul_Syllable_Type=Leading_Jamo, Hangul_Syllable_Type=Trailing_Jamo, Hangul_Syllable_Type=Vowel_Jamo, Ideographic=True, Joining_Type=Dual_Joining, Joining_Type=Right_Joining, Logical_Order_Exception=True, Script=Arabic, Script=Armenian, Script=Bopomofo, Script=Canadian_Aboriginal, Script=Cherokee, Script=Cypriot, Script=Deseret, Script=Georgian, Script=Gothic, Script=Greek, Script=Hangul, Script=Hebrew, Script=Hiragana, Script=Katakana, Script=Latin, Script=Linear_B, Script=Ogham, Script=Old_Italic, Script=Runic, Script=Shavian, Script=Tai_Le, Script=Ugaritic, Soft_Dotted=True, Unified_Ideograph=True, XID_Start=True
IS CONTAINED IN: Grapheme_Base=True, ID_Continue=True

Ideographic=True
CONTAINS: Unified_Ideograph=True
IS CONTAINED IN: Alphabetic=True, East_Asian_Width=Wide, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Ideographic, XID_Continue=True, XID_Start=True

Join_Control=True
IS CONTAINED IN: Bidi_Class=Boundary_Neutral, Default_Ignorable_Code_Point=True, General_Category=Format, Line_Break=Combining_Mark

Joining_Type=Dual_Joining
IS CONTAINED IN: Alphabetic=True, Bidi_Class=Arabic_Letter, General_Category=Other_Letter, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Alphabetic, XID_Continue=True, XID_Start=True

Joining_Type=Right_Joining
IS CONTAINED IN: Alphabetic=True, Bidi_Class=Arabic_Letter, General_Category=Other_Letter, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Alphabetic, XID_Continue=True, XID_Start=True

Joining_Type=Transparent
CONTAINS: Bidi_Class=Left_To_Right_Embedding, Bidi_Class=Left_To_Right_Override, Bidi_Class=Pop_Directional_Format, Bidi_Class=Right_To_Left_Embedding, Bidi_Class=Right_To_Left_Override, Bidi_Control=True, General_Category=Nonspacing_Mark, Grapheme_Link=True, Line_Break=Word_Joiner

Line_Break=Alphabetic
CONTAINS: Decomposition_Type=Final, Decomposition_Type=Initial, Decomposition_Type=Medial, General_Category=Titlecase_Letter, Joining_Type=Dual_Joining, Joining_Type=Right_Joining, Script=Arabic, Script=Armenian, Script=Braille, Script=Canadian_Aboriginal, Script=Cherokee, Script=Cypriot, Script=Deseret, Script=Georgian, Script=Gothic, Script=Hebrew, Script=Linear_B, Script=Ogham, Script=Old_Italic, Script=Runic, Script=Shavian, Script=Tai_Le, Script=Ugaritic, Soft_Dotted=True

Line_Break=Ambiguous
IS CONTAINED IN: Grapheme_Base=True

Line_Break=Break_Before
IS CONTAINED IN: Bidi_Class=Other_Neutral, Grapheme_Base=True

Line_Break=Break_Both
IS CONTAINED IN: Bidi_Class=Other_Neutral, Dash=True, East_Asian_Width=Ambiguous, General_Category=Dash_Punctuation, Grapheme_Base=True

Line_Break=Break_Symbols
IS CONTAINED IN: Bidi_Class=European_Separator, East_Asian_Width=Narrow, General_Category=Other_Punctuation, Grapheme_Base=True, Math=True

Line_Break=Carriage_Return
IS CONTAINED IN: Bidi_Class=Paragraph_Separator, General_Category=Control, White_Space=True

Line_Break=Close_Punctuation
CONTAINS: General_Category=Close_Punctuation
IS CONTAINED IN: Grapheme_Base=True

Line_Break=Combining_Mark
CONTAINS: Bidi_Class=Left_To_Right_Embedding, Bidi_Class=Left_To_Right_Override, Bidi_Class=Nonspacing_Mark, Bidi_Class=Pop_Directional_Format, Bidi_Class=Right_To_Left_Embedding, Bidi_Class=Right_To_Left_Override, Bidi_Control=True, General_Category=Enclosing_Mark, General_Category=Nonspacing_Mark, General_Category=Spacing_Mark, Grapheme_Extend=True, Grapheme_Link=True, Join_Control=True

Line_Break=Complex_Context
CONTAINS: Logical_Order_Exception=True

Line_Break=Contingent_Break
IS CONTAINED IN: Bidi_Class=Other_Neutral, General_Category=Other_Symbol, Grapheme_Base=True

Line_Break=Exclamation
IS CONTAINED IN: Bidi_Class=Other_Neutral, Grapheme_Base=True

Line_Break=Glue
CONTAINS: Decomposition_Type=No_Break
IS CONTAINED IN: Grapheme_Base=True

Line_Break=Hyphen
IS CONTAINED IN: Bidi_Class=European_Terminator, Dash=True, East_Asian_Width=Narrow, General_Category=Dash_Punctuation, Grapheme_Base=True, Hyphen=True, Math=True

Line_Break=Ideographic
CONTAINS: Decomposition_Type=Square, Hangul_Syllable_Type=LVT_Syllable, Hangul_Syllable_Type=LV_Syllable, Hangul_Syllable_Type=Leading_Jamo, Hangul_Syllable_Type=Trailing_Jamo, Hangul_Syllable_Type=Vowel_Jamo, IDS_Binary_Operator=True, IDS_Trinary_Operator=True, Ideographic=True, Radical=True, Script=Bopomofo, Script=Yi, Unified_Ideograph=True
IS CONTAINED IN: Grapheme_Base=True

Line_Break=Infix_Numeric
IS CONTAINED IN: Grapheme_Base=True

Line_Break=Inseperable
IS CONTAINED IN: Bidi_Class=Other_Neutral, Decomposition_Type=Compat, East_Asian_Width=Ambiguous, General_Category=Other_Punctuation, Grapheme_Base=True

Line_Break=Line_Feed
IS CONTAINED IN: Bidi_Class=Paragraph_Separator, General_Category=Control, White_Space=True

Line_Break=Mandatory_Break
CONTAINS: General_Category=Line_Separator, General_Category=Paragraph_Separator
IS CONTAINED IN: White_Space=True

Line_Break=Next_Line
IS CONTAINED IN: Bidi_Class=Paragraph_Separator, General_Category=Control, White_Space=True

Line_Break=Nonstarter
IS CONTAINED IN: Grapheme_Base=True

Line_Break=Numeric
IS CONTAINED IN: General_Category=Decimal_Number, Grapheme_Base=True, ID_Continue=True, Numeric_Type=Decimal, XID_Continue=True

Line_Break=Open_Punctuation
EQUALS: General_Category=Open_Punctuation
IS CONTAINED IN: Bidi_Class=Other_Neutral, Grapheme_Base=True

Line_Break=Postfix_Numeric
IS CONTAINED IN: Grapheme_Base=True

Line_Break=Prefix_Numeric
IS CONTAINED IN: Grapheme_Base=True

Line_Break=Quotation
CONTAINS: General_Category=Final_Punctuation, General_Category=Initial_Punctuation
IS CONTAINED IN: Bidi_Class=Other_Neutral, Grapheme_Base=True

Line_Break=Space
IS CONTAINED IN: Bidi_Class=White_Space, East_Asian_Width=Narrow, General_Category=Space_Separator, Grapheme_Base=True, White_Space=True

Line_Break=Surrogate
EQUALS: General_Category=Surrogate
IS CONTAINED IN: Default_Ignorable_Code_Point=True

Line_Break=Word_Joiner
IS CONTAINED IN: Bidi_Class=Boundary_Neutral, Default_Ignorable_Code_Point=True, General_Category=Format, Joining_Type=Transparent

Line_Break=ZWSpace
IS CONTAINED IN: Bidi_Class=Boundary_Neutral, Default_Ignorable_Code_Point=True, General_Category=Space_Separator, Grapheme_Base=True

Logical_Order_Exception=True
IS CONTAINED IN: Alphabetic=True, General_Category=Other_Letter, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Complex_Context, XID_Continue=True, XID_Start=True

Lowercase=True
CONTAINS: General_Category=Lowercase_Letter, Soft_Dotted=True

Math=True
CONTAINS: Bidi_Class=European_Separator, General_Category=Math_Symbol, Line_Break=Break_Symbols, Line_Break=Hyphen

Numeric_Type=Decimal
EQUALS: General_Category=Decimal_Number
CONTAINS: Line_Break=Numeric
IS CONTAINED IN: Grapheme_Base=True, ID_Continue=True, XID_Continue=True

Numeric_Type=Digit
IS CONTAINED IN: General_Category=Other_Number, Grapheme_Base=True

Numeric_Type=Numeric
CONTAINS: Decomposition_Type=Fraction
IS CONTAINED IN: Grapheme_Base=True

Quotation_Mark=True
CONTAINS: General_Category=Final_Punctuation, General_Category=Initial_Punctuation
IS CONTAINED IN: Bidi_Class=Other_Neutral, Grapheme_Base=True

Radical=True
IS CONTAINED IN: Bidi_Class=Other_Neutral, East_Asian_Width=Wide, General_Category=Other_Symbol, Grapheme_Base=True, Line_Break=Ideographic, Script=Han

Script=Arabic
CONTAINS: Decomposition_Type=Final, Decomposition_Type=Initial, Decomposition_Type=Medial
IS CONTAINED IN: Alphabetic=True, Bidi_Class=Arabic_Letter, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Alphabetic

Script=Armenian
IS CONTAINED IN: Alphabetic=True, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Alphabetic, XID_Continue=True, XID_Start=True

Script=Bengali
IS CONTAINED IN: ID_Continue=True, XID_Continue=True

Script=Bopomofo
IS CONTAINED IN: Alphabetic=True, East_Asian_Width=Wide, General_Category=Other_Letter, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Ideographic, XID_Continue=True, XID_Start=True

Script=Braille
IS CONTAINED IN: Bidi_Class=Other_Neutral, General_Category=Other_Symbol, Grapheme_Base=True, Line_Break=Alphabetic

Script=Buhid
IS CONTAINED IN: Alphabetic=True, ID_Continue=True, XID_Continue=True

Script=Canadian_Aboriginal
IS CONTAINED IN: Alphabetic=True, General_Category=Other_Letter, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Alphabetic, XID_Continue=True, XID_Start=True

Script=Cherokee
IS CONTAINED IN: Alphabetic=True, General_Category=Other_Letter, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Alphabetic, XID_Continue=True, XID_Start=True

Script=Cypriot
IS CONTAINED IN: Alphabetic=True, Bidi_Class=Right_To_Left, General_Category=Other_Letter, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Alphabetic, XID_Continue=True, XID_Start=True

Script=Cyrillic
IS CONTAINED IN: ID_Continue=True, XID_Continue=True

Script=Deseret
IS CONTAINED IN: Alphabetic=True, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Alphabetic, XID_Continue=True, XID_Start=True

Script=Devanagari
IS CONTAINED IN: ID_Continue=True, XID_Continue=True

Script=Ethiopic
IS CONTAINED IN: Grapheme_Base=True

Script=Georgian
IS CONTAINED IN: Alphabetic=True, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Alphabetic, XID_Continue=True, XID_Start=True

Script=Gothic
IS CONTAINED IN: Alphabetic=True, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Alphabetic, XID_Continue=True, XID_Start=True

Script=Greek
IS CONTAINED IN: Alphabetic=True, Grapheme_Base=True, ID_Continue=True, ID_Start=True

Script=Gujarati
IS CONTAINED IN: ID_Continue=True, XID_Continue=True

Script=Gurmukhi
IS CONTAINED IN: ID_Continue=True, XID_Continue=True

Script=Han
CONTAINS: Radical=True, Unified_Ideograph=True
IS CONTAINED IN: East_Asian_Width=Wide, Grapheme_Base=True

Script=Hangul
CONTAINS: Hangul_Syllable_Type=LVT_Syllable, Hangul_Syllable_Type=LV_Syllable, Hangul_Syllable_Type=Leading_Jamo, Hangul_Syllable_Type=Trailing_Jamo, Hangul_Syllable_Type=Vowel_Jamo
IS CONTAINED IN: Alphabetic=True, General_Category=Other_Letter, Grapheme_Base=True, ID_Continue=True, ID_Start=True, XID_Continue=True, XID_Start=True

Script=Hanunoo
IS CONTAINED IN: ID_Continue=True, XID_Continue=True

Script=Hebrew
IS CONTAINED IN: Alphabetic=True, Bidi_Class=Right_To_Left, General_Category=Other_Letter, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Alphabetic, XID_Continue=True, XID_Start=True

Script=Hiragana
IS CONTAINED IN: Alphabetic=True, East_Asian_Width=Wide, Grapheme_Base=True, ID_Continue=True, ID_Start=True, XID_Continue=True, XID_Start=True

Script=Inherited
CONTAINS: General_Category=Enclosing_Mark

Script=Kannada
IS CONTAINED IN: ID_Continue=True, XID_Continue=True

Script=Katakana
IS CONTAINED IN: Alphabetic=True, Grapheme_Base=True, ID_Continue=True, ID_Start=True, XID_Continue=True, XID_Start=True

Script=Lao
IS CONTAINED IN: ID_Continue=True, XID_Continue=True

Script=Latin
IS CONTAINED IN: Alphabetic=True, Grapheme_Base=True, ID_Continue=True, ID_Start=True, XID_Continue=True, XID_Start=True

Script=Limbu
IS CONTAINED IN: ID_Continue=True, XID_Continue=True

Script=Linear_B
IS CONTAINED IN: Alphabetic=True, General_Category=Other_Letter, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Alphabetic, XID_Continue=True, XID_Start=True

Script=Malayalam
IS CONTAINED IN: ID_Continue=True, XID_Continue=True

Script=Mongolian
IS CONTAINED IN: ID_Continue=True, XID_Continue=True

Script=Myanmar
IS CONTAINED IN: ID_Continue=True, XID_Continue=True

Script=Ogham
IS CONTAINED IN: Alphabetic=True, General_Category=Other_Letter, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Alphabetic, XID_Continue=True, XID_Start=True

Script=Old_Italic
IS CONTAINED IN: Alphabetic=True, General_Category=Other_Letter, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Alphabetic, XID_Continue=True, XID_Start=True

Script=Oriya
IS CONTAINED IN: ID_Continue=True, XID_Continue=True

Script=Osmanya
IS CONTAINED IN: Grapheme_Base=True, ID_Continue=True, XID_Continue=True

Script=Runic
IS CONTAINED IN: Alphabetic=True, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Alphabetic, XID_Continue=True, XID_Start=True

Script=Shavian
IS CONTAINED IN: Alphabetic=True, General_Category=Other_Letter, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Alphabetic, XID_Continue=True, XID_Start=True

Script=Sinhala
IS CONTAINED IN: ID_Continue=True, XID_Continue=True

Script=Syriac
IS CONTAINED IN: ID_Continue=True, XID_Continue=True

Script=Tagalog
IS CONTAINED IN: ID_Continue=True, XID_Continue=True

Script=Tagbanwa
IS CONTAINED IN: Alphabetic=True, ID_Continue=True, XID_Continue=True

Script=Tai_Le
IS CONTAINED IN: Alphabetic=True, General_Category=Other_Letter, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Alphabetic, XID_Continue=True, XID_Start=True

Script=Telugu
IS CONTAINED IN: ID_Continue=True, XID_Continue=True

Script=Thaana
IS CONTAINED IN: Alphabetic=True, ID_Continue=True, XID_Continue=True

Script=Thai
IS CONTAINED IN: ID_Continue=True, XID_Continue=True

Script=Ugaritic
IS CONTAINED IN: Alphabetic=True, General_Category=Other_Letter, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Alphabetic, XID_Continue=True, XID_Start=True

Script=Yi
IS CONTAINED IN: East_Asian_Width=Wide, Grapheme_Base=True, Line_Break=Ideographic

Soft_Dotted=True
IS CONTAINED IN: Alphabetic=True, General_Category=Lowercase_Letter, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Line_Break=Alphabetic, Lowercase=True, XID_Continue=True, XID_Start=True

Terminal_Punctuation=True
IS CONTAINED IN: General_Category=Other_Punctuation, Grapheme_Base=True

Unified_Ideograph=True
IS CONTAINED IN: Alphabetic=True, East_Asian_Width=Wide, General_Category=Other_Letter, Grapheme_Base=True, ID_Continue=True, ID_Start=True, Ideographic=True, Line_Break=Ideographic, Script=Han, XID_Continue=True, XID_Start=True

Uppercase=True
CONTAINS: General_Category=Uppercase_Letter
IS CONTAINED IN: Grapheme_Base=True

White_Space=True
CONTAINS: Bidi_Class=White_Space, General_Category=Line_Separator, General_Category=Paragraph_Separator, Line_Break=Carriage_Return, Line_Break=Line_Feed, Line_Break=Mandatory_Break, Line_Break=Next_Line, Line_Break=Space

XID_Continue=True
CONTAINS: ASCII_Hex_Digit=True, Decomposition_Type=Final, Decomposition_Type=Initial, Decomposition_Type=Medial, Extender=True, General_Category=Connector_Punctuation, General_Category=Decimal_Number, General_Category=Letter_Number, General_Category=Lowercase_Letter, General_Category=Nonspacing_Mark, General_Category=Spacing_Mark, General_Category=Titlecase_Letter, General_Category=Uppercase_Letter, Grapheme_Link=True, Hangul_Syllable_Type=LVT_Syllable, Hangul_Syllable_Type=LV_Syllable, Hangul_Syllable_Type=Leading_Jamo, Hangul_Syllable_Type=Trailing_Jamo, Hangul_Syllable_Type=Vowel_Jamo, Hex_Digit=True, Ideographic=True, Joining_Type=Dual_Joining, Joining_Type=Right_Joining, Line_Break=Numeric, Logical_Order_Exception=True, Numeric_Type=Decimal, Script=Armenian, Script=Bengali, Script=Bopomofo, Script=Buhid, Script=Canadian_Aboriginal, Script=Cherokee, Script=Cypriot, Script=Cyrillic, Script=Deseret, Script=Devanagari, Script=Georgian, Script=Gothic, Script=Gujarati, Script=Gurmukhi, Script=Hangul, Script=Hanunoo, Script=Hebrew, Script=Hiragana, Script=Kannada, Script=Katakana, Script=Lao, Script=Latin, Script=Limbu, Script=Linear_B, Script=Malayalam, Script=Mongolian, Script=Myanmar, Script=Ogham, Script=Old_Italic, Script=Oriya, Script=Osmanya, Script=Runic, Script=Shavian, Script=Sinhala, Script=Syriac, Script=Tagalog, Script=Tagbanwa, Script=Tai_Le, Script=Telugu, Script=Thaana, Script=Thai, Script=Ugaritic, Soft_Dotted=True, Unified_Ideograph=True, XID_Start=True

XID_Start=True
CONTAINS: Decomposition_Type=Final, Decomposition_Type=Initial, Decomposition_Type=Medial, General_Category=Letter_Number, General_Category=Lowercase_Letter, General_Category=Titlecase_Letter, General_Category=Uppercase_Letter, Hangul_Syllable_Type=LVT_Syllable, Hangul_Syllable_Type=LV_Syllable, Hangul_Syllable_Type=Leading_Jamo, Hangul_Syllable_Type=Trailing_Jamo, Hangul_Syllable_Type=Vowel_Jamo, Ideographic=True, Joining_Type=Dual_Joining, Joining_Type=Right_Joining, Logical_Order_Exception=True, Script=Armenian, Script=Bopomofo, Script=Canadian_Aboriginal, Script=Cherokee, Script=Cypriot, Script=Deseret, Script=Georgian, Script=Gothic, Script=Hangul, Script=Hebrew, Script=Hiragana, Script=Katakana, Script=Latin, Script=Linear_B, Script=Ogham, Script=Old_Italic, Script=Runic, Script=Shavian, Script=Tai_Le, Script=Ugaritic, Soft_Dotted=True, Unified_Ideograph=True
IS CONTAINED IN: Grapheme_Base=True, ID_Continue=True, ID_Start=True, XID_Continue=True

NONE OF THE ABOVE: Deprecated=True, Joining_Type=Join_Causing, Joining_Type=Left_Joining, Line_Break=Break_After, Noncharacter_Code_Point=True, Script=Khmer, Script=Tamil, Script=Tibetan