Unicode Utilities: Character Properties

help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | idna | languageid


 十 
5341
CJK UNIFIED IDEOGRAPH-5341
Han Script
id: allowed
confuse: ,
Properties for U+5341
With Non-Default ValuesWith Default Values
Age1.1
alnumYes
AlphabeticYes
Bidi_Paired_Bracketnull
BlockCJK_Unified_Ideographs
East_Asian_WidthWide
enc_Big5A4 51
enc_EUC-KRE4 A8
enc_GB2312CA AE
enc_GBKCA AE
enc_Shift_JIS8F 5C
General_CategoryOther_Letter
graphYes
Grapheme_BaseYes
HanTypeHan
ID_ContinueYes
ID_StartYes
identifier-restrictionrecommended
IdeographicYes
idna2003valid
idna2008PVALID
idna2008cvalid
is_enc_Big5Yes
is_enc_EUC-KRYes
is_enc_GB2312Yes
is_enc_GBKYes
is_enc_Shift_JISYes
isCasedYes
isCasefoldedYes
isLowercaseYes
ISO_Commentnull
isTitlecaseYes
isUppercaseYes
Line_BreakIdeographic
Numeric_TypeNumeric
Numeric_Value10.0
printYes
ScriptHan
Script_ExtensionsHan
Sentence_BreakOLetter
subheadnull
Subheadernull
toIdna2003null
toUts46nnull
toUts46tnull
Unicode_1_Namenull
Unified_IdeographYes
Usagecommon
uts46valid
XID_ContinueYes
XID_StartYes
ANYYes
ASCIINo
ASCII_Hex_DigitNo
Bidi_ClassLeft_To_Right
Bidi_ControlNo
Bidi_MirroredNo
Bidi_Mirroring_Glyph
Bidi_Paired_Bracket_TypeNone
blankNo
bmpYes
Canonical_Combining_ClassNot_Reordered
Case_Folding
Case_IgnorableNo
Case_SensitiveNo
CasedNo
Changes_When_CasefoldedNo
Changes_When_CasemappedNo
Changes_When_LowercasedNo
Changes_When_NFKC_CasefoldedNo
Changes_When_TitlecasedNo
Changes_When_UppercasedNo
DashNo
Decomposition_TypeNone
Default_Ignorable_Code_PointNo
DeprecatedNo
DiacriticNo
emojiNo
enc_ISO-8859-1
enc_ISO-8859-2
enc_ISO-8859-3
enc_ISO-8859-4
enc_ISO-8859-5
enc_ISO-8859-6
enc_ISO-8859-7
enc_ISO-8859-8
enc_ISO-8859-9
enc_ISO-8859-13
enc_ISO-8859-15
ExtenderNo
Full_Composition_ExclusionNo
Grapheme_Cluster_BreakOther
Grapheme_ExtendNo
Grapheme_LinkNo
Hangul_Syllable_TypeNot_Applicable
Hex_DigitNo
HyphenNo
IDS_Binary_OperatorNo
IDS_Trinary_OperatorNo
is_enc_ISO-8859-1No
is_enc_ISO-8859-2No
is_enc_ISO-8859-3No
is_enc_ISO-8859-4No
is_enc_ISO-8859-5No
is_enc_ISO-8859-6No
is_enc_ISO-8859-7No
is_enc_ISO-8859-8No
is_enc_ISO-8859-9No
is_enc_ISO-8859-13No
is_enc_ISO-8859-15No
isNFCYes
isNFDYes
isNFKCYes
isNFKDYes
isNFMYes
Join_ControlNo
Joining_GroupNo_Joining_Group
Joining_TypeNon_Joining
Lead_Canonical_Combining_ClassNot_Reordered
Logical_Order_ExceptionNo
LowercaseNo
Lowercase_Mapping
MathNo
NFC_InertYes
NFC_Quick_CheckYes
NFD_InertYes
NFD_Quick_CheckYes
NFKC_Casefold
NFKC_InertYes
NFKC_Quick_CheckYes
NFKD_InertYes
NFKD_Quick_CheckYes
Noncharacter_Code_PointNo
Pattern_SyntaxNo
Pattern_White_SpaceNo
Quotation_MarkNo
RadicalNo
Segment_StarterYes
Simple_Case_Folding
Simple_Lowercase_Mapping
Simple_Titlecase_Mapping
Simple_Uppercase_Mapping
Soft_DottedNo
STermNo
Terminal_PunctuationNo
Titlecase_Mapping
toCasefold
toLowercase
toLowerCase
toNfc
toNFC
toNfd
toNFD
toNfkc
toNFKC
toNfkd
toNFKD
toNFM
toTitlecase
toTitleCase
toUppercase
toUpperCase
Trail_Canonical_Combining_ClassNot_Reordered
ucanull
uca2null
uca2.5null
uca3null
UppercaseNo
Uppercase_Mapping
Variation_SelectorNo
White_SpaceNo
Word_BreakOther
xdigitNo

The list includes both Unicode Character Properties and some additions (like idna2003 or subhead)


Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Unicode Fonts for Ancient Scripts, Noto Fonts site, Large, multi-script Unicode fonts. See also: Unicode Display Problems.

Version 3.7; ICU version: 54.0.1.0; Unicode version: 7.0.0.0