Unicode Utilities: UnicodeSet

Warning: Testing version with properties from ICU (Unicode 9.0), Unicode 10.0β, and emoji 6.0β.

help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | idna | languageid

Input
              

2,156 Code Points


[\u0000-\u0008\u000E-\u001F\u007F-\u0084\u0086-\u009F\u00AD\u061C\u180E\u200B\u200E\u200F\u202A-\u202E\u2060-\u2064\u2066-\u206F\uFEFF\uFFF9-\uFFFB\U0001BCA0-\U0001BCA3\U0001D173-\U0001D17A\U000E0001 \u0009 \u000B \u000C \u0085 \u2028 \u2029 \uD800-\uDFFF]


Unassigned, Private use, or Surrogates
items: 2,048

 � U+D800null
…{2046}…
 � U+DFFFnull

Basic LatinC0 controls
items: 30

 � U+0000null
 � U+0001null
 � U+0002null
 � U+0003null
 � U+0004null
 � U+0005null
 � U+0006null
 � U+0007null
 � U+0008null
   U+0009null
   U+000Bnull
   U+000Cnull
 � U+000Enull
 � U+000Fnull
 � U+0010null
 � U+0011null
 � U+0012null
 � U+0013null
 � U+0014null
 � U+0015null
 � U+0016null
 � U+0017null
 � U+0018null
 � U+0019null
 � U+001Anull
 � U+001Bnull
 � U+001Cnull
 � U+001Dnull
 � U+001Enull
 � U+001Fnull

Basic LatinControl character
items: 1

 � U+007Fnull

Latin 1 SupplementC1 controls
items: 32

 � U+0080null
 � U+0081null
 � U+0082null
 � U+0083null
 � U+0084null
 … U+0085null
 � U+0086null
 � U+0087null
 � U+0088null
 � U+0089null
 � U+008Anull
 � U+008Bnull
 � U+008Cnull
 � U+008Dnull
 � U+008Enull
 � U+008Fnull
 � U+0090null
 � U+0091null
 � U+0092null
 � U+0093null
 � U+0094null
 � U+0095null
 � U+0096null
 � U+0097null
 � U+0098null
 � U+0099null
 � U+009Anull
 � U+009Bnull
 � U+009Cnull
 � U+009Dnull
 � U+009Enull
 � U+009Fnull

Latin 1 SupplementLatin-1 punctuation and symbols
items: 1

  U+00ADSOFT HYPHEN

ArabicFormat character
items: 1

  U+061CARABIC LETTER MARK

MongolianFormat controls
items: 1

  U+180EMONGOLIAN VOWEL SEPARATOR

General PunctuationFormat character
items: 15

  U+200BZERO WIDTH SPACE
 ‎ U+200ELEFT-TO-RIGHT MARK
 ‎‏‎ U+200FRIGHT-TO-LEFT MARK
 
 U+2028LINE SEPARATOR
 
 U+2029PARAGRAPH SEPARATOR
  U+202ALEFT-TO-RIGHT EMBEDDING
  U+202BRIGHT-TO-LEFT EMBEDDING
  U+202CPOP DIRECTIONAL FORMATTING
  U+202DLEFT-TO-RIGHT OVERRIDE
  U+202ERIGHT-TO-LEFT OVERRIDE
  U+2060WORD JOINER
  U+2066LEFT-TO-RIGHT ISOLATE
  U+2067RIGHT-TO-LEFT ISOLATE
  U+2068FIRST STRONG ISOLATE
  U+2069POP DIRECTIONAL ISOLATE

General PunctuationInvisible operators
items: 4

  U+2061FUNCTION APPLICATION
  U+2062INVISIBLE TIMES
  U+2063INVISIBLE SEPARATOR
  U+2064INVISIBLE PLUS

General PunctuationDeprecated
items: 6

  U+206AINHIBIT SYMMETRIC SWAPPING
  U+206BACTIVATE SYMMETRIC SWAPPING
  U+206CINHIBIT ARABIC FORM SHAPING
  U+206DACTIVATE ARABIC FORM SHAPING
  U+206ENATIONAL DIGIT SHAPES
  U+206FNOMINAL DIGIT SHAPES

Arabic Presentation Forms BSpecial
items: 1

  U+FEFFZERO WIDTH NO-BREAK SPACE

SpecialsInterlinear annotation
items: 3

  U+FFF9INTERLINEAR ANNOTATION ANCHOR
  U+FFFAINTERLINEAR ANNOTATION SEPARATOR
  U+FFFBINTERLINEAR ANNOTATION TERMINATOR

Shorthand Format ControlsShorthand format controls
items: 4

  U+1BCA0SHORTHAND FORMAT LETTER OVERLAP
  U+1BCA1SHORTHAND FORMAT CONTINUING OVERLAP
  U+1BCA2SHORTHAND FORMAT DOWN STEP
  U+1BCA3SHORTHAND FORMAT UP STEP

Musical SymbolsBeams and slurs
items: 8

  U+1D173MUSICAL SYMBOL BEGIN BEAM
  U+1D174MUSICAL SYMBOL END BEAM
  U+1D175MUSICAL SYMBOL BEGIN TIE
  U+1D176MUSICAL SYMBOL END TIE
  U+1D177MUSICAL SYMBOL BEGIN SLUR
  U+1D178MUSICAL SYMBOL END SLUR
  U+1D179MUSICAL SYMBOL BEGIN PHRASE
  U+1D17AMUSICAL SYMBOL END PHRASE

TagsTag identifiers
items: 1

  U+E0001LANGUAGE TAG

Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Noto Fonts site, Unicode Fonts for Ancient Scripts, Large, multi-script Unicode fonts. See also: Unicode Display Problems.

Version 3.8; ICU version: 59.1.0.0; Unicode version: 9.0.0.0