Unicode Utilities: UnicodeSet

Warning: Testing version with both ICU and Unicode 10.0β properties!

help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | idna | languageid

Input
              

5,925 Code Points


[\u0000-\u0008\u000E-\u001F\u007F-\u0084\u0086-\u009F\u00AD\u061C\u180E\u200B\u200E\u200F\u202A-\u202E\u2060-\u2064\u2066-\u206F\uFEFF\uFFF9-\uFFFB\U0001BCA0-\U0001BCA3\U0001D173-\U0001D17A\U000E0001 \u0009 \u000B \u000C \u0085 \u2028 \u2029 \u2065 \uD800-\uDFFF \uFFF0-\uFFF8 \U000E0000 \U000E0002-\U000E001F \U000E0080-\U000E00FF \U000E01F0-\U000E0FFF]


Unassigned, Private use, or Surrogates
items: 5,817

  U+2065<unassigned-2065>
 � U+D800<lead surrogate-D800>
…{2046}…
 � U+DFFF<trail surrogate-DFFF>
  U+FFF0<unassigned-FFF0>
…{7}…
  U+FFF8<unassigned-FFF8>
  U+E0000<unassigned-E0000>
  U+E0002<unassigned-E0002>
…{28}…
  U+E001F<unassigned-E001F>
  U+E0080<unassigned-E0080>
…{126}…
  U+E00FF<unassigned-E00FF>
  U+E01F0<unassigned-E01F0>
…{3598}…
  U+E0FFF<unassigned-E0FFF>

Basic LatinC0 controls
items: 30

 � U+0000<control-0000>
 � U+0001<control-0001>
 � U+0002<control-0002>
 � U+0003<control-0003>
 � U+0004<control-0004>
 � U+0005<control-0005>
 � U+0006<control-0006>
 � U+0007<control-0007>
 � U+0008<control-0008>
   U+0009<control-0009>
   U+000B<control-000B>
   U+000C<control-000C>
 � U+000E<control-000E>
 � U+000F<control-000F>
 � U+0010<control-0010>
 � U+0011<control-0011>
 � U+0012<control-0012>
 � U+0013<control-0013>
 � U+0014<control-0014>
 � U+0015<control-0015>
 � U+0016<control-0016>
 � U+0017<control-0017>
 � U+0018<control-0018>
 � U+0019<control-0019>
 � U+001A<control-001A>
 � U+001B<control-001B>
 � U+001C<control-001C>
 � U+001D<control-001D>
 � U+001E<control-001E>
 � U+001F<control-001F>

Basic LatinControl character
items: 1

 � U+007F<control-007F>

Latin 1 SupplementC1 controls
items: 32

 � U+0080<control-0080>
 � U+0081<control-0081>
 � U+0082<control-0082>
 � U+0083<control-0083>
 � U+0084<control-0084>
 … U+0085<control-0085>
 � U+0086<control-0086>
 � U+0087<control-0087>
 � U+0088<control-0088>
 � U+0089<control-0089>
 � U+008A<control-008A>
 � U+008B<control-008B>
 � U+008C<control-008C>
 � U+008D<control-008D>
 � U+008E<control-008E>
 � U+008F<control-008F>
 � U+0090<control-0090>
 � U+0091<control-0091>
 � U+0092<control-0092>
 � U+0093<control-0093>
 � U+0094<control-0094>
 � U+0095<control-0095>
 � U+0096<control-0096>
 � U+0097<control-0097>
 � U+0098<control-0098>
 � U+0099<control-0099>
 � U+009A<control-009A>
 � U+009B<control-009B>
 � U+009C<control-009C>
 � U+009D<control-009D>
 � U+009E<control-009E>
 � U+009F<control-009F>

Latin 1 SupplementLatin-1 punctuation and symbols
items: 1

  U+00ADSOFT HYPHEN

ArabicFormat character
items: 1

  U+061CARABIC LETTER MARK

MongolianFormat controls
items: 1

  U+180EMONGOLIAN VOWEL SEPARATOR

General PunctuationFormat character
items: 15

  U+200BZERO WIDTH SPACE
 ‎ U+200ELEFT-TO-RIGHT MARK
 ‎‏‎ U+200FRIGHT-TO-LEFT MARK
 
 U+2028LINE SEPARATOR
 
 U+2029PARAGRAPH SEPARATOR
  U+202ALEFT-TO-RIGHT EMBEDDING
  U+202BRIGHT-TO-LEFT EMBEDDING
  U+202CPOP DIRECTIONAL FORMATTING
  U+202DLEFT-TO-RIGHT OVERRIDE
  U+202ERIGHT-TO-LEFT OVERRIDE
  U+2060WORD JOINER
  U+2066LEFT-TO-RIGHT ISOLATE
  U+2067RIGHT-TO-LEFT ISOLATE
  U+2068FIRST STRONG ISOLATE
  U+2069POP DIRECTIONAL ISOLATE

General PunctuationInvisible operators
items: 4

  U+2061FUNCTION APPLICATION
  U+2062INVISIBLE TIMES
  U+2063INVISIBLE SEPARATOR
  U+2064INVISIBLE PLUS

General PunctuationDeprecated
items: 6

  U+206AINHIBIT SYMMETRIC SWAPPING
  U+206BACTIVATE SYMMETRIC SWAPPING
  U+206CINHIBIT ARABIC FORM SHAPING
  U+206DACTIVATE ARABIC FORM SHAPING
  U+206ENATIONAL DIGIT SHAPES
  U+206FNOMINAL DIGIT SHAPES

Arabic Presentation Forms BSpecial
items: 1

  U+FEFFZERO WIDTH NO-BREAK SPACE

SpecialsInterlinear annotation
items: 3

  U+FFF9INTERLINEAR ANNOTATION ANCHOR
  U+FFFAINTERLINEAR ANNOTATION SEPARATOR
  U+FFFBINTERLINEAR ANNOTATION TERMINATOR

Shorthand Format ControlsShorthand format controls
items: 4

  U+1BCA0SHORTHAND FORMAT LETTER OVERLAP
  U+1BCA1SHORTHAND FORMAT CONTINUING OVERLAP
  U+1BCA2SHORTHAND FORMAT DOWN STEP
  U+1BCA3SHORTHAND FORMAT UP STEP

Musical SymbolsBeams and slurs
items: 8

  U+1D173MUSICAL SYMBOL BEGIN BEAM
  U+1D174MUSICAL SYMBOL END BEAM
  U+1D175MUSICAL SYMBOL BEGIN TIE
  U+1D176MUSICAL SYMBOL END TIE
  U+1D177MUSICAL SYMBOL BEGIN SLUR
  U+1D178MUSICAL SYMBOL END SLUR
  U+1D179MUSICAL SYMBOL BEGIN PHRASE
  U+1D17AMUSICAL SYMBOL END PHRASE

TagsTag identifiers
items: 1

  U+E0001LANGUAGE TAG

Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Noto Fonts site, Unicode Fonts for Ancient Scripts, Large, multi-script Unicode fonts. See also: Unicode Display Problems.

Version 3.8; ICU version: 59.1.0.0; Unicode version: 9.0.0.0