LGR for unspecified language Selected-recommended-IdentifierType-not-in-MSR-and-RefLGR

This document is mechanically formatted from the above XML file for the LGR. It provides additional summary data and explanatory text. The XML file remains the sole normative specification of the LGR.

Date 2025-01-09
LGR Version 16.0.0
Unicode Version 16.0.0

Description

L2/25-033
Partially updates
L2/19-329R

Characters excluded from both MSR and Reference LGR but allowed in UTS#39

This document has been submitted as a UTC document. For convenience in documenting the character list it is presented using an LGR template format. A few minor details of the boilerplate in that template may not be applicable in this context and should be disregarded.

The collection comprises 696 characters are not part of the Reference LGR [RefLGR] and also excluded from the MSR [MSR], as well as the uppercase equivalents for 128 of them (Latin, Cyrillic and Greek), for a total of 824 characters. These characters are listed as Recommended or Inclusion in UTS#39.

Recommendation

These 824 characters should not be considered Recommended, based on an independent analysis [MSR] that has found indications that they should have been considered Uncommon_Use, Obsolete or Technical based on information available at the time of encoding, such as character proposal documents, correlated with available information on the use of the writing systems these were intended for. Where they are already assigned Identifier_Type Inclusion, no change is recommended.

More specifically the Identifier_Types should be updated, based on the tags provided:

  • Technical for "technical", "poetic", "religious_use", "modifier" or "phonetic"
  • Obsolete for "obsolete" or "historic" and "(Greek)Uppercase" (where lowercase is "obsolete" or "historic")
  • Uncommon_Use "uncommon_use", "limited_use" and "Uppercase" (where lowercase is "uncommon_use")

A few characters have been given no classification in the tags. These are for the most part not in IDNA2008 or they represent punctuation with Identifier_Type Inclusion, which seems fine. Some characters are classified as "context*" which makes them ineligible for the [MSR] and [RZ-LGR], but not the [RefLGR].

The best matching Identifier_Type for characters tagged only as "punctuation" and "symbol" is unclear; there's a strong bias against intra-word punctuation in the Reference LGRs and a complete prohibition in the DNS Root Zone ("letter principle"). Arguably most intra-word punctuation not generic enough for use in IDNs might perhaps best be considered for optional "Inclusion" instead of "Recommended", unless there's an indication that the use is uncommon or obsolete.

The latest published result of the [MSR] project has a cutoff of Unicode, Version 11.0. The current document includes characters added as Recommended between versions 12.0 and 16.0 for which analysis is preliminary. These should be treated as "tentatively analyzed" and have been tagged accordingly. Unless their need for identifiers, or use in a reasonably widely used general orthography for any language can be established affirmatively from other sources, they should NOT retain Identifier_Type Recommended. Instead, they should be considered Uncommon_Use, unless a they can be identified as Obsolete or Technical. There are no indications that any of these would be included in a future update of the [MSR], and inclusion in either [RZ-LGR] or [RefLGR] in the near term would seem even less likely.

Background

There are about a thousand non-Han characters with Identifier_Type Recommended that are proposed for consideration of reclassification because they appear to fail reasonable criteria for being needed in identifiers. They come in two sets. For the set documented here, an independent analysis [MSR] has found indications that they should have been considered Uncommon_Use, Obsolete or Technical based on information available at the time of encoding. The second set contains characters that were tentatively retained as Recommended in the [MSR] but upon further review by local expert teams from the [RZ-LGR] project were found to not be needed after all for any language or minority language in reasonably widespread use. That set is discussed in another document.

The analysis on which the recommendations in this document is based was originally carried out for the purposes of defining the repertoire for IDN Top Level Domain names for the DNS Root Zone. There are some restrictions that are specific to the Root Zone, such as a prohibition on digits, so a follow-on effort determined how to relax these restrictions in a manner appropriate for the needs of Second-Level Domains. This resulted in the Second-Level Reference Label Generation Rules [RefLGR]. The characters listed here are those that were not included in either the [MSR] or the [RefLGR].

Because the original analysis [MSR] found specific indications why the characters in this document should not be recommended, they have been broken out separately and are listed in this document with the rationale for that exclusion provided. Notably, the review for purposes of the [RefLGR] did find that a few characters tentatively excluded in the [MSR] were indeed appropriate for domain names on the Second Level, while most were not.

The implication here is that any character not included in this set should be classified with some value other than Recommended for Unicode's default identifiers—until such time as independent evidence to the contrary is produced.

Arriving at a precise cutoff for the various Identifier_Type categories is difficult because there is no single source or perfect information on the use of writing systems and the de-facto general purpose subset thereof. In addition, the details of such use may change over time. Accordingly, this document suggests that the UTC should consider the published results of the cited research as one of the better sources of information available and only deviate from it on the basis of even better information.

All decisions for the classification of characters in [MSR], or inclusion in [RZ-LGR] and [RefLGR] are documented and sourced on the character level; the same is not true for Unicode's classification, so it is not easily possible to verify any of the decisions that underlie the classification published in UTS39. By first making the alignment proposed here, and then carefully documenting deviations, a positive side effect might be that the classification overall becomes more transparent and reviewable.

Special Considerations

The [MSR] and [RefLGR] exclude the Bopomofo script, considering the entire script special use as it tends to be used almost exclusively in education. This could be addressed by either changing the status of the script in UAX31 to Limited_Use or to mark the entire set of Bopomofo characters as Technical.

The domain name registry community appears to have settled on regarding the Greek polytoniko orthography as obsolete for Greek. This should be reflected in Unicode's default identifiers.

Defaults for characters newly assigned for Unicode 17.0.0 and after. Newly encoded characters for existing scripts should not be given Identifier_Type Recommended by default, as most often, scripts are supplemented with additional Technical or Obsolete characters. For modern characters, unless they are intended for support of a particularly vigorous language with well established orthography in widespread use, they should be considered Uncommon_Use by default. The same should be retroactively applied to the few characters added to the recommended scripts since the cutoff for the [MSR] work (as documented in the latest MSR version).

High risk characters. Certain characters are risky for security relevant identifiers, such as IDNs. These include characters that look like punctuation marks. The usual mitigation mechanism for visually equivalent characters apply to situations where both partners are in the repertoire. By definition, that is not the case for these, as the confusability is with delimiting punctuation which whould falsely suggest part of the string is outside the label. They should therefore not be recommended for default identifiers.

No definite recommendations can be made for the Tibetan script at this time. It is considered by ICANN as eligible for the Root Zone in principle, but work on defining the label generation rules has faced some difficulties and has not commenced. Nevertheless, Tibetan was covered in the MSR analysis, and for some characters a suggested adjustment of Identifier_Type can be provided. Beyond that, it might be reasonable to reflect the preliminary nature of the understanding of the likely use of Tibetan in identifiers by also not giving any of the Tibetan characters an Identifier_Type Recommended until some body, project, or group has created a definite analysis of this script for identifier purposes. (Only Tibetan characters with proposed exclusion from the [MSR] have been included in this document).

Root Zone and Reference LGRs

For further background on the DNS Root Zone and Second-Level Reference LGR see the cited references and links therein.

Additional Notes

All characters have tags matching indicating a recommended classification based on available evidence. This classification was used by experts and reviewers in making an initial cutoff for the MSR and has been slightly amended here.

The classification of characters excluded from the MSR is a bit more fine-grained than the current values for Identifier Type, see the recommended mappings above.

Uppercase equivalents are tagged as Uppercase or GreekUppercase only, their classification is understood to match that of their lowercase equivalent. As uppercase characters are used in IDNA2008, they are not independently evaluated here. Any other characters not in IDNA2008 are not classified, but indicated in the comments.

For other characters, comments give further information, such as the name of a language that can be considered limited in use, a Unicode version or a UTC document reference. Code points slated for encoding in 17.0 may not display with other than code point value in the table as the tool formatting the template cannot retrieve the character name or script for unreleased versions.

Discussion and Review

Domain names are an important, and deliberately conservative set of identifiers. That said, there may be other classes of identifiers that don't require the same level of restrictions, so this proposal should not be understood to suggest that default Identifiers must be restricted to only those characters that are being recommended for IDNs. Rather, the purpose is to bring the facts discovered during the development of the IDN repertoire for the DNS Root Zone and the [RefLGR] to the attention of the Unicode Technical Committee, so that characters that were classified Recommended can be given additional scrutiny before confirming their status.

As review progresses, a number of characters have been identified that may well have documented use:

  • U+063D ؽ ARABIC LETTER FARSI YEH WITH INVERTED V - this letter is claimed to be in use with Azerbaijani, and should not be considered "obsolete, historic".
  • U+093D DEVANAGARI SING AVAGRAHA - this sign is claimed to be in use in modern Hindi for writing foreign sounds (cool: कूऽल) or stretched native ones (माँऽऽऽ!). This may be a matter of which criteria are to be used, UTC should make an affirmative decision.

The repertoires needed for Uyghur, Kazakh, and Kyrgyz, all written in Arabic in China with their own published standards are claimed to be missing in the cited review. However, they are mentioned in “Proposal for Arabic Script Root Zone LGR”, [Proposal-Arabic]. It might be necessary to make a more detailed review based on the full claimed repertoire for these.

Contributors

This excerpt was prepared by Asmus Freytag, based on published data found in [RefLGR] and reference information from [MSR]. For details on the process and contributors to those projects, see [RefLGR-Overview], in particular, Section 1, “Overview” and Section 6, “Contributors”. Mark Davis, Michel Suignard and Roozbeh Pournader have contributed feedback.

Repertoire

Repertoire Summary

Number of elements in repertoire 824
Longest code point sequence 1

Repertoire by Code Point

The following table lists the repertoire by code point (or code point sequence). The data in the Script and Name column are extracted from the Unicode character database. Where a comment in the original LGR is equal to the character name, it has been suppressed.

See also the legend provided below the table.

Code
Point
Glyph Script Name Ref Tags Comment
U+0027 ' Common APOSTROPHE   Inclusion Not IDN2008
U+002E . Common FULL STOP   Inclusion Not IDN2008
U+003A : Common COLON   Inclusion Not IDN2008
U+005F _ Common LOW LINE     Not IDN2008
U+0138 ĸ Latin LATIN SMALL LETTER KRA [100] obsolete Greenlandic
U+0200 Ȁ Latin LATIN CAPITAL LETTER A WITH DOUBLE GRAVE   Uppercase Not IDNA2008
U+0201 ȁ Latin LATIN SMALL LETTER A WITH DOUBLE GRAVE [100] poetic, technical tone in poetry
U+0202 Ȃ Latin LATIN CAPITAL LETTER A WITH INVERTED BREVE   Uppercase Not IDNA2008
U+0203 ȃ Latin LATIN SMALL LETTER A WITH INVERTED BREVE [100] poetic, technical tone in poetry
U+0204 Ȅ Latin LATIN CAPITAL LETTER E WITH DOUBLE GRAVE   Uppercase Not IDNA2008
U+0205 ȅ Latin LATIN SMALL LETTER E WITH DOUBLE GRAVE [100] poetic, technical tone in poetry
U+0206 Ȇ Latin LATIN CAPITAL LETTER E WITH INVERTED BREVE   Uppercase Not IDNA2008
U+0207 ȇ Latin LATIN SMALL LETTER E WITH INVERTED BREVE [100] poetic, technical tone in poetry
U+0208 Ȉ Latin LATIN CAPITAL LETTER I WITH DOUBLE GRAVE   Uppercase Not IDNA2008
U+0209 ȉ Latin LATIN SMALL LETTER I WITH DOUBLE GRAVE [100] poetic, technical tone in poetry
U+020A Ȋ Latin LATIN CAPITAL LETTER I WITH INVERTED BREVE   Uppercase Not IDNA2008
U+020B ȋ Latin LATIN SMALL LETTER I WITH INVERTED BREVE [100] poetic, technical tone in poetry
U+020C Ȍ Latin LATIN CAPITAL LETTER O WITH DOUBLE GRAVE   Uppercase Not IDNA2008
U+020D ȍ Latin LATIN SMALL LETTER O WITH DOUBLE GRAVE [100] poetic, technical tone in poetry
U+020E Ȏ Latin LATIN CAPITAL LETTER O WITH INVERTED BREVE   Uppercase Not IDNA2008
U+020F ȏ Latin LATIN SMALL LETTER O WITH INVERTED BREVE [100] poetic, technical tone in poetry
U+0210 Ȑ Latin LATIN CAPITAL LETTER R WITH DOUBLE GRAVE   Uppercase Not IDNA2008
U+0211 ȑ Latin LATIN SMALL LETTER R WITH DOUBLE GRAVE [100] poetic, technical tone in poetry
U+0212 Ȓ Latin LATIN CAPITAL LETTER R WITH INVERTED BREVE   Uppercase Not IDNA2008
U+0213 ȓ Latin LATIN SMALL LETTER R WITH INVERTED BREVE [100] poetic, technical tone in poetry
U+0214 Ȕ Latin LATIN CAPITAL LETTER U WITH DOUBLE GRAVE   Uppercase Not IDNA2008
U+0215 ȕ Latin LATIN SMALL LETTER U WITH DOUBLE GRAVE [100] poetic, technical tone in poetry
U+0216 Ȗ Latin LATIN CAPITAL LETTER U WITH INVERTED BREVE   Uppercase Not IDNA2008
U+0217 ȗ Latin LATIN SMALL LETTER U WITH INVERTED BREVE [100] poetic, technical tone in poetry
U+02BB ʻ Common MODIFIER LETTER TURNED COMMA [100] punctuation excluded from domain names because of similarity to punctuation
U+02BC ʼ Common MODIFIER LETTER APOSTROPHE [100] technical excluded from domain names because of similarity to punctuation
U+02EC ˬ Common MODIFIER LETTER VOICING [100] technical  
U+030F  ̏ Inherited COMBINING DOUBLE GRAVE ACCENT [100] technical Serbian and Croatian poetics
U+0310  ̐ Inherited COMBINING CANDRABINDU [100] technical  
U+0311  ̑ Inherited COMBINING INVERTED BREVE [100] technical Serbian and Croatian poetics
U+0313  ̓ Inherited COMBINING COMMA ABOVE [100] technical  
U+0314  ̔ Inherited COMBINING REVERSED COMMA ABOVE [100] technical  
U+0324  ̤ Inherited COMBINING DIAERESIS BELOW [100] technical  
U+0325  ̥ Inherited COMBINING RING BELOW [100] technical  
U+032D  ̭ Inherited COMBINING CIRCUMFLEX ACCENT BELOW [100] technical  
U+032E  ̮ Inherited COMBINING BREVE BELOW [100] technical  
U+0330  ̰ Inherited COMBINING TILDE BELOW [100] technical  
U+0335  ̵ Inherited COMBINING SHORT STROKE OVERLAY [100] technical  
U+0338  ̸ Inherited COMBINING LONG SOLIDUS OVERLAY [100] technical  
U+0339  ̹ Inherited COMBINING RIGHT HALF RING BELOW [100] technical  
U+0342  ͂ Inherited COMBINING GREEK PERISPOMENI [100] technical  
U+0345  ͅ Inherited COMBINING GREEK YPOGEGRAMMENI     NOT IDNA2008
U+0375 ͵ Greek GREEK LOWER NUMERAL SIGN [100] context-other, Inclusion, obsolete IDNA2008 CONTEXTO
U+037B ͻ Greek GREEK SMALL REVERSED LUNATE SIGMA SYMBOL [100] obsolete  
U+037C ͼ Greek GREEK SMALL DOTTED LUNATE SIGMA SYMBOL [100] obsolete  
U+037D ͽ Greek GREEK SMALL REVERSED DOTTED LUNATE SIGMA SYMBOL [100] obsolete  
U+03FC ϼ Greek GREEK RHO WITH STROKE SYMBOL [100] obsolete, symbol  
U+03FD Ͻ Greek GREEK CAPITAL REVERSED LUNATE SIGMA SYMBOL   GreekUppercase Not IDNA2008
U+03FE Ͼ Greek GREEK CAPITAL DOTTED LUNATE SIGMA SYMBOL   GreekUppercase Not IDNA2008
U+03FF Ͽ Greek GREEK CAPITAL REVERSED DOTTED LUNATE SIGMA SYMBOL   GreekUppercase Not IDNA2008
U+048A Ҋ Cyrillic CYRILLIC CAPITAL LETTER SHORT I WITH TAIL   Uppercase Not IDNA2008
U+048B ҋ Cyrillic CYRILLIC SMALL LETTER SHORT I WITH TAIL [100] uncommon_use nearly extinct
U+048C Ҍ Cyrillic CYRILLIC CAPITAL LETTER SEMISOFT SIGN   Uppercase Not IDNA2008
U+048D ҍ Cyrillic CYRILLIC SMALL LETTER SEMISOFT SIGN [100] uncommon_use nearly extinct
U+048E Ҏ Cyrillic CYRILLIC CAPITAL LETTER ER WITH TICK   Uppercase Not IDNA2008
U+048F ҏ Cyrillic CYRILLIC SMALL LETTER ER WITH TICK [100] uncommon_use nearly extinct
U+049C Ҝ Cyrillic CYRILLIC CAPITAL LETTER KA WITH VERTICAL STROKE   Uppercase Not IDNA2008
U+049D ҝ Cyrillic CYRILLIC SMALL LETTER KA WITH VERTICAL STROKE [100] obsolete Azerbaijani
U+04A6 Ҧ Cyrillic CYRILLIC CAPITAL LETTER PE WITH MIDDLE HOOK   Uppercase Not IDNA2008
U+04A7 ҧ Cyrillic CYRILLIC SMALL LETTER PE WITH MIDDLE HOOK [100] obsolete Abkhazian
U+04B8 Ҹ Cyrillic CYRILLIC CAPITAL LETTER CHE WITH VERTICAL STROKE   Uppercase Not IDNA2008
U+04B9 ҹ Cyrillic CYRILLIC SMALL LETTER CHE WITH VERTICAL STROKE [100] obsolete Azerbaijani
U+04C3 Ӄ Cyrillic CYRILLIC CAPITAL LETTER KA WITH HOOK   Uppercase Not IDNA2008
U+04C4 ӄ Cyrillic CYRILLIC SMALL LETTER KA WITH HOOK [100] uncommon_use threatened
U+04C5 Ӆ Cyrillic CYRILLIC CAPITAL LETTER EL WITH TAIL   Uppercase Not IDNA2008
U+04C6 ӆ Cyrillic CYRILLIC SMALL LETTER EL WITH TAIL [100] uncommon_use nearly extinct
U+04C7 Ӈ Cyrillic CYRILLIC CAPITAL LETTER EN WITH HOOK   Uppercase Not IDNA2008
U+04C8 ӈ Cyrillic CYRILLIC SMALL LETTER EN WITH HOOK [100] uncommon_use threatened
U+04C9 Ӊ Cyrillic CYRILLIC CAPITAL LETTER EN WITH TAIL   Uppercase Not IDNA2008
U+04CA ӊ Cyrillic CYRILLIC SMALL LETTER EN WITH TAIL [100] uncommon_use nearly extinct
U+04CD Ӎ Cyrillic CYRILLIC CAPITAL LETTER EM WITH TAIL   Uppercase Not IDNA2008
U+04CE ӎ Cyrillic CYRILLIC SMALL LETTER EM WITH TAIL [100] uncommon_use nearly extinct
U+04F6 Ӷ Cyrillic CYRILLIC CAPITAL LETTER GHE WITH DESCENDER   Uppercase Not IDNA2008
U+04F7 ӷ Cyrillic CYRILLIC SMALL LETTER GHE WITH DESCENDER [100] uncommon_use educational
U+04FA Ӻ Cyrillic CYRILLIC CAPITAL LETTER GHE WITH STROKE AND HOOK   Uppercase Not IDNA2008
U+04FB ӻ Cyrillic CYRILLIC SMALL LETTER GHE WITH STROKE AND HOOK [100] uncommon_use nearly extinct
U+04FC Ӽ Cyrillic CYRILLIC CAPITAL LETTER HA WITH HOOK   Uppercase Not IDNA2008
U+04FD ӽ Cyrillic CYRILLIC SMALL LETTER HA WITH HOOK [100] uncommon_use nearly extinct
U+04FE Ӿ Cyrillic CYRILLIC CAPITAL LETTER HA WITH STROKE   Uppercase Not IDNA2008
U+04FF ӿ Cyrillic CYRILLIC SMALL LETTER HA WITH STROKE [100] uncommon_use nearly extinct
U+0510 Ԑ Cyrillic CYRILLIC CAPITAL LETTER REVERSED ZE   Uppercase Not IDNA2008
U+0511 ԑ Cyrillic CYRILLIC SMALL LETTER REVERSED ZE [100] uncommon_use threatened
U+0512 Ԓ Cyrillic CYRILLIC CAPITAL LETTER EL WITH HOOK   Uppercase Not IDNA2008
U+0513 ԓ Cyrillic CYRILLIC SMALL LETTER EL WITH HOOK [100] uncommon_use threatened
U+0514 Ԕ Cyrillic CYRILLIC CAPITAL LETTER LHA   Uppercase Not IDNA2008
U+0515 ԕ Cyrillic CYRILLIC SMALL LETTER LHA [100] obsolete Mordvin
U+0516 Ԗ Cyrillic CYRILLIC CAPITAL LETTER RHA   Uppercase Not IDNA2008
U+0517 ԗ Cyrillic CYRILLIC SMALL LETTER RHA [100] obsolete Mordvin
U+0518 Ԙ Cyrillic CYRILLIC CAPITAL LETTER YAE   Uppercase Not IDNA2008
U+0519 ԙ Cyrillic CYRILLIC SMALL LETTER YAE [100] obsolete Mordvin
U+051A Ԛ Cyrillic CYRILLIC CAPITAL LETTER QA   Uppercase Not IDNA2008
U+051B ԛ Cyrillic CYRILLIC SMALL LETTER QA [100] obsolete Abkhaz, Kurdish
U+051C Ԝ Cyrillic CYRILLIC CAPITAL LETTER WE   Uppercase Not IDNA2008
U+051D ԝ Cyrillic CYRILLIC SMALL LETTER WE [100] obsolete Abkhaz, Kurdish
U+051E Ԟ Cyrillic CYRILLIC CAPITAL LETTER ALEUT KA   Uppercase Not IDNA2008
U+051F ԟ Cyrillic CYRILLIC SMALL LETTER ALEUT KA [100] obsolete Aleut
U+0520 Ԡ Cyrillic CYRILLIC CAPITAL LETTER EL WITH MIDDLE HOOK   Uppercase Not IDNA2008
U+0521 ԡ Cyrillic CYRILLIC SMALL LETTER EL WITH MIDDLE HOOK [100] obsolete Abkhaz, Chuvash
U+0522 Ԣ Cyrillic CYRILLIC CAPITAL LETTER EN WITH MIDDLE HOOK   Uppercase Not IDNA2008
U+0523 ԣ Cyrillic CYRILLIC SMALL LETTER EN WITH MIDDLE HOOK [100] obsolete Chuvash
U+0526 Ԧ Cyrillic CYRILLIC CAPITAL LETTER SHHA WITH DESCENDER   Uppercase Not IDNA2008
U+0527 ԧ Cyrillic CYRILLIC SMALL LETTER SHHA WITH DESCENDER [100] obsolete Azerbaijani
U+0528 Ԩ Cyrillic CYRILLIC CAPITAL LETTER EN WITH LEFT HOOK   Uppercase Not IDNA2008
U+0529 ԩ Cyrillic CYRILLIC SMALL LETTER EN WITH LEFT HOOK [100] obsolete (Orok) (7.0)
U+052E Ԯ Cyrillic CYRILLIC CAPITAL LETTER EL WITH DESCENDER   Uppercase Not IDNA2008
U+052F ԯ Cyrillic CYRILLIC SMALL LETTER EL WITH DESCENDER [100] obsolete (Khanty, Nenets) (7.0)
U+0559 ՙ Armenian ARMENIAN MODIFIER LETTER LEFT HALF RING [100] technical  
U+058A ֊ Armenian ARMENIAN HYPHEN   Inclusion NOT IDNA2008
U+05CD ׍ Zzzz   tentatively-excluded Not yet analyzed for MSR
U+05EF ׯ Hebrew HEBREW YOD TRIANGLE [100] uncommon_use 11.0 16/305
U+05F3 ׳ Hebrew HEBREW PUNCTUATION GERESH [100] context-other, Inclusion IDNA2008 CONTEXTO
U+05F4 ״ Hebrew HEBREW PUNCTUATION GERSHAYIM [100] context-other, Inclusion IDNA2008 CONTEXTO
U+063B ػ Arabic ARABIC LETTER KEHEH WITH TWO DOTS ABOVE [100] obsolete historic
U+063C ؼ Arabic ARABIC LETTER KEHEH WITH THREE DOTS BELOW [100] obsolete historic
U+063D ؽ Arabic ARABIC LETTER FARSI YEH WITH INVERTED V [100] obsolete historic
U+063E ؾ Arabic ARABIC LETTER FARSI YEH WITH TWO DOTS ABOVE [100] obsolete historic
U+063F ؿ Arabic ARABIC LETTER FARSI YEH WITH THREE DOTS ABOVE [100] obsolete historic
U+0653  ٓ Inherited ARABIC MADDAH ABOVE [100] religious_use  
U+06AC ڬ Arabic ARABIC LETTER KAF WITH DOT ABOVE [100] obsolete (old Malay-Jawi)*use 0762 instead
U+06E5 ۥ Arabic ARABIC SMALL WAW [100] technical religious annotation
U+06E6 ۦ Arabic ARABIC SMALL YEH [100] technical religious annotation
U+06FD ۽ Arabic ARABIC SIGN SINDHI AMPERSAND [100] Inclusion, punctuation  
U+06FE ۾ Arabic ARABIC SIGN SINDHI POSTPOSITION MEN [100] Inclusion, punctuation  
U+077E ݾ Arabic ARABIC LETTER SEEN WITH INVERTED V [100] obsolete early Persian
U+077F ݿ Arabic ARABIC LETTER KAF WITH TWO DOTS ABOVE [100] obsolete early Persian
U+0870 Arabic ARABIC LETTER ALEF WITH ATTACHED FATHA   tentatively-religious  
U+0871 Arabic ARABIC LETTER ALEF WITH ATTACHED TOP RIGHT FATHA   tentatively-religious  
U+0872 Arabic ARABIC LETTER ALEF WITH RIGHT MIDDLE STROKE   tentatively-religious  
U+0873 Arabic ARABIC LETTER ALEF WITH LEFT MIDDLE STROKE   tentatively-religious  
U+0874 Arabic ARABIC LETTER ALEF WITH ATTACHED KASRA   tentatively-religious  
U+0875 Arabic ARABIC LETTER ALEF WITH ATTACHED BOTTOM RIGHT KASRA   tentatively-religious  
U+0876 Arabic ARABIC LETTER ALEF WITH ATTACHED ROUND DOT ABOVE   tentatively-religious  
U+0877 Arabic ARABIC LETTER ALEF WITH ATTACHED RIGHT ROUND DOT   tentatively-religious  
U+0878 Arabic ARABIC LETTER ALEF WITH ATTACHED LEFT ROUND DOT   tentatively-religious  
U+0879 Arabic ARABIC LETTER ALEF WITH ATTACHED ROUND DOT BELOW   tentatively-religious  
U+087A Arabic ARABIC LETTER ALEF WITH DOT ABOVE   tentatively-religious  
U+087B Arabic ARABIC LETTER ALEF WITH ATTACHED TOP RIGHT FATHA AND DOT ABOVE   tentatively-religious  
U+087C Arabic ARABIC LETTER ALEF WITH RIGHT MIDDLE STROKE AND DOT ABOVE   tentatively-religious  
U+087D Arabic ARABIC LETTER ALEF WITH ATTACHED BOTTOM RIGHT KASRA AND DOT ABOVE   tentatively-religious  
U+087E Arabic ARABIC LETTER ALEF WITH ATTACHED TOP RIGHT FATHA AND LEFT RING   tentatively-religious  
U+087F Arabic ARABIC LETTER ALEF WITH RIGHT MIDDLE STROKE AND LEFT RING   tentatively-religious  
U+0880 Arabic ARABIC LETTER ALEF WITH ATTACHED BOTTOM RIGHT KASRA AND LEFT RING   tentatively-religious  
U+0881 Arabic ARABIC LETTER ALEF WITH ATTACHED RIGHT HAMZA   tentatively-religious  
U+0882 Arabic ARABIC LETTER ALEF WITH ATTACHED LEFT HAMZA   tentatively-religious  
U+0883 Arabic ARABIC TATWEEL WITH OVERSTRUCK HAMZA   tentatively-religious  
U+0884 Arabic ARABIC TATWEEL WITH OVERSTRUCK WAW   tentatively-religious  
U+0885 Arabic ARABIC TATWEEL WITH TWO DOTS BELOW   tentatively-religious  
U+0886 Arabic ARABIC LETTER THIN YEH   tentatively-religious  
U+0887 Arabic ARABIC BASELINE ROUND DOT   tentatively-religious  
U+0888 Arabic ARABIC RAISED ROUND DOT   tentatively-religious  
U+0889 Arabic ARABIC LETTER NOON WITH INVERTED SMALL V   tentatively-limited_use  
U+088A Arabic ARABIC LETTER HAH WITH INVERTED SMALL V BELOW   tentatively-limited_use Bosnian
U+088B Arabic ARABIC LETTER TAH WITH DOT BELOW   tentatively-limited_use Pegon
U+088C Arabic ARABIC LETTER TAH WITH THREE DOTS BELOW   tentatively-limited_use Pegon
U+088D Arabic ARABIC LETTER KEHEH WITH TWO DOTS VERTICALLY BELOW   tentatively-limited_use Pegon
U+088E Arabic ARABIC VERTICAL TAIL   tentatively-limited_use historic abbreviation mark
U+08B2 Arabic ARABIC LETTER ZAIN WITH INVERTED V ABOVE [100] uncommon_use (Berber) (7.0)
U+08B5 Arabic ARABIC LETTER QAF WITH DOT BELOW AND NO DOTS ABOVE   tentatively-historic early Arabic
U+08B6 Arabic ARABIC LETTER BEH WITH SMALL MEEM ABOVE [100] uncommon_use (Bravanese) (9.0 13/178)
U+08B7 Arabic ARABIC LETTER PEH WITH SMALL MEEM ABOVE [100] uncommon_use (Bravanese) (9.0 13/178)
U+08B8 Arabic ARABIC LETTER TEH WITH SMALL TEH ABOVE [100] uncommon_use (Bravanese) (9.0 13/178)
U+08B9 Arabic ARABIC LETTER REH WITH SMALL NOON ABOVE [100] uncommon_use (Bravanese) (9.0 13/178)
U+08BA Arabic ARABIC LETTER YEH WITH TWO DOTS BELOW AND SMALL NOON ABOVE [100] uncommon_use (Bravanese) (9.0 13/178)
U+08C8 Arabic ARABIC LETTER GRAF   tentatively-limited_use Balti
U+093D Devanagari DEVANAGARI SIGN AVAGRAHA [100] obsolete Sanskrit
U+0950 Devanagari DEVANAGARI OM [100] religious_use, symbol  
U+0960 Devanagari DEVANAGARI LETTER VOCALIC RR [100] obsolete Sanskrit
U+0961 Devanagari DEVANAGARI LETTER VOCALIC LL [100] obsolete Sanskrit
U+0962  ॢ Devanagari DEVANAGARI VOWEL SIGN VOCALIC L [100] obsolete Sanskrit
U+0963  ॣ Devanagari DEVANAGARI VOWEL SIGN VOCALIC LL [100] obsolete Sanskrit
U+0971 Devanagari DEVANAGARI SIGN HIGH SPACING DOT [100] obsolete Sanskrit
U+097D Devanagari DEVANAGARI LETTER GLOTTAL STOP [100] technical Limbu
U+09BD Bengali BENGALI SIGN AVAGRAHA [100] obsolete Sanskrit
U+09E0 Bengali BENGALI LETTER VOCALIC RR [100] obsolete Sanskrit
U+09E1 Bengali BENGALI LETTER VOCALIC LL [100] obsolete Sanskrit
U+09E2  ৢ Bengali BENGALI VOWEL SIGN VOCALIC L [100] obsolete Sanskrit
U+09E3  ৣ Bengali BENGALI VOWEL SIGN VOCALIC LL [100] obsolete Sanskrit
U+09FE  ৾ Bengali BENGALI SANDHI MARK [100] uncommon_use (Sanskrit) (11.0 16/322)
U+0A01  ਁ Gurmukhi GURMUKHI SIGN ADAK BINDI [100] uncommon_use  
U+0A74 Gurmukhi GURMUKHI EK ONKAR [100] religious_use, symbol  
U+0ABD Gujarati GUJARATI SIGN AVAGRAHA [100] obsolete Sanskrit
U+0AD0 Gujarati GUJARATI OM [100] religious_use, symbol  
U+0AE0 Gujarati GUJARATI LETTER VOCALIC RR [100] obsolete Sanskrit
U+0AE1 Gujarati GUJARATI LETTER VOCALIC LL [100] obsolete Sanskrit
U+0AE2  ૢ Gujarati GUJARATI VOWEL SIGN VOCALIC L [100] obsolete Sanskrit
U+0AE3  ૣ Gujarati GUJARATI VOWEL SIGN VOCALIC LL [100] obsolete Sanskrit
U+0AFA  ૺ Gujarati GUJARATI SIGN SUKUN [100] uncommon_use (Arabic transliteration) (10.0 13/143)
U+0AFB  ૻ Gujarati GUJARATI SIGN SHADDA [100] uncommon_use (Arabic transliteration) (10.0 13/143)
U+0AFC  ૼ Gujarati GUJARATI SIGN MADDAH [100] uncommon_use (Arabic transliteration) (10.0 13/143)
U+0AFD  ૽ Gujarati GUJARATI SIGN THREE-DOT NUKTA ABOVE [100] uncommon_use (Arabic transliteration) (10.0 13/143)
U+0AFE  ૾ Gujarati GUJARATI SIGN CIRCLE NUKTA ABOVE [100] uncommon_use (Arabic transliteration) (10.0 13/143)
U+0AFF  ૿ Gujarati GUJARATI SIGN TWO-CIRCLE NUKTA ABOVE [100] uncommon_use (Arabic transliteration) (10.0 13/143)
U+0B3D Oriya ORIYA SIGN AVAGRAHA [100] obsolete Sanskrit
U+0B60 Oriya ORIYA LETTER VOCALIC RR [100] obsolete Sanskrit
U+0B61 Oriya ORIYA LETTER VOCALIC LL [100] obsolete Sanskrit
U+0B82  ஂ Tamil TAMIL SIGN ANUSVARA [100] technical not used in Tamil
U+0BD0 Tamil TAMIL OM [100] religious_use, symbol  
U+0C01  ఁ Telugu TELUGU SIGN CANDRABINDU [100] uncommon_use  
U+0C04  ఄ Telugu TELUGU SIGN COMBINING ANUSVARA ABOVE [100] uncommon_use (Prakrit) (11.0 16/285)
U+0C3C  ఼ Telugu TELUGU SIGN NUKTA   tentatively-limited_use 19/401
U+0C3D Telugu TELUGU SIGN AVAGRAHA [100] obsolete Sanskrit
U+0C5D Telugu TELUGU LETTER NAKAARA POLLU   tentatively-limited_use 11/409
U+0C60 Telugu TELUGU LETTER VOCALIC RR [100] obsolete Sanskrit
U+0C61 Telugu TELUGU LETTER VOCALIC LL [100] obsolete Sanskrit
U+0C80 Kannada KANNADA SIGN SPACING CANDRABINDU [100] uncommon_use (Badaga) (9.0 14/153)
U+0CBD Kannada KANNADA SIGN AVAGRAHA [100] obsolete Sanskrit
U+0CDD Kannada KANNADA LETTER NAKAARA POLLU   tentatively-excluded, tentatively-limited_use 13/228
U+0CE0 Kannada KANNADA LETTER VOCALIC RR [100] obsolete Sanskrit
U+0CE1 Kannada KANNADA LETTER VOCALIC LL [100] obsolete Sanskrit
U+0CE2  ೢ Kannada KANNADA VOWEL SIGN VOCALIC L [100] obsolete Sanskrit
U+0CE3  ೣ Kannada KANNADA VOWEL SIGN VOCALIC LL [100] obsolete Sanskrit
U+0CF1 Kannada KANNADA SIGN JIHVAMULIYA [100] obsolete Sankrit
U+0CF2 Kannada KANNADA SIGN UPADHMANIYA [100] obsolete Sankrit
U+0CF3  ೳ Kannada KANNADA SIGN COMBINING ANUSVARA ABOVE RIGHT   tentatively-excluded, tentatively-limited_use  
U+0D00  ഀ Malayalam MALAYALAM SIGN COMBINING ANUSVARA ABOVE [100] uncommon_use (Prakrit) (10.0 14/003)
U+0D3A Malayalam MALAYALAM LETTER TTTA [100] obsolete Historic
U+0D3D Malayalam MALAYALAM SIGN AVAGRAHA [100] obsolete Sanskrit
U+0D4C  ൌ Malayalam MALAYALAM VOWEL SIGN AU [100] obsolete Archaic
U+0D4E Malayalam MALAYALAM LETTER DOT REPH [100] obsolete Historic
U+0D54 Malayalam MALAYALAM LETTER CHILLU M [100] uncommon_use (Chillu) (9.0 14/013)
U+0D55 Malayalam MALAYALAM LETTER CHILLU Y [100] uncommon_use (Chillu) (9.0 14/013)
U+0D56 Malayalam MALAYALAM LETTER CHILLU LLL [100] uncommon_use (Chillu) (9.0 14/013)
U+0D60 Malayalam MALAYALAM LETTER VOCALIC RR [100] obsolete Sanskrit
U+0D61 Malayalam MALAYALAM LETTER VOCALIC LL [100] obsolete Sanskrit
U+0E86 Lao LAO LETTER PALI GHA [100] uncommon_use (Pali/Sanskrit) (12.0 17/106)
U+0E89 Lao LAO LETTER PALI CHA [100] uncommon_use (Pali/Sanskrit) (12.0 17/106)
U+0E8C Lao LAO LETTER PALI JHA [100] uncommon_use (Pali/Sanskrit) (12.0 17/106)
U+0E8E Lao LAO LETTER PALI NYA [100] uncommon_use (Pali/Sanskrit) (12.0 17/106)
U+0E8F Lao LAO LETTER PALI TTA [100] uncommon_use (Pali/Sanskrit) (12.0 17/106)
U+0E90 Lao LAO LETTER PALI TTHA [100] uncommon_use (Pali/Sanskrit) (12.0 17/106)
U+0E91 Lao LAO LETTER PALI DDA [100] uncommon_use (Pali/Sanskrit) (12.0 17/106)
U+0E92 Lao LAO LETTER PALI DDHA [100] uncommon_use (Pali/Sanskrit) (12.0 17/106)
U+0E93 Lao LAO LETTER PALI NNA [100] uncommon_use (Pali/Sanskrit) (12.0 17/106)
U+0E98 Lao LAO LETTER PALI DHA [100] uncommon_use (Pali/Sanskrit) (12.0 17/106)
U+0EA0 Lao LAO LETTER PALI BHA [100] uncommon_use (Pali/Sanskrit) (12.0 17/106)
U+0EA8 Lao LAO LETTER SANSKRIT SHA [100] uncommon_use (Pali/Sanskrit) (12.0 17/106)
U+0EA9 Lao LAO LETTER SANSKRIT SSA [100] uncommon_use (Pali/Sanskrit) (12.0 17/106)
U+0EAC Lao LAO LETTER PALI LLA [100] uncommon_use (Pali/Sanskrit) (12.0 17/106)
U+0EAF Lao LAO ELLIPSIS [100] symbol  
U+0EBA  ຺ Lao LAO SIGN PALI VIRAMA [100] uncommon_use (Pali/Sanskrit) (12.0 17/106)
U+0ECE  ໎ Lao LAO YAMAKKAN   tentatively-excluded, tentatively-limited_use Pali
U+0F00 Tibetan TIBETAN SYLLABLE OM [100] religious_use, symbol  
U+0F35  ༵ Tibetan TIBETAN MARK NGAS BZUNG NYI ZLA [100] symbol honorific, emphasis
U+0F37  ༷ Tibetan TIBETAN MARK NGAS BZUNG SGOR RTAGS [100] symbol emphasis
U+0F3E  ༾ Tibetan TIBETAN SIGN YAR TSHES [100] numeric almanacs
U+0F3F  ༿ Tibetan TIBETAN SIGN MAR TSHES [100] numeric almanacs
U+0F6A Tibetan TIBETAN LETTER FIXED-FORM RA [100] obsolete, uncommon_use Sanskrit
U+0F6B Tibetan TIBETAN LETTER KKA [100] uncommon_use Balti
U+0F6C Tibetan TIBETAN LETTER RRA [100] uncommon_use Balti
U+0F7B  ཻ Tibetan TIBETAN VOWEL SIGN EE [100] duplicate homoglyph (digraph of 0F7A 0F7A)
U+0F7D  ཽ Tibetan TIBETAN VOWEL SIGN OO [100] duplicate homoglyph (digraph of 0F7C 0F7C)
U+0F82  ྂ Tibetan TIBETAN SIGN NYI ZLA NAA DA [100] obsolete, uncommon_use Sanskrit
U+0F83  ྃ Tibetan TIBETAN SIGN SNA LDAN [100] obsolete, uncommon_use Sanskrit
U+0F86  ྆ Tibetan TIBETAN SIGN LCI RTAGS [100] obsolete historic
U+0F87  ྇ Tibetan TIBETAN SIGN YANG RTAGS [100] obsolete historic
U+0F88 Tibetan TIBETAN SIGN LCE TSA CAN [100] obsolete historic
U+0F89 Tibetan TIBETAN SIGN MCHU CAN [100] obsolete historic
U+0F8A Tibetan TIBETAN SIGN GRU CAN RGYINGS [100] obsolete historic
U+0F8B Tibetan TIBETAN SIGN GRU MED RGYINGS [100] obsolete historic
U+0F8C Tibetan TIBETAN SIGN INVERTED MCHU CAN [100] obsolete historic
U+0F8D  ྍ Tibetan TIBETAN SUBJOINED SIGN LCE TSA CAN [100] obsolete historic
U+0F8E  ྎ Tibetan TIBETAN SUBJOINED SIGN MCHU CAN [100] obsolete historic
U+0F8F  ྏ Tibetan TIBETAN SUBJOINED SIGN INVERTED MCHU CAN [100] obsolete historic
U+0FAE  ྮ Tibetan TIBETAN SUBJOINED LETTER ZHA [100] uncommon_use  
U+0FAF  ྯ Tibetan TIBETAN SUBJOINED LETTER ZA [100] uncommon_use  
U+0FB0  ྰ Tibetan TIBETAN SUBJOINED LETTER -A [100] uncommon_use  
U+0FC6  ࿆ Tibetan TIBETAN SYMBOL PADMA GDAN [100] symbol, uncommon_use  
U+1050 Myanmar MYANMAR LETTER SHA [100] obsolete, uncommon_use (Pali) (Sanskrit)
U+1051 Myanmar MYANMAR LETTER SSA [100] obsolete, uncommon_use (Pali) (Sanskrit)
U+1052 Myanmar MYANMAR LETTER VOCALIC R [100] obsolete, uncommon_use (Pali) (Sanskrit)
U+1053 Myanmar MYANMAR LETTER VOCALIC RR [100] obsolete, uncommon_use (Pali) (Sanskrit)
U+1054 Myanmar MYANMAR LETTER VOCALIC L [100] obsolete, uncommon_use (Pali) (Sanskrit)
U+1055 Myanmar MYANMAR LETTER VOCALIC LL [100] obsolete, uncommon_use (Pali) (Sanskrit)
U+1056  ၖ Myanmar MYANMAR VOWEL SIGN VOCALIC R [100] obsolete, uncommon_use (Pali) (Sanskrit)
U+1057  ၗ Myanmar MYANMAR VOWEL SIGN VOCALIC RR [100] obsolete, uncommon_use (Pali) (Sanskrit)
U+1058  ၘ Myanmar MYANMAR VOWEL SIGN VOCALIC L [100] obsolete, uncommon_use (Pali) (Sanskrit)
U+1059  ၙ Myanmar MYANMAR VOWEL SIGN VOCALIC LL [100] obsolete, uncommon_use (Pali) (Sanskrit)
U+1065 Myanmar MYANMAR LETTER WESTERN PWO KAREN THA [100] uncommon_use Western Pwo Karen
U+1066 Myanmar MYANMAR LETTER WESTERN PWO KAREN PWA [100] uncommon_use Western Pwo Karen
U+1067  ၧ Myanmar MYANMAR VOWEL SIGN WESTERN PWO KAREN EU [100] uncommon_use Western Pwo Karen
U+1068  ၨ Myanmar MYANMAR VOWEL SIGN WESTERN PWO KAREN UE [100] uncommon_use Western Pwo Karen
U+1069  ၩ Myanmar MYANMAR SIGN WESTERN PWO KAREN TONE-1 [100] uncommon_use Western Pwo Karen
U+106A  ၪ Myanmar MYANMAR SIGN WESTERN PWO KAREN TONE-2 [100] uncommon_use Western Pwo Karen
U+106B  ၫ Myanmar MYANMAR SIGN WESTERN PWO KAREN TONE-3 [100] uncommon_use Western Pwo Karen
U+106C  ၬ Myanmar MYANMAR SIGN WESTERN PWO KAREN TONE-4 [100] uncommon_use Western Pwo Karen
U+106D  ၭ Myanmar MYANMAR SIGN WESTERN PWO KAREN TONE-5 [100] uncommon_use Western Pwo Karen
U+106E Myanmar MYANMAR LETTER EASTERN PWO KAREN NNA [100] uncommon_use Eastern Pwo Karen
U+106F Myanmar MYANMAR LETTER EASTERN PWO KAREN YWA [100] uncommon_use Eastern Pwo Karen
U+1070 Myanmar MYANMAR LETTER EASTERN PWO KAREN GHWA [100] uncommon_use Eastern Pwo Karen
U+1071  ၱ Myanmar MYANMAR VOWEL SIGN GEBA KAREN I [100] uncommon_use Geba Karen
U+1072  ၲ Myanmar MYANMAR VOWEL SIGN KAYAH OE [100] uncommon_use Kayah
U+1073  ၳ Myanmar MYANMAR VOWEL SIGN KAYAH U [100] uncommon_use Kayah
U+1074  ၴ Myanmar MYANMAR VOWEL SIGN KAYAH EE [100] uncommon_use Kayah
U+108E Myanmar MYANMAR LETTER RUMAI PALAUNG FA [100] uncommon_use Rumai Palaung
U+109A  ႚ Myanmar MYANMAR SIGN KHAMTI TONE-1 [100] uncommon_use Kamti Shan
U+109B  ႛ Myanmar MYANMAR SIGN KHAMTI TONE-3 [100] uncommon_use Kamti Shan
U+109C  ႜ Myanmar MYANMAR VOWEL SIGN AITON A [100] uncommon_use Aiton, Phake
U+109D  ႝ Myanmar MYANMAR VOWEL SIGN AITON AI [100] uncommon_use Aiton, Phake
U+10F9 Georgian GEORGIAN LETTER TURNED GAN [100] technical, uncommon_use educational
U+10FA Georgian GEORGIAN LETTER AIN [100] technical, uncommon_use threatened
U+10FD Georgian GEORGIAN LETTER AEN [100] uncommon_use Ossetian, Abkhaz
U+10FE Georgian GEORGIAN LETTER HARD SIGN [100] uncommon_use Ossetian, Abkhaz
U+10FF Georgian GEORGIAN LETTER LABIAL SIGN [100] uncommon_use Ossetian, Abkhaz
U+1380 Ethiopic ETHIOPIC SYLLABLE SEBATBEIT MWA [100] uncommon_use Sebatbeit
U+1381 Ethiopic ETHIOPIC SYLLABLE MWI [100] uncommon_use Sebatbeit
U+1382 Ethiopic ETHIOPIC SYLLABLE MWEE [100] uncommon_use Sebatbeit
U+1383 Ethiopic ETHIOPIC SYLLABLE MWE [100] uncommon_use Sebatbeit
U+1384 Ethiopic ETHIOPIC SYLLABLE SEBATBEIT BWA [100] uncommon_use Sebatbeit
U+1385 Ethiopic ETHIOPIC SYLLABLE BWI [100] uncommon_use Sebatbeit
U+1386 Ethiopic ETHIOPIC SYLLABLE BWEE [100] uncommon_use Sebatbeit
U+1387 Ethiopic ETHIOPIC SYLLABLE BWE [100] uncommon_use Sebatbeit
U+1388 Ethiopic ETHIOPIC SYLLABLE SEBATBEIT FWA [100] uncommon_use Sebatbeit
U+1389 Ethiopic ETHIOPIC SYLLABLE FWI [100] uncommon_use Sebatbeit
U+138A Ethiopic ETHIOPIC SYLLABLE FWEE [100] uncommon_use Sebatbeit
U+138B Ethiopic ETHIOPIC SYLLABLE FWE [100] uncommon_use Sebatbeit
U+138C Ethiopic ETHIOPIC SYLLABLE SEBATBEIT PWA [100] uncommon_use Sebatbeit
U+138D Ethiopic ETHIOPIC SYLLABLE PWI [100] uncommon_use Sebatbeit
U+138E Ethiopic ETHIOPIC SYLLABLE PWEE [100] uncommon_use Sebatbeit
U+138F Ethiopic ETHIOPIC SYLLABLE PWE [100] uncommon_use Sebatbeit
U+17DC Khmer KHMER SIGN AVAKRAHASANYA [100] obsolete Sanskrit
U+1E00 Latin LATIN CAPITAL LETTER A WITH RING BELOW   Uppercase Not IDNA2008
U+1E01 Latin LATIN SMALL LETTER A WITH RING BELOW [100] technical  
U+1E18 Latin LATIN CAPITAL LETTER E WITH CIRCUMFLEX BELOW   Uppercase Not IDNA2008
U+1E19 Latin LATIN SMALL LETTER E WITH CIRCUMFLEX BELOW [100] technical  
U+1E1A Latin LATIN CAPITAL LETTER E WITH TILDE BELOW   Uppercase Not IDNA2008
U+1E1B Latin LATIN SMALL LETTER E WITH TILDE BELOW [100] technical  
U+1E2A Latin LATIN CAPITAL LETTER H WITH BREVE BELOW   Uppercase Not IDNA2008
U+1E2B Latin LATIN SMALL LETTER H WITH BREVE BELOW [100] technical Semitic transliteration
U+1E2C Latin LATIN CAPITAL LETTER I WITH TILDE BELOW   Uppercase Not IDNA2008
U+1E2D Latin LATIN SMALL LETTER I WITH TILDE BELOW [100] technical  
U+1E72 Latin LATIN CAPITAL LETTER U WITH DIAERESIS BELOW   Uppercase Not IDNA2008
U+1E73 Latin LATIN SMALL LETTER U WITH DIAERESIS BELOW [100] technical  
U+1E74 Latin LATIN CAPITAL LETTER U WITH TILDE BELOW   Uppercase Not IDNA2008
U+1E75 Latin LATIN SMALL LETTER U WITH TILDE BELOW [100] technical  
U+1E76 Latin LATIN CAPITAL LETTER U WITH CIRCUMFLEX BELOW   Uppercase Not IDNA2008
U+1E77 Latin LATIN SMALL LETTER U WITH CIRCUMFLEX BELOW [100] technical  
U+1EF2 Latin LATIN CAPITAL LETTER Y WITH GRAVE   GreekUppercase Not IDNA2008
U+1F00 Greek GREEK SMALL LETTER ALPHA WITH PSILI [100] historic polytoniko orthography
U+1F01 Greek GREEK SMALL LETTER ALPHA WITH DASIA [100] historic polytoniko orthography
U+1F02 Greek GREEK SMALL LETTER ALPHA WITH PSILI AND VARIA [100] historic polytoniko orthography
U+1F03 Greek GREEK SMALL LETTER ALPHA WITH DASIA AND VARIA [100] historic polytoniko orthography
U+1F04 Greek GREEK SMALL LETTER ALPHA WITH PSILI AND OXIA [100] historic polytoniko orthography
U+1F05 Greek GREEK SMALL LETTER ALPHA WITH DASIA AND OXIA [100] historic polytoniko orthography
U+1F06 Greek GREEK SMALL LETTER ALPHA WITH PSILI AND PERISPOMENI [100] historic polytoniko orthography
U+1F07 Greek GREEK SMALL LETTER ALPHA WITH DASIA AND PERISPOMENI [100] historic polytoniko orthography
U+1F08 Greek GREEK CAPITAL LETTER ALPHA WITH PSILI   GreekUppercase Not IDNA2008
U+1F09 Greek GREEK CAPITAL LETTER ALPHA WITH DASIA   GreekUppercase Not IDNA2008
U+1F0A Greek GREEK CAPITAL LETTER ALPHA WITH PSILI AND VARIA   GreekUppercase Not IDNA2008
U+1F0B Greek GREEK CAPITAL LETTER ALPHA WITH DASIA AND VARIA   GreekUppercase Not IDNA2008
U+1F0C Greek GREEK CAPITAL LETTER ALPHA WITH PSILI AND OXIA   GreekUppercase Not IDNA2008
U+1F0D Greek GREEK CAPITAL LETTER ALPHA WITH DASIA AND OXIA   GreekUppercase Not IDNA2008
U+1F0E Greek GREEK CAPITAL LETTER ALPHA WITH PSILI AND PERISPOMENI   GreekUppercase Not IDNA2008
U+1F0F Greek GREEK CAPITAL LETTER ALPHA WITH DASIA AND PERISPOMENI   GreekUppercase Not IDNA2008
U+1F10 Greek GREEK SMALL LETTER EPSILON WITH PSILI [100] historic polytoniko orthography
U+1F11 Greek GREEK SMALL LETTER EPSILON WITH DASIA [100] historic polytoniko orthography
U+1F12 Greek GREEK SMALL LETTER EPSILON WITH PSILI AND VARIA [100] historic polytoniko orthography
U+1F13 Greek GREEK SMALL LETTER EPSILON WITH DASIA AND VARIA [100] historic polytoniko orthography
U+1F14 Greek GREEK SMALL LETTER EPSILON WITH PSILI AND OXIA [100] historic polytoniko orthography
U+1F15 Greek GREEK SMALL LETTER EPSILON WITH DASIA AND OXIA [100] historic polytoniko orthography
U+1F18 Greek GREEK CAPITAL LETTER EPSILON WITH PSILI   GreekUppercase Not IDNA2008
U+1F19 Greek GREEK CAPITAL LETTER EPSILON WITH DASIA   GreekUppercase Not IDNA2008
U+1F1A Greek GREEK CAPITAL LETTER EPSILON WITH PSILI AND VARIA   GreekUppercase Not IDNA2008
U+1F1B Greek GREEK CAPITAL LETTER EPSILON WITH DASIA AND VARIA   GreekUppercase Not IDNA2008
U+1F1C Greek GREEK CAPITAL LETTER EPSILON WITH PSILI AND OXIA   GreekUppercase Not IDNA2008
U+1F1D Greek GREEK CAPITAL LETTER EPSILON WITH DASIA AND OXIA   GreekUppercase Not IDNA2008
U+1F20 Greek GREEK SMALL LETTER ETA WITH PSILI [100] historic polytoniko orthography
U+1F21 Greek GREEK SMALL LETTER ETA WITH DASIA [100] historic polytoniko orthography
U+1F22 Greek GREEK SMALL LETTER ETA WITH PSILI AND VARIA [100] historic polytoniko orthography
U+1F23 Greek GREEK SMALL LETTER ETA WITH DASIA AND VARIA [100] historic polytoniko orthography
U+1F24 Greek GREEK SMALL LETTER ETA WITH PSILI AND OXIA [100] historic polytoniko orthography
U+1F25 Greek GREEK SMALL LETTER ETA WITH DASIA AND OXIA [100] historic polytoniko orthography
U+1F26 Greek GREEK SMALL LETTER ETA WITH PSILI AND PERISPOMENI [100] historic polytoniko orthography
U+1F27 Greek GREEK SMALL LETTER ETA WITH DASIA AND PERISPOMENI [100] historic polytoniko orthography
U+1F28 Greek GREEK CAPITAL LETTER ETA WITH PSILI   GreekUppercase Not IDNA2008
U+1F29 Greek GREEK CAPITAL LETTER ETA WITH DASIA   GreekUppercase Not IDNA2008
U+1F2A Greek GREEK CAPITAL LETTER ETA WITH PSILI AND VARIA   GreekUppercase Not IDNA2008
U+1F2B Greek GREEK CAPITAL LETTER ETA WITH DASIA AND VARIA   GreekUppercase Not IDNA2008
U+1F2C Greek GREEK CAPITAL LETTER ETA WITH PSILI AND OXIA   GreekUppercase Not IDNA2008
U+1F2D Greek GREEK CAPITAL LETTER ETA WITH DASIA AND OXIA   GreekUppercase Not IDNA2008
U+1F2E Greek GREEK CAPITAL LETTER ETA WITH PSILI AND PERISPOMENI   GreekUppercase Not IDNA2008
U+1F2F Greek GREEK CAPITAL LETTER ETA WITH DASIA AND PERISPOMENI   GreekUppercase Not IDNA2008
U+1F30 Greek GREEK SMALL LETTER IOTA WITH PSILI [100] historic polytoniko orthography
U+1F31 Greek GREEK SMALL LETTER IOTA WITH DASIA [100] historic polytoniko orthography
U+1F32 Greek GREEK SMALL LETTER IOTA WITH PSILI AND VARIA [100] historic polytoniko orthography
U+1F33 Greek GREEK SMALL LETTER IOTA WITH DASIA AND VARIA [100] historic polytoniko orthography
U+1F34 Greek GREEK SMALL LETTER IOTA WITH PSILI AND OXIA [100] historic polytoniko orthography
U+1F35 Greek GREEK SMALL LETTER IOTA WITH DASIA AND OXIA [100] historic polytoniko orthography
U+1F36 Greek GREEK SMALL LETTER IOTA WITH PSILI AND PERISPOMENI [100] historic polytoniko orthography
U+1F37 Greek GREEK SMALL LETTER IOTA WITH DASIA AND PERISPOMENI [100] historic polytoniko orthography
U+1F38 Greek GREEK CAPITAL LETTER IOTA WITH PSILI   GreekUppercase Not IDNA2008
U+1F39 Greek GREEK CAPITAL LETTER IOTA WITH DASIA   GreekUppercase Not IDNA2008
U+1F3A Greek GREEK CAPITAL LETTER IOTA WITH PSILI AND VARIA   GreekUppercase Not IDNA2008
U+1F3B Greek GREEK CAPITAL LETTER IOTA WITH DASIA AND VARIA   GreekUppercase Not IDNA2008
U+1F3C Greek GREEK CAPITAL LETTER IOTA WITH PSILI AND OXIA   GreekUppercase Not IDNA2008
U+1F3D Greek GREEK CAPITAL LETTER IOTA WITH DASIA AND OXIA   GreekUppercase Not IDNA2008
U+1F3E Greek GREEK CAPITAL LETTER IOTA WITH PSILI AND PERISPOMENI   GreekUppercase Not IDNA2008
U+1F3F Ἷ Greek GREEK CAPITAL LETTER IOTA WITH DASIA AND PERISPOMENI   GreekUppercase Not IDNA2008
U+1F40 Greek GREEK SMALL LETTER OMICRON WITH PSILI [100] historic polytoniko orthography
U+1F41 Greek GREEK SMALL LETTER OMICRON WITH DASIA [100] historic polytoniko orthography
U+1F42 Greek GREEK SMALL LETTER OMICRON WITH PSILI AND VARIA [100] historic polytoniko orthography
U+1F43 Greek GREEK SMALL LETTER OMICRON WITH DASIA AND VARIA [100] historic polytoniko orthography
U+1F44 Greek GREEK SMALL LETTER OMICRON WITH PSILI AND OXIA [100] historic polytoniko orthography
U+1F45 Greek GREEK SMALL LETTER OMICRON WITH DASIA AND OXIA [100] historic polytoniko orthography
U+1F48 Greek GREEK CAPITAL LETTER OMICRON WITH PSILI   GreekUppercase Not IDNA2008
U+1F49 Greek GREEK CAPITAL LETTER OMICRON WITH DASIA   GreekUppercase Not IDNA2008
U+1F4A Greek GREEK CAPITAL LETTER OMICRON WITH PSILI AND VARIA   GreekUppercase Not IDNA2008
U+1F4B Greek GREEK CAPITAL LETTER OMICRON WITH DASIA AND VARIA   GreekUppercase Not IDNA2008
U+1F4C Greek GREEK CAPITAL LETTER OMICRON WITH PSILI AND OXIA   GreekUppercase Not IDNA2008
U+1F4D Greek GREEK CAPITAL LETTER OMICRON WITH DASIA AND OXIA   GreekUppercase Not IDNA2008
U+1F50 Greek GREEK SMALL LETTER UPSILON WITH PSILI [100] historic polytoniko orthography
U+1F51 Greek GREEK SMALL LETTER UPSILON WITH DASIA [100] historic polytoniko orthography
U+1F52 Greek GREEK SMALL LETTER UPSILON WITH PSILI AND VARIA [100] historic polytoniko orthography
U+1F53 Greek GREEK SMALL LETTER UPSILON WITH DASIA AND VARIA [100] historic polytoniko orthography
U+1F54 Greek GREEK SMALL LETTER UPSILON WITH PSILI AND OXIA [100] historic polytoniko orthography
U+1F55 Greek GREEK SMALL LETTER UPSILON WITH DASIA AND OXIA [100] historic polytoniko orthography
U+1F56 Greek GREEK SMALL LETTER UPSILON WITH PSILI AND PERISPOMENI [100] historic polytoniko orthography
U+1F57 Greek GREEK SMALL LETTER UPSILON WITH DASIA AND PERISPOMENI [100] historic polytoniko orthography
U+1F59 Greek GREEK CAPITAL LETTER UPSILON WITH DASIA   GreekUppercase Not IDNA2008
U+1F5B Greek GREEK CAPITAL LETTER UPSILON WITH DASIA AND VARIA   GreekUppercase Not IDNA2008
U+1F5D Greek GREEK CAPITAL LETTER UPSILON WITH DASIA AND OXIA   GreekUppercase Not IDNA2008
U+1F5F Greek GREEK CAPITAL LETTER UPSILON WITH DASIA AND PERISPOMENI   GreekUppercase Not IDNA2008
U+1F60 Greek GREEK SMALL LETTER OMEGA WITH PSILI [100] historic polytoniko orthography
U+1F61 Greek GREEK SMALL LETTER OMEGA WITH DASIA [100] historic polytoniko orthography
U+1F62 Greek GREEK SMALL LETTER OMEGA WITH PSILI AND VARIA [100] historic polytoniko orthography
U+1F63 Greek GREEK SMALL LETTER OMEGA WITH DASIA AND VARIA [100] historic polytoniko orthography
U+1F64 Greek GREEK SMALL LETTER OMEGA WITH PSILI AND OXIA [100] historic polytoniko orthography
U+1F65 Greek GREEK SMALL LETTER OMEGA WITH DASIA AND OXIA [100] historic polytoniko orthography
U+1F66 Greek GREEK SMALL LETTER OMEGA WITH PSILI AND PERISPOMENI [100] historic polytoniko orthography
U+1F67 Greek GREEK SMALL LETTER OMEGA WITH DASIA AND PERISPOMENI [100] historic polytoniko orthography
U+1F68 Greek GREEK CAPITAL LETTER OMEGA WITH PSILI   GreekUppercase Not IDNA2008
U+1F69 Greek GREEK CAPITAL LETTER OMEGA WITH DASIA   GreekUppercase Not IDNA2008
U+1F6A Greek GREEK CAPITAL LETTER OMEGA WITH PSILI AND VARIA   GreekUppercase Not IDNA2008
U+1F6B Greek GREEK CAPITAL LETTER OMEGA WITH DASIA AND VARIA   GreekUppercase Not IDNA2008
U+1F6C Greek GREEK CAPITAL LETTER OMEGA WITH PSILI AND OXIA   GreekUppercase Not IDNA2008
U+1F6D Greek GREEK CAPITAL LETTER OMEGA WITH DASIA AND OXIA   GreekUppercase Not IDNA2008
U+1F6E Greek GREEK CAPITAL LETTER OMEGA WITH PSILI AND PERISPOMENI   GreekUppercase Not IDNA2008
U+1F6F Greek GREEK CAPITAL LETTER OMEGA WITH DASIA AND PERISPOMENI   GreekUppercase Not IDNA2008
U+1F70 Greek GREEK SMALL LETTER ALPHA WITH VARIA [100] historic polytoniko orthography
U+1F72 Greek GREEK SMALL LETTER EPSILON WITH VARIA [100] historic polytoniko orthography
U+1F74 Greek GREEK SMALL LETTER ETA WITH VARIA [100] historic polytoniko orthography
U+1F76 Greek GREEK SMALL LETTER IOTA WITH VARIA [100] historic polytoniko orthography
U+1F78 Greek GREEK SMALL LETTER OMICRON WITH VARIA [100] historic polytoniko orthography
U+1F7A Greek GREEK SMALL LETTER UPSILON WITH VARIA [100] historic polytoniko orthography
U+1F7C Greek GREEK SMALL LETTER OMEGA WITH VARIA [100] historic polytoniko orthography
U+1F80 Greek GREEK SMALL LETTER ALPHA WITH PSILI AND YPOGEGRAMMENI     NOT IDNA2008
U+1F81 Greek GREEK SMALL LETTER ALPHA WITH DASIA AND YPOGEGRAMMENI     NOT IDNA2008
U+1F82 Greek GREEK SMALL LETTER ALPHA WITH PSILI AND VARIA AND YPOGEGRAMMENI     NOT IDNA2008
U+1F83 Greek GREEK SMALL LETTER ALPHA WITH DASIA AND VARIA AND YPOGEGRAMMENI     NOT IDNA2008
U+1F84 Greek GREEK SMALL LETTER ALPHA WITH PSILI AND OXIA AND YPOGEGRAMMENI     NOT IDNA2008
U+1F85 Greek GREEK SMALL LETTER ALPHA WITH DASIA AND OXIA AND YPOGEGRAMMENI     NOT IDNA2008
U+1F86 Greek GREEK SMALL LETTER ALPHA WITH PSILI AND PERISPOMENI AND YPOGEGRAMMENI     NOT IDNA2008
U+1F87 Greek GREEK SMALL LETTER ALPHA WITH DASIA AND PERISPOMENI AND YPOGEGRAMMENI     NOT IDNA2008
U+1F88 Greek GREEK CAPITAL LETTER ALPHA WITH PSILI AND PROSGEGRAMMENI     NOT IDNA2008
U+1F89 Greek GREEK CAPITAL LETTER ALPHA WITH DASIA AND PROSGEGRAMMENI     NOT IDNA2008
U+1F8A Greek GREEK CAPITAL LETTER ALPHA WITH PSILI AND VARIA AND PROSGEGRAMMENI     NOT IDNA2008
U+1F8B Greek GREEK CAPITAL LETTER ALPHA WITH DASIA AND VARIA AND PROSGEGRAMMENI     NOT IDNA2008
U+1F8C Greek GREEK CAPITAL LETTER ALPHA WITH PSILI AND OXIA AND PROSGEGRAMMENI     NOT IDNA2008
U+1F8D Greek GREEK CAPITAL LETTER ALPHA WITH DASIA AND OXIA AND PROSGEGRAMMENI     NOT IDNA2008
U+1F8E Greek GREEK CAPITAL LETTER ALPHA WITH PSILI AND PERISPOMENI AND PROSGEGRAMMENI     NOT IDNA2008
U+1F8F Greek GREEK CAPITAL LETTER ALPHA WITH DASIA AND PERISPOMENI AND PROSGEGRAMMENI     NOT IDNA2008
U+1F90 Greek GREEK SMALL LETTER ETA WITH PSILI AND YPOGEGRAMMENI     NOT IDNA2008
U+1F91 Greek GREEK SMALL LETTER ETA WITH DASIA AND YPOGEGRAMMENI     NOT IDNA2008
U+1F92 Greek GREEK SMALL LETTER ETA WITH PSILI AND VARIA AND YPOGEGRAMMENI     NOT IDNA2008
U+1F93 Greek GREEK SMALL LETTER ETA WITH DASIA AND VARIA AND YPOGEGRAMMENI     NOT IDNA2008
U+1F94 Greek GREEK SMALL LETTER ETA WITH PSILI AND OXIA AND YPOGEGRAMMENI     NOT IDNA2008
U+1F95 Greek GREEK SMALL LETTER ETA WITH DASIA AND OXIA AND YPOGEGRAMMENI     NOT IDNA2008
U+1F96 Greek GREEK SMALL LETTER ETA WITH PSILI AND PERISPOMENI AND YPOGEGRAMMENI     NOT IDNA2008
U+1F97 Greek GREEK SMALL LETTER ETA WITH DASIA AND PERISPOMENI AND YPOGEGRAMMENI     NOT IDNA2008
U+1F98 Greek GREEK CAPITAL LETTER ETA WITH PSILI AND PROSGEGRAMMENI     NOT IDNA2008
U+1F99 Greek GREEK CAPITAL LETTER ETA WITH DASIA AND PROSGEGRAMMENI     NOT IDNA2008
U+1F9A Greek GREEK CAPITAL LETTER ETA WITH PSILI AND VARIA AND PROSGEGRAMMENI     NOT IDNA2008
U+1F9B Greek GREEK CAPITAL LETTER ETA WITH DASIA AND VARIA AND PROSGEGRAMMENI     NOT IDNA2008
U+1F9C Greek GREEK CAPITAL LETTER ETA WITH PSILI AND OXIA AND PROSGEGRAMMENI     NOT IDNA2008
U+1F9D Greek GREEK CAPITAL LETTER ETA WITH DASIA AND OXIA AND PROSGEGRAMMENI     NOT IDNA2008
U+1F9E Greek GREEK CAPITAL LETTER ETA WITH PSILI AND PERISPOMENI AND PROSGEGRAMMENI     NOT IDNA2008
U+1F9F Greek GREEK CAPITAL LETTER ETA WITH DASIA AND PERISPOMENI AND PROSGEGRAMMENI     NOT IDNA2008
U+1FA0 Greek GREEK SMALL LETTER OMEGA WITH PSILI AND YPOGEGRAMMENI     NOT IDNA2008
U+1FA1 Greek GREEK SMALL LETTER OMEGA WITH DASIA AND YPOGEGRAMMENI     NOT IDNA2008
U+1FA2 Greek GREEK SMALL LETTER OMEGA WITH PSILI AND VARIA AND YPOGEGRAMMENI     NOT IDNA2008
U+1FA3 Greek GREEK SMALL LETTER OMEGA WITH DASIA AND VARIA AND YPOGEGRAMMENI     NOT IDNA2008
U+1FA4 Greek GREEK SMALL LETTER OMEGA WITH PSILI AND OXIA AND YPOGEGRAMMENI     NOT IDNA2008
U+1FA5 Greek GREEK SMALL LETTER OMEGA WITH DASIA AND OXIA AND YPOGEGRAMMENI     NOT IDNA2008
U+1FA6 Greek GREEK SMALL LETTER OMEGA WITH PSILI AND PERISPOMENI AND YPOGEGRAMMENI     NOT IDNA2008
U+1FA7 Greek GREEK SMALL LETTER OMEGA WITH DASIA AND PERISPOMENI AND YPOGEGRAMMENI     NOT IDNA2008
U+1FA8 Greek GREEK CAPITAL LETTER OMEGA WITH PSILI AND PROSGEGRAMMENI     NOT IDNA2008
U+1FA9 Greek GREEK CAPITAL LETTER OMEGA WITH DASIA AND PROSGEGRAMMENI     NOT IDNA2008
U+1FAA Greek GREEK CAPITAL LETTER OMEGA WITH PSILI AND VARIA AND PROSGEGRAMMENI     NOT IDNA2008
U+1FAB Greek GREEK CAPITAL LETTER OMEGA WITH DASIA AND VARIA AND PROSGEGRAMMENI     NOT IDNA2008
U+1FAC Greek GREEK CAPITAL LETTER OMEGA WITH PSILI AND OXIA AND PROSGEGRAMMENI     NOT IDNA2008
U+1FAD Greek GREEK CAPITAL LETTER OMEGA WITH DASIA AND OXIA AND PROSGEGRAMMENI     NOT IDNA2008
U+1FAE Greek GREEK CAPITAL LETTER OMEGA WITH PSILI AND PERISPOMENI AND PROSGEGRAMMENI     NOT IDNA2008
U+1FAF Greek GREEK CAPITAL LETTER OMEGA WITH DASIA AND PERISPOMENI AND PROSGEGRAMMENI     NOT IDNA2008
U+1FB0 Greek GREEK SMALL LETTER ALPHA WITH VRACHY [100] technical  
U+1FB1 Greek GREEK SMALL LETTER ALPHA WITH MACRON [100] technical  
U+1FB2 Greek GREEK SMALL LETTER ALPHA WITH VARIA AND YPOGEGRAMMENI     NOT IDNA208
U+1FB3 Greek GREEK SMALL LETTER ALPHA WITH YPOGEGRAMMENI     NOT IDNA208
U+1FB4 Greek GREEK SMALL LETTER ALPHA WITH OXIA AND YPOGEGRAMMENI     NOT IDNA208
U+1FB6 Greek GREEK SMALL LETTER ALPHA WITH PERISPOMENI [100] historic polytoniko orthography
U+1FB7 Greek GREEK SMALL LETTER ALPHA WITH PERISPOMENI AND YPOGEGRAMMENI     NOT IDNA2008
U+1FB8 Greek GREEK CAPITAL LETTER ALPHA WITH VRACHY   GreekUppercase Not IDNA2008
U+1FB9 Greek GREEK CAPITAL LETTER ALPHA WITH MACRON   GreekUppercase Not IDNA2008
U+1FBA Greek GREEK CAPITAL LETTER ALPHA WITH VARIA   GreekUppercase Not IDNA2008
U+1FBC Greek GREEK CAPITAL LETTER ALPHA WITH PROSGEGRAMMENI     NOT IDNA2008
U+1FC2 Greek GREEK SMALL LETTER ETA WITH VARIA AND YPOGEGRAMMENI     NOT IDNA2008
U+1FC3 Greek GREEK SMALL LETTER ETA WITH YPOGEGRAMMENI     NOT IDNA2008
U+1FC4 Greek GREEK SMALL LETTER ETA WITH OXIA AND YPOGEGRAMMENI     NOT IDNA2008
U+1FC6 Greek GREEK SMALL LETTER ETA WITH PERISPOMENI [100] historic polytoniko orthography
U+1FC7 Greek GREEK SMALL LETTER ETA WITH PERISPOMENI AND YPOGEGRAMMENI     NOT IDNA2008
U+1FC8 Greek GREEK CAPITAL LETTER EPSILON WITH VARIA   GreekUppercase Not IDNA2008
U+1FCA Greek GREEK CAPITAL LETTER ETA WITH VARIA   GreekUppercase Not IDNA2008
U+1FCC Greek GREEK CAPITAL LETTER ETA WITH PROSGEGRAMMENI     NOT IDNA2008
U+1FD0 Greek GREEK SMALL LETTER IOTA WITH VRACHY [100] historic polytoniko orthography
U+1FD1 Greek GREEK SMALL LETTER IOTA WITH MACRON [100] historic polytoniko orthography
U+1FD2 Greek GREEK SMALL LETTER IOTA WITH DIALYTIKA AND VARIA [100] historic polytoniko orthography
U+1FD6 Greek GREEK SMALL LETTER IOTA WITH PERISPOMENI [100] historic polytoniko orthography
U+1FD7 Greek GREEK SMALL LETTER IOTA WITH DIALYTIKA AND PERISPOMENI [100] historic polytoniko orthography
U+1FD8 Greek GREEK CAPITAL LETTER IOTA WITH VRACHY   GreekUppercase Not IDNA2008
U+1FD9 Greek GREEK CAPITAL LETTER IOTA WITH MACRON   GreekUppercase Not IDNA2008
U+1FDA Greek GREEK CAPITAL LETTER IOTA WITH VARIA   GreekUppercase Not IDNA2008
U+1FDB Greek GREEK CAPITAL LETTER IOTA WITH OXIA   GreekUppercase Not IDNA2008
U+1FE0 Greek GREEK SMALL LETTER UPSILON WITH VRACHY [100] historic polytoniko orthography
U+1FE1 Greek GREEK SMALL LETTER UPSILON WITH MACRON [100] historic polytoniko orthography
U+1FE2 Greek GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND VARIA [100] historic polytoniko orthography
U+1FE4 Greek GREEK SMALL LETTER RHO WITH PSILI [100] historic polytoniko orthography
U+1FE5 Greek GREEK SMALL LETTER RHO WITH DASIA [100] historic polytoniko orthography
U+1FE6 Greek GREEK SMALL LETTER UPSILON WITH PERISPOMENI [100] historic polytoniko orthography
U+1FE7 Greek GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND PERISPOMENI [100] historic polytoniko orthography
U+1FE8 Greek GREEK CAPITAL LETTER UPSILON WITH VRACHY   GreekUppercase Not IDNA2008
U+1FE9 Greek GREEK CAPITAL LETTER UPSILON WITH MACRON   GreekUppercase Not IDNA2008
U+1FEA Greek GREEK CAPITAL LETTER UPSILON WITH VARIA   GreekUppercase Not IDNA2008
U+1FEB Greek GREEK CAPITAL LETTER UPSILON WITH OXIA   GreekUppercase Not IDNA2008
U+1FF2 Greek GREEK SMALL LETTER OMEGA WITH VARIA AND YPOGEGRAMMENI     NOT IDNA2008
U+1FF3 Greek GREEK SMALL LETTER OMEGA WITH YPOGEGRAMMENI     NOT IDNA2008
U+1FF4 Greek GREEK SMALL LETTER OMEGA WITH OXIA AND YPOGEGRAMMENI     NOT IDNA2008
U+1FF6 Greek GREEK SMALL LETTER OMEGA WITH PERISPOMENI [100] historic polytoniko orthography
U+1FF7 Greek GREEK SMALL LETTER OMEGA WITH PERISPOMENI AND YPOGEGRAMMENI     NOT IDNA2008
U+1FF8 Greek GREEK CAPITAL LETTER OMICRON WITH VARIA   GreekUppercase Not IDNA2008
U+1FFA Greek GREEK CAPITAL LETTER OMEGA WITH VARIA   GreekUppercase Not IDNA2008
U+1FFC Greek GREEK CAPITAL LETTER OMEGA WITH PROSGEGRAMMENI     NOT IDNA2008
U+2010 Common HYPHEN   Inclusion NOT IDNA2008
U+2019 Common RIGHT SINGLE QUOTATION MARK   Inclusion NOT IDNA2008
U+2027 Common HYPHENATION POINT   Inclusion NOT IDNA2008
U+2D27 Georgian GEORGIAN SMALL LETTER YN [100] religious_use Khutsuri
U+2D2D Georgian GEORGIAN SMALL LETTER AEN [100] religious_use Khutsuri
U+2DA0 Ethiopic ETHIOPIC SYLLABLE SSA [100] uncommon_use Sebatbeit
U+2DA1 Ethiopic ETHIOPIC SYLLABLE SSU [100] uncommon_use Sebatbeit
U+2DA2 Ethiopic ETHIOPIC SYLLABLE SSI [100] uncommon_use Sebatbeit
U+2DA3 Ethiopic ETHIOPIC SYLLABLE SSAA [100] uncommon_use Sebatbeit
U+2DA4 Ethiopic ETHIOPIC SYLLABLE SSEE [100] uncommon_use Sebatbeit
U+2DA5 Ethiopic ETHIOPIC SYLLABLE SSE [100] uncommon_use Sebatbeit
U+2DA6 Ethiopic ETHIOPIC SYLLABLE SSO [100] uncommon_use Sebatbeit
U+2DA8 Ethiopic ETHIOPIC SYLLABLE CCA [100] uncommon_use Sebatbeit
U+2DA9 Ethiopic ETHIOPIC SYLLABLE CCU [100] uncommon_use Sebatbeit
U+2DAA Ethiopic ETHIOPIC SYLLABLE CCI [100] uncommon_use Sebatbeit
U+2DAB Ethiopic ETHIOPIC SYLLABLE CCAA [100] uncommon_use Sebatbeit
U+2DAC Ethiopic ETHIOPIC SYLLABLE CCEE [100] uncommon_use Sebatbeit
U+2DAD Ethiopic ETHIOPIC SYLLABLE CCE [100] uncommon_use Sebatbeit
U+2DAE Ethiopic ETHIOPIC SYLLABLE CCO [100] uncommon_use Sebatbeit
U+2DB0 Ethiopic ETHIOPIC SYLLABLE ZZA [100] uncommon_use Sebatbeit
U+2DB1 Ethiopic ETHIOPIC SYLLABLE ZZU [100] uncommon_use Sebatbeit
U+2DB2 Ethiopic ETHIOPIC SYLLABLE ZZI [100] uncommon_use Sebatbeit
U+2DB3 Ethiopic ETHIOPIC SYLLABLE ZZAA [100] uncommon_use Sebatbeit
U+2DB4 Ethiopic ETHIOPIC SYLLABLE ZZEE [100] uncommon_use Sebatbeit
U+2DB5 Ethiopic ETHIOPIC SYLLABLE ZZE [100] uncommon_use Sebatbeit
U+2DB6 Ethiopic ETHIOPIC SYLLABLE ZZO [100] uncommon_use Sebatbeit
U+2DB8 Ethiopic ETHIOPIC SYLLABLE CCHA [100] uncommon_use Sebatbeit
U+2DB9 Ethiopic ETHIOPIC SYLLABLE CCHU [100] uncommon_use Sebatbeit
U+2DBA Ethiopic ETHIOPIC SYLLABLE CCHI [100] uncommon_use Sebatbeit
U+2DBB Ethiopic ETHIOPIC SYLLABLE CCHAA [100] uncommon_use Sebatbeit
U+2DBC Ethiopic ETHIOPIC SYLLABLE CCHEE [100] uncommon_use Sebatbeit
U+2DBD Ethiopic ETHIOPIC SYLLABLE CCHE [100] uncommon_use Sebatbeit
U+2DBE Ethiopic ETHIOPIC SYLLABLE CCHO [100] uncommon_use Sebatbeit
U+2DC0 Ethiopic ETHIOPIC SYLLABLE QYA [100] uncommon_use Sebatbeit
U+2DC1 Ethiopic ETHIOPIC SYLLABLE QYU [100] uncommon_use Sebatbeit
U+2DC2 Ethiopic ETHIOPIC SYLLABLE QYI [100] uncommon_use Sebatbeit
U+2DC3 Ethiopic ETHIOPIC SYLLABLE QYAA [100] uncommon_use Sebatbeit
U+2DC4 Ethiopic ETHIOPIC SYLLABLE QYEE [100] uncommon_use Sebatbeit
U+2DC5 Ethiopic ETHIOPIC SYLLABLE QYE [100] uncommon_use Sebatbeit
U+2DC6 Ethiopic ETHIOPIC SYLLABLE QYO [100] uncommon_use Sebatbeit
U+2DC8 Ethiopic ETHIOPIC SYLLABLE KYA [100] uncommon_use Sebatbeit
U+2DC9 Ethiopic ETHIOPIC SYLLABLE KYU [100] uncommon_use Sebatbeit
U+2DCA Ethiopic ETHIOPIC SYLLABLE KYI [100] uncommon_use Sebatbeit
U+2DCB Ethiopic ETHIOPIC SYLLABLE KYAA [100] uncommon_use Sebatbeit
U+2DCC Ethiopic ETHIOPIC SYLLABLE KYEE [100] uncommon_use Sebatbeit
U+2DCD Ethiopic ETHIOPIC SYLLABLE KYE [100] uncommon_use Sebatbeit
U+2DCE Ethiopic ETHIOPIC SYLLABLE KYO [100] uncommon_use Sebatbeit
U+2DD0 Ethiopic ETHIOPIC SYLLABLE XYA [100] uncommon_use Sebatbeit
U+2DD1 Ethiopic ETHIOPIC SYLLABLE XYU [100] uncommon_use Sebatbeit
U+2DD2 Ethiopic ETHIOPIC SYLLABLE XYI [100] uncommon_use Sebatbeit
U+2DD3 Ethiopic ETHIOPIC SYLLABLE XYAA [100] uncommon_use Sebatbeit
U+2DD4 Ethiopic ETHIOPIC SYLLABLE XYEE [100] uncommon_use Sebatbeit
U+2DD5 Ethiopic ETHIOPIC SYLLABLE XYE [100] uncommon_use Sebatbeit
U+2DD6 Ethiopic ETHIOPIC SYLLABLE XYO [100] uncommon_use Sebatbeit
U+2DD8 Ethiopic ETHIOPIC SYLLABLE GYA [100] uncommon_use Sebatbeit
U+2DD9 Ethiopic ETHIOPIC SYLLABLE GYU [100] uncommon_use Sebatbeit
U+2DDA Ethiopic ETHIOPIC SYLLABLE GYI [100] uncommon_use Sebatbeit
U+2DDB Ethiopic ETHIOPIC SYLLABLE GYAA [100] uncommon_use Sebatbeit
U+2DDC Ethiopic ETHIOPIC SYLLABLE GYEE [100] uncommon_use Sebatbeit
U+2DDD Ethiopic ETHIOPIC SYLLABLE GYE [100] uncommon_use Sebatbeit
U+2DDE Ethiopic ETHIOPIC SYLLABLE GYO [100] uncommon_use Sebatbeit
U+3099  ゙ Inherited COMBINING KATAKANA-HIRAGANA VOICED SOUND MARK [100] punctuation  
U+309A  ゚ Inherited COMBINING KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK [100] punctuation  
U+3105 Bopomofo BOPOMOFO LETTER B [100] limited_use educational only
U+3106 Bopomofo BOPOMOFO LETTER P [100] limited_use educational only
U+3107 Bopomofo BOPOMOFO LETTER M [100] limited_use educational only
U+3108 Bopomofo BOPOMOFO LETTER F [100] limited_use educational only
U+3109 Bopomofo BOPOMOFO LETTER D [100] limited_use educational only
U+310A Bopomofo BOPOMOFO LETTER T [100] limited_use educational only
U+310B Bopomofo BOPOMOFO LETTER N [100] limited_use educational only
U+310C Bopomofo BOPOMOFO LETTER L [100] limited_use educational only
U+310D Bopomofo BOPOMOFO LETTER G [100] limited_use educational only
U+310E Bopomofo BOPOMOFO LETTER K [100] limited_use educational only
U+310F Bopomofo BOPOMOFO LETTER H [100] limited_use educational only
U+3110 Bopomofo BOPOMOFO LETTER J [100] limited_use educational only
U+3111 Bopomofo BOPOMOFO LETTER Q [100] limited_use educational only
U+3112 Bopomofo BOPOMOFO LETTER X [100] limited_use educational only
U+3113 Bopomofo BOPOMOFO LETTER ZH [100] limited_use educational only
U+3114 Bopomofo BOPOMOFO LETTER CH [100] limited_use educational only
U+3115 Bopomofo BOPOMOFO LETTER SH [100] limited_use educational only
U+3116 Bopomofo BOPOMOFO LETTER R [100] limited_use educational only
U+3117 Bopomofo BOPOMOFO LETTER Z [100] limited_use educational only
U+3118 Bopomofo BOPOMOFO LETTER C [100] limited_use educational only
U+3119 Bopomofo BOPOMOFO LETTER S [100] limited_use educational only
U+311A Bopomofo BOPOMOFO LETTER A [100] limited_use educational only
U+311B Bopomofo BOPOMOFO LETTER O [100] limited_use educational only
U+311C Bopomofo BOPOMOFO LETTER E [100] limited_use educational only
U+311D Bopomofo BOPOMOFO LETTER EH [100] limited_use educational only
U+311E Bopomofo BOPOMOFO LETTER AI [100] limited_use educational only
U+311F Bopomofo BOPOMOFO LETTER EI [100] limited_use educational only
U+3120 Bopomofo BOPOMOFO LETTER AU [100] limited_use educational only
U+3121 Bopomofo BOPOMOFO LETTER OU [100] limited_use educational only
U+3122 Bopomofo BOPOMOFO LETTER AN [100] limited_use educational only
U+3123 Bopomofo BOPOMOFO LETTER EN [100] limited_use educational only
U+3124 Bopomofo BOPOMOFO LETTER ANG [100] limited_use educational only
U+3125 Bopomofo BOPOMOFO LETTER ENG [100] limited_use educational only
U+3126 Bopomofo BOPOMOFO LETTER ER [100] limited_use educational only
U+3127 Bopomofo BOPOMOFO LETTER I [100] limited_use educational only
U+3128 Bopomofo BOPOMOFO LETTER U [100] limited_use educational only
U+3129 Bopomofo BOPOMOFO LETTER IU [100] limited_use educational only
U+312A Bopomofo BOPOMOFO LETTER V [100] limited_use educational only
U+312B Bopomofo BOPOMOFO LETTER NG [100] limited_use educational only
U+312C Bopomofo BOPOMOFO LETTER GN [100] limited_use educational only
U+312D Bopomofo BOPOMOFO LETTER IH [100] limited_use educational only
U+312F Bopomofo BOPOMOFO LETTER NN [100] limited_use educational only
U+31A0 Bopomofo BOPOMOFO LETTER BU [100] limited_use educational only
U+31A1 Bopomofo BOPOMOFO LETTER ZI [100] limited_use educational only
U+31A2 Bopomofo BOPOMOFO LETTER JI [100] limited_use educational only
U+31A3 Bopomofo BOPOMOFO LETTER GU [100] limited_use educational only
U+31A4 Bopomofo BOPOMOFO LETTER EE [100] limited_use educational only
U+31A5 Bopomofo BOPOMOFO LETTER ENN [100] limited_use educational only
U+31A6 Bopomofo BOPOMOFO LETTER OO [100] limited_use educational only
U+31A7 Bopomofo BOPOMOFO LETTER ONN [100] limited_use educational only
U+31A8 Bopomofo BOPOMOFO LETTER IR [100] limited_use educational only
U+31A9 Bopomofo BOPOMOFO LETTER ANN [100] limited_use educational only
U+31AA Bopomofo BOPOMOFO LETTER INN [100] limited_use educational only
U+31AB Bopomofo BOPOMOFO LETTER UNN [100] limited_use educational only
U+31AC Bopomofo BOPOMOFO LETTER IM [100] limited_use educational only
U+31AD Bopomofo BOPOMOFO LETTER NGG [100] limited_use educational only
U+31AE Bopomofo BOPOMOFO LETTER AINN [100] limited_use educational only
U+31AF Bopomofo BOPOMOFO LETTER AUNN [100] limited_use educational only
U+31B0 Bopomofo BOPOMOFO LETTER AM [100] limited_use educational only
U+31B1 Bopomofo BOPOMOFO LETTER OM [100] limited_use educational only
U+31B2 Bopomofo BOPOMOFO LETTER ONG [100] limited_use educational only
U+31B3 Bopomofo BOPOMOFO LETTER INNN [100] limited_use educational only
U+31B4 Bopomofo BOPOMOFO FINAL LETTER P [100] limited_use educational only
U+31B5 Bopomofo BOPOMOFO FINAL LETTER T [100] limited_use educational only
U+31B6 Bopomofo BOPOMOFO FINAL LETTER K [100] limited_use educational only
U+31B7 Bopomofo BOPOMOFO FINAL LETTER H [100] limited_use educational only
U+31B8 Bopomofo BOPOMOFO LETTER GH [100] limited_use educational only
U+31B9 Bopomofo BOPOMOFO LETTER LH [100] limited_use educational only
U+31BA Bopomofo BOPOMOFO LETTER ZY [100] limited_use educational only
U+A67F Cyrillic CYRILLIC PAYEROK [100] obsolete  
U+A717 Common MODIFIER LETTER DOT VERTICAL BAR [100] modifier no modifiers
U+A718 Common MODIFIER LETTER DOT SLASH [100] modifier no modifiers
U+A719 Common MODIFIER LETTER DOT HORIZONTAL BAR [100] modifier no modifiers
U+A71A Common MODIFIER LETTER LOWER RIGHT CORNER ANGLE [100] modifier no modifiers
U+A71B Common MODIFIER LETTER RAISED UP ARROW [100] modifier no modifiers
U+A71C Common MODIFIER LETTER RAISED DOWN ARROW [100] modifier no modifiers
U+A71D Common MODIFIER LETTER RAISED EXCLAMATION MARK [100] modifier no modifiers
U+A71E Common MODIFIER LETTER RAISED INVERTED EXCLAMATION MARK [100] modifier no modifiers
U+A71F Common MODIFIER LETTER LOW INVERTED EXCLAMATION MARK [100] modifier no modifiers
U+A788 Common MODIFIER LETTER LOW CIRCUMFLEX ACCENT [100] technical  
U+A792 Latin LATIN CAPITAL LETTER C WITH BAR   Uppercase Not IDNA2008
U+A793 Latin LATIN SMALL LETTER C WITH BAR [100] limited_use Nanai
U+A7C0 Latin LATIN CAPITAL LETTER OLD POLISH O   Uppercase Not yet analyzed for MSR
U+A7C1 Latin LATIN SMALL LETTER OLD POLISH O   tentatively-obsolete Old Polish
U+A7C2 Latin LATIN CAPITAL LETTER ANGLICANA W   Uppercase Not IDNA2008
U+A7C3 Latin LATIN SMALL LETTER ANGLICANA W [100] uncommon_use (medieval English/Cornish) (12.0 17/238)
U+A7C4 Latin LATIN CAPITAL LETTER C WITH PALATAL HOOK   Uppercase Not IDNA2008
U+A7C5 Latin LATIN CAPITAL LETTER S WITH HOOK   Uppercase Not IDNA2008
U+A7C6 Latin LATIN CAPITAL LETTER Z WITH PALATAL HOOK   Uppercase Not IDNA2008
U+A7C7 Latin LATIN CAPITAL LETTER D WITH SHORT STROKE OVERLAY     Not yet analyzed for MSR
U+A7C8 Latin LATIN SMALL LETTER D WITH SHORT STROKE OVERLAY   tentatively-obsolete Not yet analyzed for MSR
U+A7C9 Latin LATIN CAPITAL LETTER S WITH SHORT STROKE OVERLAY   Uppercase Not yet analyzed for MSR
U+A7CA Latin LATIN SMALL LETTER S WITH SHORT STROKE OVERLAY   tentatively-obsolete Not yet analyzed for MSR
U+A7D0 Latin LATIN CAPITAL LETTER CLOSED INSULAR G   Uppercase Not IDNA2008
U+A7D1 Latin LATIN SMALL LETTER CLOSED INSULAR G   tentatively-obsolete Ormulum
U+A7D3 Latin LATIN SMALL LETTER DOUBLE THORN   tentatively-obsolete Ormulum
U+A7D5 Latin LATIN SMALL LETTER DOUBLE WYNN   tentatively-obsolete Ormulum
U+A7D6 Latin LATIN CAPITAL LETTER MIDDLE SCOTS S   Uppercase Not IDNA2008
U+A7D7 Latin LATIN SMALL LETTER MIDDLE SCOTS S   tentatively-obsolete Middle Scots
U+A7D8 Latin LATIN CAPITAL LETTER SIGMOID S   Uppercase Not IDNA2008
U+A7D9 Latin LATIN SMALL LETTER SIGMOID S   tentatively-obsolete Middle Cornish, English, Scots
U+A9E7 Myanmar MYANMAR LETTER TAI LAING NYA [100] uncommon_use (Tai Laing) (7.0 11/130R)
U+A9E8 Myanmar MYANMAR LETTER TAI LAING FA [100] uncommon_use (Tai Laing) (7.0 11/130R)
U+A9E9 Myanmar MYANMAR LETTER TAI LAING GA [100] uncommon_use (Tai Laing) (7.0 11/130R)
U+A9EA Myanmar MYANMAR LETTER TAI LAING GHA [100] uncommon_use (Tai Laing) (7.0 11/130R)
U+A9EB Myanmar MYANMAR LETTER TAI LAING JA [100] uncommon_use (Tai Laing) (7.0 11/130R)
U+A9EC Myanmar MYANMAR LETTER TAI LAING JHA [100] uncommon_use (Tai Laing) (7.0 11/130R)
U+A9ED Myanmar MYANMAR LETTER TAI LAING DDA [100] uncommon_use (Tai Laing) (7.0 11/130R)
U+A9EE Myanmar MYANMAR LETTER TAI LAING DDHA [100] uncommon_use (Tai Laing) (7.0 11/130R)
U+A9EF Myanmar MYANMAR LETTER TAI LAING NNA [100] uncommon_use (Tai Laing) (7.0 11/130R)
U+A9FA Myanmar MYANMAR LETTER TAI LAING LLA [100] uncommon_use (Tai Laing) (7.0 11/130R)
U+A9FB Myanmar MYANMAR LETTER TAI LAING DA [100] uncommon_use (Tai Laing) (7.0 11/130R)
U+A9FC Myanmar MYANMAR LETTER TAI LAING DHA [100] uncommon_use (Tai Laing) (7.0 11/130R)
U+A9FD Myanmar MYANMAR LETTER TAI LAING BA [100] uncommon_use (Tai Laing) (7.0 11/130R)
U+A9FE Myanmar MYANMAR LETTER TAI LAING BHA [100] uncommon_use (Tai Laing) (7.0 11/130R)
U+AA60 Myanmar MYANMAR LETTER KHAMTI GA [100] uncommon_use Khamti Shan
U+AA61 Myanmar MYANMAR LETTER KHAMTI CA [100] uncommon_use Khamti Shan
U+AA62 Myanmar MYANMAR LETTER KHAMTI CHA [100] uncommon_use Khamti Shan
U+AA63 Myanmar MYANMAR LETTER KHAMTI JA [100] uncommon_use Khamti Shan
U+AA64 Myanmar MYANMAR LETTER KHAMTI JHA [100] uncommon_use Khamti Shan
U+AA65 Myanmar MYANMAR LETTER KHAMTI NYA [100] uncommon_use Khamti Shan
U+AA66 Myanmar MYANMAR LETTER KHAMTI TTA [100] uncommon_use Khamti Shan
U+AA67 Myanmar MYANMAR LETTER KHAMTI TTHA [100] uncommon_use Khamti Shan
U+AA68 Myanmar MYANMAR LETTER KHAMTI DDA [100] uncommon_use Khamti Shan
U+AA69 Myanmar MYANMAR LETTER KHAMTI DDHA [100] uncommon_use Khamti Shan
U+AA6A Myanmar MYANMAR LETTER KHAMTI DHA [100] uncommon_use Khamti Shan
U+AA6B Myanmar MYANMAR LETTER KHAMTI NA [100] uncommon_use Khamti Shan
U+AA6C Myanmar MYANMAR LETTER KHAMTI SA [100] uncommon_use Khamti Shan
U+AA6D Myanmar MYANMAR LETTER KHAMTI HA [100] uncommon_use Khamti Shan
U+AA6E Myanmar MYANMAR LETTER KHAMTI LLA [100] uncommon_use Khamti Shan
U+AA6F Myanmar MYANMAR LETTER KHAMTI FA [100] uncommon_use Khamti Shan
U+AA70 Myanmar MYANMAR MODIFIER LETTER KHAMTI REDUPLICATION [100] uncommon_use Khamti Shan
U+AA71 Myanmar MYANMAR LETTER KHAMTI XA [100] uncommon_use Khamti Shan
U+AA72 Myanmar MYANMAR LETTER KHAMTI ZA [100] uncommon_use Khamti Shan
U+AA73 Myanmar MYANMAR LETTER KHAMTI RA [100] uncommon_use Khamti Shan
U+AA74 Myanmar MYANMAR LOGOGRAM KHAMTI OAY [100] uncommon_use Khamti Shan
U+AA75 Myanmar MYANMAR LOGOGRAM KHAMTI QN [100] uncommon_use Khamti Shan
U+AA76 Myanmar MYANMAR LOGOGRAM KHAMTI HM [100] uncommon_use Khamti Shan
U+AA7A Myanmar MYANMAR LETTER AITON RA [100] uncommon_use Aiton
U+AA7C  ꩼ Myanmar MYANMAR SIGN TAI LAING TONE-2 [100] uncommon_use (Tai Laing) (7.0 11/130R)
U+AA7D  ꩽ Myanmar MYANMAR SIGN TAI LAING TONE-5 [100] uncommon_use (Tai Laing) (7.0 11/130R)
U+AA7E Myanmar MYANMAR LETTER SHWE PALAUNG CHA [100] uncommon_use (Shwe Palaung) (7.0 11/130R)
U+AA7F ꩿ Myanmar MYANMAR LETTER SHWE PALAUNG SHA [100] uncommon_use (Shwe Palaung) (7.0 11/130R)
U+AB11 Ethiopic ETHIOPIC SYLLABLE DZU [100] uncommon_use Gamo-Gofa-Dawro
U+AB12 Ethiopic ETHIOPIC SYLLABLE DZI [100] uncommon_use Gamo-Gofa-Dawro
U+AB13 Ethiopic ETHIOPIC SYLLABLE DZAA [100] uncommon_use Gamo-Gofa-Dawro
U+AB14 Ethiopic ETHIOPIC SYLLABLE DZEE [100] uncommon_use Gamo-Gofa-Dawro
U+AB15 Ethiopic ETHIOPIC SYLLABLE DZE [100] uncommon_use Gamo-Gofa-Dawro
U+AB16 Ethiopic ETHIOPIC SYLLABLE DZO [100] uncommon_use Gamo-Gofa-Dawro
U+AB20 Ethiopic ETHIOPIC SYLLABLE CCHHA [100] uncommon_use Gumuz
U+AB21 Ethiopic ETHIOPIC SYLLABLE CCHHU [100] uncommon_use Gumuz
U+AB22 Ethiopic ETHIOPIC SYLLABLE CCHHI [100] uncommon_use Gumuz
U+AB23 Ethiopic ETHIOPIC SYLLABLE CCHHAA [100] uncommon_use Gumuz
U+AB24 Ethiopic ETHIOPIC SYLLABLE CCHHEE [100] uncommon_use Gumuz
U+AB25 Ethiopic ETHIOPIC SYLLABLE CCHHE [100] uncommon_use Gumuz
U+AB26 Ethiopic ETHIOPIC SYLLABLE CCHHO [100] uncommon_use Gumuz
U+AB28 Ethiopic ETHIOPIC SYLLABLE BBA [100] uncommon_use Gumuz
U+AB29 Ethiopic ETHIOPIC SYLLABLE BBU [100] uncommon_use Gumuz
U+AB2A Ethiopic ETHIOPIC SYLLABLE BBI [100] uncommon_use Gumuz
U+AB2B Ethiopic ETHIOPIC SYLLABLE BBAA [100] uncommon_use Gumuz
U+AB2C Ethiopic ETHIOPIC SYLLABLE BBEE [100] uncommon_use Gumuz
U+AB2D Ethiopic ETHIOPIC SYLLABLE BBE [100] uncommon_use Gumuz
U+AB2E Ethiopic ETHIOPIC SYLLABLE BBO [100] uncommon_use Gumuz
U+AB66 Latin LATIN SMALL LETTER DZ DIGRAPH WITH RETROFLEX HOOK [100] uncommon_use (Sinology) (12.0 17/299 17/367)
U+AB67 Latin LATIN SMALL LETTER TS DIGRAPH WITH RETROFLEX HOOK [100] uncommon_use (Sinology) (12.0 17/299 17/367)
U+1133B 𑌻 Inherited COMBINING BINDU BELOW     excluded
U+1B11F 𛄟 Hiragana HIRAGANA LETTER ARCHAIC WU   tentatively-obsolete  
U+1B120 𛄠 Katakana KATAKANA LETTER ARCHAIC YI   tentatively-obsolete  
U+1B121 𛄡 Katakana KATAKANA LETTER ARCHAIC YE   tentatively-obsolete  
U+1B122 𛄢 Katakana KATAKANA LETTER ARCHAIC WU   tentatively-obsolete  
U+1B132 𛄲 Hiragana HIRAGANA LETTER SMALL KO   tentatively-obsolete  
U+1B150 𛅐 Hiragana HIRAGANA LETTER SMALL WI [100] obsolete 12.0 16/354 16/385R
U+1B151 𛅑 Hiragana HIRAGANA LETTER SMALL WE [100] obsolete 12.0 16/354 16/385R
U+1B152 𛅒 Hiragana HIRAGANA LETTER SMALL WO [100] obsolete 12.0 16/354 16/385R
U+1B155 𛅕 Katakana KATAKANA LETTER SMALL KO   tentatively-obsolete  
U+1B164 𛅤 Katakana KATAKANA LETTER SMALL WI [100] obsolete 12.0 16/354 16/385R
U+1B165 𛅥 Katakana KATAKANA LETTER SMALL WE [100] obsolete 12.0 16/354 16/385R
U+1B166 𛅦 Katakana KATAKANA LETTER SMALL WO [100] obsolete 12.0 16/354 16/385R
U+1B167 𛅧 Katakana KATAKANA LETTER SMALL N [100] obsolete 12.0 16/354 16/385R
U+1DF00 𝼀 Latin LATIN SMALL LETTER FENG DIGRAPH WITH TRILL   tentatively-technical IPA
U+1DF01 𝼁 Latin LATIN SMALL LETTER REVERSED SCRIPT G   tentatively-technical IPA
U+1DF02 𝼂 Latin LATIN LETTER SMALL CAPITAL TURNED G   Uppercase IPA
U+1DF03 𝼃 Latin LATIN SMALL LETTER REVERSED K   tentatively-technical IPA
U+1DF04 𝼄 Latin LATIN LETTER SMALL CAPITAL L WITH BELT   Uppercase IPA
U+1DF05 𝼅 Latin LATIN SMALL LETTER LEZH WITH RETROFLEX HOOK   tentatively-technical IPA
U+1DF06 𝼆 Latin LATIN SMALL LETTER TURNED Y WITH BELT   tentatively-technical IPA
U+1DF07 𝼇 Latin LATIN SMALL LETTER REVERSED ENG   tentatively-technical IPA
U+1DF08 𝼈 Latin LATIN SMALL LETTER TURNED R WITH LONG LEG AND RETROFLEX HOOK   tentatively-technical IPA
U+1DF09 𝼉 Latin LATIN SMALL LETTER T WITH HOOK AND RETROFLEX HOOK   tentatively-technical IPA
U+1DF0A 𝼊 Latin LATIN LETTER RETROFLEX CLICK WITH RETROFLEX HOOK   tentatively-technical IPA
U+1DF0B 𝼋 Latin LATIN SMALL LETTER ESH WITH DOUBLE BAR   tentatively-technical IPA
U+1DF0C 𝼌 Latin LATIN SMALL LETTER ESH WITH DOUBLE BAR AND CURL   tentatively-technical IPA
U+1DF0D 𝼍 Latin LATIN SMALL LETTER TURNED T WITH CURL   tentatively-technical IPA
U+1DF0E 𝼎 Latin LATIN LETTER INVERTED GLOTTAL STOP WITH CURL   tentatively-technical IPA
U+1DF0F 𝼏 Latin LATIN LETTER STRETCHED C WITH CURL   tentatively-technical IPA
U+1DF10 𝼐 Latin LATIN LETTER SMALL CAPITAL TURNED K   Uppercase IPA
U+1DF11 𝼑 Latin LATIN SMALL LETTER L WITH FISHHOOK   tentatively-technical IPA
U+1DF12 𝼒 Latin LATIN SMALL LETTER DEZH DIGRAPH WITH PALATAL HOOK   tentatively-technical IPA
U+1DF13 𝼓 Latin LATIN SMALL LETTER L WITH BELT AND PALATAL HOOK   tentatively-technical IPA
U+1DF14 𝼔 Latin LATIN SMALL LETTER ENG WITH PALATAL HOOK   tentatively-technical IPA
U+1DF15 𝼕 Latin LATIN SMALL LETTER TURNED R WITH PALATAL HOOK   tentatively-technical IPA
U+1DF16 𝼖 Latin LATIN SMALL LETTER R WITH FISHHOOK AND PALATAL HOOK   tentatively-technical IPA
U+1DF17 𝼗 Latin LATIN SMALL LETTER TESH DIGRAPH WITH PALATAL HOOK   tentatively-technical IPA
U+1DF18 𝼘 Latin LATIN SMALL LETTER EZH WITH PALATAL HOOK   tentatively-technical IPA
U+1DF19 𝼙 Latin LATIN SMALL LETTER DEZH DIGRAPH WITH RETROFLEX HOOK   tentatively-technical IPA
U+1DF1A 𝼚 Latin LATIN SMALL LETTER I WITH STROKE AND RETROFLEX HOOK   tentatively-technical IPA
U+1DF1B 𝼛 Latin LATIN SMALL LETTER O WITH RETROFLEX HOOK   tentatively-technical IPA
U+1DF1C 𝼜 Latin LATIN SMALL LETTER TESH DIGRAPH WITH RETROFLEX HOOK   tentatively-technical IPA
U+1DF1D 𝼝 Latin LATIN SMALL LETTER C WITH RETROFLEX HOOK   tentatively-technical IPA
U+1DF1E 𝼞 Latin LATIN SMALL LETTER S WITH CURL   tentatively-technical IPA
U+1DF25 𝼥 Latin LATIN SMALL LETTER D WITH MID-HEIGHT LEFT HOOK   tentatively-technical Malayalam transliteration
U+1DF26 𝼦 Latin LATIN SMALL LETTER L WITH MID-HEIGHT LEFT HOOK   tentatively-technical Malayalam transliteration
U+1DF27 𝼧 Latin LATIN SMALL LETTER N WITH MID-HEIGHT LEFT HOOK   tentatively-technical Malayalam transliteration
U+1DF28 𝼨 Latin LATIN SMALL LETTER R WITH MID-HEIGHT LEFT HOOK   tentatively-technical Malayalam transliteration
U+1DF29 𝼩 Latin LATIN SMALL LETTER S WITH MID-HEIGHT LEFT HOOK   tentatively-technical Malayalam transliteration
U+1DF2A 𝼪 Latin LATIN SMALL LETTER T WITH MID-HEIGHT LEFT HOOK   tentatively-technical Malayalam transliteration
U+1E08F 𞂏 Cyrillic COMBINING CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I   tentatively-obsolete  

Legend

Code Point
A code point or code point sequence.
Glyph
The shape displayed depends on the fonts available to your browser.
Script
Shows the script property value from the Unicode Character Database. Combining marks may have the value Inherited and code points used with more than one script may have the value Common.
Name
Shows the character or sequence name from the Unicode Character Database.
Ref
Links to the references associated with the code point or sequence, if any.
Tags
LGR-defined tag values. Any tags matching the Unicode script property are suppressed in this view.
Comment
The comment as given in the XML file. However, if the comment for this row consists only of the code point or sequence name, it is suppressed in this view. By convention, comments starting with “=” denote an alias. If present, the symbol ⍟ marks a default item shared among a set of LGRs.

Variants

This LGR does not specify any variants.

Classes, Rules and Actions

Character Classes

Implicit (except script) 35

The following table lists all named and implicit classes with their definition and a list of their members intersected with the current repertoire (for larger classes, this list is elided).

Name Definition Count Members or Ranges Ref Comment
implicit Tag=context-other 3 {0375 05F3-05F4}   Any character tagged as context-other
implicit Tag=Default_Ignorable 396→0 {}   Any character tagged as Default_Ignorable
implicit Tag=Deprecated 15→0 {}   Any character tagged as Deprecated
implicit Tag=duplicate 2 {0F7B 0F7D}   Any character tagged as duplicate
implicit Tag=Exclusion 21647→0 {}   Any character tagged as Exclusion
implicit Tag=GreekUppercase 67 {03FD-03FF 1EF2 1F08-1F0F 1F18-1F1D 1F28-1F2F 1F38-1F3F 1F48-1F4D 1F59 1F5B 1F5D 1F5F 1F68-1F6F 1FB8-1FBA 1FC8 1FCA 1FD8-1FDB 1FE8-1FEB 1FF8 1FFA}   Any character tagged as GreekUppercase
implicit Tag=historic 74 {1F00-1F07 1F10-1F15 1F20-1F27 1F30-1F37 1F40-1F45 1F50-1F57 1F60-1F67 1F70 1F72 1F74 1F76 1F78 1F7A 1F7C 1FB6 1FC6 1FD0-1FD2 1FD6-1FD7 1FE0-1FE2 1FE4-1FE7 ...}   Any character tagged as historic
implicit Tag=Inclusion 13→12 {0027 002E 003A 0375 058A 05F3-05F4 06FD-06FE 2010 2019 2027}   Any character tagged as Inclusion
implicit Tag=limited_use 72→70 {3105-312D 312F 31A0-31BA A793}   Any character tagged as limited_use
implicit Tag=Limited_Use 5271→0 {}   Any character tagged as Limited_Use
implicit Tag=modifier 9 {A717-A71F}   Any character tagged as modifier
implicit Tag=MSR-non-Han 2494→0 {}   Any character tagged as MSR-non-Han
implicit Tag=MSR-pending 28→0 {}   Any character tagged as MSR-pending
implicit Tag=numeric 2 {0F3E-0F3F}   Any character tagged as numeric
implicit Tag=obsolete 118→95 {0138 0375 037B-037D 03FC 049D 04A7 04B9 0515 0517 0519 051B 051D 051F 0521 0523 0527 0529 052F 063B-063F 06AC 077E-077F 093D 0960-0963 0971 09BD 09E0-09E3 ...}   Any character tagged as obsolete
implicit Tag=Obsolete 1620→0 {}   Any character tagged as Obsolete
implicit Tag=phonetic 1→0 {}   The character tagged as phonetic
implicit Tag=poetic 12 {0201 0203 0205 0207 0209 020B 020D 020F 0211 0213 0215 0217}   Any character tagged as poetic
implicit Tag=punctuation 5 {02BB 06FD-06FE 3099-309A}   Any character tagged as punctuation
implicit Tag=RefLGR 2332→0 {}   Any character tagged as RefLGR
implicit Tag=RefLGRBySequence 13→0 {}   Any character tagged as RefLGRBySequence
implicit Tag=religious_use 8 {0653 0950 0A74 0AD0 0BD0 0F00 2D27 2D2D}   Any character tagged as religious_use
implicit Tag=symbol 10 {03FC 0950 0A74 0AD0 0BD0 0EAF 0F00 0F35 0F37 0FC6}   Any character tagged as symbol
implicit Tag=technical 47→46 {0201 0203 0205 0207 0209 020B 020D 020F 0211 0213 0215 0217 02BC 02EC 030F-0311 0313-0314 0324-0325 032D-032E 0330 0335 0338-0339 0342 0559 06E5-06E6 097D ...}   Any character tagged as technical
implicit Tag=Technical 1625→0 {}   Any character tagged as Technical
implicit Tag=tentatively-excluded 4 {05CD 0CDD 0CF3 0ECE}   Any character tagged as tentatively-excluded
implicit Tag=tentatively-historic 1 {08B5}   The character tagged as tentatively-historic
implicit Tag=tentatively-limited_use 41→12 {0889-088E 08C8 0C3C 0C5D 0CDD 0CF3 0ECE}   Any character tagged as tentatively-limited_use
implicit Tag=tentatively-obsolete 339→15 {A7C1 A7C8 A7CA A7D1 A7D3 A7D5 A7D7 A7D9 1B11F-1B122 1B132 1B155 1E08F}   Any character tagged as tentatively-obsolete
implicit Tag=tentatively-punctuation 2→0 {}   Any character tagged as tentatively-punctuation
implicit Tag=tentatively-religious 46→25 {0870-0888}   Any character tagged as tentatively-religious
implicit Tag=tentatively-technical 34 {1DF00-1DF01 1DF03 1DF05-1DF0F 1DF11-1DF1E 1DF25-1DF2A}   Any character tagged as tentatively-technical
implicit Tag=uncommon_use 242→233 {048B 048D 048F 04C4 04C6 04C8 04CA 04CE 04F7 04FB 04FD 04FF 0511 0513 05EF 08B2 08B6-08BA 09FE 0A01 0AFA-0AFF 0C01 0C04 0C80 0D00 0D54-0D56 0E86 0E89 0E8C ...}   Any character tagged as uncommon_use
implicit Tag=Uncommon_Use 346→0 {}   Any character tagged as Uncommon_Use
implicit Tag=Uppercase 61 {0200 0202 0204 0206 0208 020A 020C 020E 0210 0212 0214 0216 048A 048C 048E 049C 04A6 04B8 04C3 04C5 04C7 04C9 04CD 04F6 04FA 04FC 04FE 0510 0512 0514 0516 ...}   Any character tagged as Uppercase

Legend

Members or Ranges
Lists the members of the class as code points (xxx) or as ranges of code points (xxx-yyy). Any class too numerous to list in full is elided with "...".
m→n
Indicates a set for which only n of its m members fall inside the repertoire.
Tag=ttt
A named or implicit class defined by all code points that share the given tag value (ttt).
Implicit
An anonymous class implicitly defined based on tag value and for which there is no named equivalent.

Whole label evaluation and context rules

The LGR does not define any rules.

Actions

The LGR does not define any actions.

Table of References

The following lists the references cited for specific code points, variants, classes, rules or actions in this LGR.

[EGIDS] Lewis and Simons, EGIDS: Expanded Graded Intergenerational Disruption Scale,” documented in [SIL-Ethnologue] and summarized here:
https://en.wikipedia.org/wiki/Expanded_Graded_Intergenerational_Disruption_Scale_(EGIDS)
[MSR] ICANN, “Maximal Starting Repertoire”,
https://www.icann.org/resources/pages/msr-2015-06-21-en
[Proposal-Arabic] “Proposal for Arabic Script Root Zone LGR”,
https://www.icann.org/en/system/files/files/arabic-lgr-proposal-18nov15-en.pdf</reference>
[RefLGR] ICANN, “Second-Level Reference Label Generation Rules”,
https://www.icann.org/resources/pages/second-level-lgr-2015-06-21-en
[RefLGR-Overview] ICANN, “Reference Label Generation Rules (LGR) for the Second Level — Overview and Summary”,
https://www.icann.org/sites/default/files/packages/lgr/lgr-second-level-overview-summary-25oct24-en.pdf
[RZ-LGR] ICANN, “Root Zone Label Generation Rules”,
https://www.icann.org/resources/pages/root-zone-lgr-2015-06-21-en
[SIL-Ethnologue] David M. Eberhard, Gary F. Simons & Charles D. Fennig (eds.). 2021. Ethnologue: Languages of the World, Twenty fourth edition. Dallas, Texas: SIL International. Online version available as
https://www.ethnologue.com
[100] ICANN, Maximal Starting Repertoire - MSR-5:Annotated Repertoire Tables, Non-CJK, Date: 2021-06-24, available as
https://www.icann.org/en/system/files/files/msr-5-non-cjk-24jun21-en.pdf