Reclassify Identifier_Type for characters in restricted use | Selected-recommended-IdentifierType-not-in-MSR-and-RefLGR |
---|
This document is mechanically formatted from the above XML file for the LGR. It provides additional summary data and explanatory text. The XML file remains the sole normative specification of the LGR.
Date | 2025-02-05 |
---|---|
LGR Version | 16.0.0 |
Unicode Version | 16.0.0 |
Description
Partially updates
L2/19-329R
Characters excluded from both MSR and Reference LGR but allowed in UTS#39
This document has been submitted as a UTC document. For convenience in documenting the character list, it is presented using an LGR template format. A few minor details of the boilerplate in that template may not be applicable in this context and should be disregarded.
The collection includes 713 characters that are not part of the Maximal Starting Repertoire [MSR] because they were found to be in limited or otherwise restricted use and that are also excluded from Reference LGR [RefLGR]. In addition, the set includes the uppercase equivalents for 125 of them (Latin, Cyrillic and Greek), for a total of 769 characters. These characters are listed as Recommended or Inclusion in UTS#39 as of Unicode Version 16.0.0.
In review, some characters were found to be in active general use after all, at a level consistent with an Identifier_Type of Recommended. Such usage has been annotated, and a reference documenting that use has been cited. For those characters, no change to the Identifier_Type is proposed here.
Recommendation
Except for exceptions noted in review, these 769 characters should not be considered Recommended, based on an independent analysis [MSR] that has found indications that they should have been considered Uncommon_Use, Obsolete or Technical, or Exclusion or Inclusion. These indications were based on information available at the time of encoding, such as character proposal documents correlated with available information on the use of the writing systems these characters were intended for. Where they are already assigned Identifier_Type Inclusion, no change is recommended.
More specifically the proposal is to update the Identifier_Types as follows, based on the classification provided and as amended in review:
- Proposed:Technical (155) for members of the sets identified as “technical”, “poetic”, “religious_use”, “modifier” or “symbol” and “Uppercase” (where lowercase is “technical”)
- Proposed:Obsolete (336) for members of the sets identified as “obsolete”, “historic” or “polytoniko” and “Uppercase” (where lowercase is “obsolete”)
- Proposed:Uncommon_Use (281) for members of the sets identified as “uncommon_use”, “limited_use” and “Uppercase” (where lowercase is “uncommon_use”)
- Proposed:Exclusion (2) for members of the sets identified as “duplicate”
- Proposed:Inclusion (1) for a subset of set identified as “punctuation”
- Review_Needed (0) for tags of “numeric”, “context-other”, “punctuation” and no classification in [MSR] (this is a placeholder, as discussion proceeds characters will be moved out of this category into another.)
The following tags mark existing Identifier_Type values; if no proposed value is also present, the proposal is to not change the Identifier_Type:
- Inclusion (11) for characters currently assigned Inclusion; usually, because [MSR] classifies them as “punctuation” and Inclusion was reaffirmed in review.
- Recommended (2) for characters currently assigned Recommended; usually, because this was reaffirmed in review.
Review note: the proposal lists multiple Identifier_Type values in 13 cases where both Uncommon_Use and Obsolete apply, 3 where Uncommon_Use and Technical apply, and 3 cases where Technical and Obsolete apply.
The table in Section 2, “Repertoire” lists the characters with their proposed or existing Identifier_Type values in the “Tags” column, together with references to the source of their classification, as well as other information, such as any known languages.
Review Needed
A few characters have been given no classification in [MSR]. These are for the most part not in IDNA2008, or they represent punctuation characters with Identifier_Type Inclusion, which require no change. Some characters are classified as “context*” which makes them ineligible for the [MSR] and [RZ-LGR], but not the [RefLGR], but the decision of including them in the default identifiers should be reaffirmed in review.
The best matching Identifier_Type for characters tagged only as “punctuation” may need review; there's a strong bias against intra-word punctuation in the Reference LGRs and a complete prohibition in the DNS Root Zone (“letter principle”). Arguably, most intra-word punctuation characters not generic enough for use in IDNs might perhaps best be considered for optional Inclusion instead of Recommended, unless there's an indication that the use is uncommon or obsolete.
Characters that were only identified as “symbol” in [MSR] have been proposed here as Proposed:Technical as that seems the most likely Identifier_Type, unless additional evidence should suggest a different classification.
Background
There are over a thousand non-Han characters with Identifier_Type Recommended that are proposed for consideration of reclassification because they appear to fail reasonable criteria for being needed in identifiers. They come in two sets. For the set documented here, an independent analysis [MSR] has found indications that they should have been considered Uncommon_Use, Obsolete or Technical based on information available at the time of encoding, including their uppercase equivalents and any native digits that can be considered obsolete. The second set contains characters that were tentatively retained as Recommended in the [MSR] but upon further review by local expert teams from the [RZ-LGR] or [RefLGR] projects were found to not be needed after all for any language or minority language in reasonably widespread use. That set is discussed in another document.
The analysis on which the recommendations in this document are based was originally carried out for the purposes of defining the repertoire for IDN Top Level Domain names for the DNS Root Zone. There are some restrictions that are specific to the Root Zone, such as a prohibition on digits, so a follow-on effort determined how to relax these restrictions in a manner appropriate for the needs of Second-Level Domains. This resulted in the Second-Level Reference Label Generation Rules [RefLGR]. The characters listed here are those that were not included in either the [MSR] or the [RefLGR].
Because the original analysis [MSR] found specific indications why the characters in this document should not be recommended, they have been broken out separately and are listed in this document with the rationale for that exclusion provided. Notably, the review for purposes of the [RefLGR] did find that a few characters tentatively excluded in the [MSR] were indeed appropriate for domain names on the Second Level, while most were not.
The implication here is that any character not included in this set should be classified with some value other than Recommended for Unicode's default identifiers—until such time as independent evidence to the contrary is produced.
Arriving at a precise cutoff for the various Identifier_Type categories is difficult because there is no single source or perfect information on the use of writing systems and the de facto general-purpose subset thereof. In addition, the details of such use may change over time. For determining when to suggest “uncommon_use”, the [EGIDS] classification was used as a proxy for the likely level of modern use for the writing system(s) for which the character was encoded, but further adjustments were made in expert review. Where the status of a character was in doubt, for the purposes of [MSR], the decision was made to refrain from rejecting a character, but to defer the decision to expert review in the [RefLGR] project. Accordingly, this document suggests that the UTC should consider the published results of the cited research as one of the better sources of information available and only deviate from it on the basis of even better information.
The latest published result of the [MSR] project has a cutoff of Unicode Version 11.0. The current document includes characters added as Recommended between versions 12.0 and 16.0, for which a tentative analysis following the same criteria as in [MSR] is presented here. These should be treated as “tentatively analyzed” and have been identified accordingly. At this point there are no indications that any of these would be included in a future update of the [MSR] as allowed, and inclusion in either [RZ-LGR] or [RefLGR] in the near term also does not appear likely.
All decisions for the classification of characters in [MSR], or inclusion in [RZ-LGR] and [RefLGR] are documented and sourced on the character level; the same is not true for Unicode's classification, so it is not easily possible to verify any of the decisions that underlie the classification published in UTS39. By first making the alignment proposed here and then carefully documenting deviations, a positive side effect might be that the classification overall becomes more transparent and reviewable.
For further background on specific dispositions for characters in the DNS Root Zone [RZ-LGR] and Second-Level Reference LGR [RefLGR], see the cited references and links therein.
Discussion and Review
Domain names are an important and deliberately conservative set of identifiers. That said, there may be other classes of identifiers that don't require the same level of restrictions, so this proposal should not be understood to suggest that Default Identifiers must be restricted to only those characters that are being recommended for IDNs. Rather, the purpose is to bring the facts discovered during the development of the IDN repertoire for the DNS Root Zone and the [RefLGR] to the attention of the Unicode Technical Committee, so that characters that were classified Recommended can be given additional scrutiny before confirming their status.
Indic Scripts
In review, the following character has been identified that may well have documented use:
- U+093D ऽ DEVANAGARI SIGN AVAGRAHA - this sign is claimed to be in use in modern Hindi for writing foreign sounds in informal settings (cool: कूऽल) or stretched native ones (माँऽऽऽ!). See [Avagraha]. The suggested usage seems tenuous for the purposes of default identifiers and the shape, at least in Devanagari, is confusable with ASCII. Those factors argue against retaining this character as Recommended and it has been proposed as Obsolete.
Review note: We might benefit from an affirmative decision on whether this type of usage meets our cutoff for default identifier, and if not, whether to assign it Obsolete, as proposed here, or Uncommon_Use or Technical instead.
Greek Script
There is some evidence that Greek polytoniko orthography is considered to be of limited utility for domain names. The [Greek-IDN-Case-Study] report considers it obsolete for Greek and not useful to modern users, citing difficulty in accessing it from modern keyboards. This decision makes sense in an identifier context, particularly for the Root Zone. [Proposal-Greek] which only contains monotonic Greek, suggests in an aside that there might be a use case for polytonic Greek for the Second Level, particularly for certain traditionalist or religious organizations. However, the [RefLGR] project has not received a request to add a reference LGR for the second level supporting that orthography, thus limiting its availability for contracted domains. This makes sense given the ratio of monotonic vs. polytonic orthographies in public life. The [Greek-ccTLD] is an example of a registry that does allow polytonic Greek.
To complicate the matter, much of the precomposed uppercase Greek letters used for polytonic Greek are not PVALID in IDNA2008 due to complexities with uppercasing. These complexities make supporting the orthography complicated in cases where identifiers are not limited to lowercase letters as in IDN2008. While any identifier implementation and, within the restrictions of IDNA2008, any domain name registry is free to support polytonic Greek, it appears a poor choice for default identifiers. Accordingly, the proposal is to treat the affected characters as Obsolete; this affects all characters in the Greek Extended block and the YPOGEGRAMMENI.
Arabic Script
The following issues have been raised in review with respect to the Arabic script.
- U+063D ؽ ARABIC LETTER FARSI YEH WITH INVERTED V - this letter can be found to be in use with Azerbaijani and should therefore not be considered “obsolete, historic”. For documentation, see [AZ] and [Azerbaijani-Encoding-Proposal]. It is proposed to retain this character as Recommended.
Tibetan Script
No definite recommendations can be made for the Tibetan script at this time. It is considered by ICANN as eligible for the Root Zone in principle, but work on defining the label generation rules has faced some obstacles and has not commenced. Nevertheless, Tibetan was covered in the MSR analysis, and for some characters a suggested adjustment of Identifier_Type can be provided; two are called out as being homoglyph digraphs, constituting a “duplicate” of a two-character sequence; these have been proposed for Exclusion.
Beyond applying that limited reassessment, it might be reasonable to reflect the preliminary nature of the understanding of the likely use of Tibetan in identifiers by also not giving any of the Tibetan characters an Identifier_Type Recommended until some technical body, project, or group has created a definite analysis of this script for identifier purposes. (Only Tibetan characters with proposed exclusion from the [MSR] have been included in this document).
Bopomofo Script
The [MSR] and [RefLGR] exclude the Bopomofo script, considering the entire script special use as it tends to be used almost exclusively in education. A separate proposal exists to change the status of the script in UAX31 to Limited_Use. If the proposal to reclassify the script is accepted, it would take precedence. Therefore, Bopomofo characters U+3105 ㄅ..U+312D ㄭ, U+312F ㄯ and U+31A0 ㆠ..U+31BA ㆺ have been removed from this document.
Uppercase Characters
Uppercase characters must share the Identifier_Type of their lowercase equivalent. This is to ensure that default identifiers do not change validity if case mapped. In assembling this data set, a few uppercase characters were spotted that are currently Recommended, but for which the lowercase equivalents were already not Recommended. For these, Identifier_Type values matching their lowercase equivalents are proposed here. In addition, it is recommended that an invariant test be created to automatically verify the relation between Identifier_Type for lowercase and uppercase characters. Because uppercase characters are not independently evaluated, their table entries do not cite a reference containing source information leading to the proposed Identifier_Type value.
Digits
If a writing system shares a script covered in the [MSR] but extensions for it were excluded, any native digits for that writing system are given proposed Identifier_Type values matching those of the letters.
Intra-Word or Word-Related Punctuation
A number of these have been made Inclusion in the existing Identifier_Type. However, some are Recommended.
- U+005F _ LOW LINE is currently Recommended, making it a bit of an outlier.
It might be useful to ask whether it wouldn't be cleaner for default identifiers to leave the entire set of these for explicit inclusion. That would separate the design of an identifier syntax from the default repertoire.
Combining Marks
Needed for NFD:—Where combining marks (such as U+3099 ゙ ) are only needed for decompositions of recommended characters, it was proposed to focus on the NFC format for Identifier_Type but document in UTS#39 that combining characters may be marked as Uncommon_Use even when they are in the NFD version of a modern language's exemplar characters.
The following text is proposed for UTS#39:
Where combining marks, such as U+3099 ゙ COMBINING KATAKANA-HIRAGANA VOICED SOUND MARK are only needed for NFD of recommended characters, they have been given Identifier_Type Uncommon_Use. Identifier systems that work with unnormalized text, or text in NDF may wish to add the full set of characters required in canonical decompositions.
High Risk Characters
Certain characters are particularly risky for security-relevant identifiers, such as IDNs. These include characters that look like punctuation marks. The usual mitigation mechanisms for visually equivalent characters apply to situations where both partners are in the repertoire. By definition, that is not the case for these, as the confusability is with delimiting punctuation, which would falsely suggest part of the string is outside the label. They should therefore never be recommended for default identifiers but given Identifier_Type Exclusion or Inclusion. Further analysis should be carried out in the context of a separate proposal.
Defaults for Newly Encoded Characters
Characters newly encoded for existing scripts in Unicode 17.0 and after should not be given Identifier_Type Recommended by default, as most often, scripts are supplemented with additional Technical or Obsolete characters. For modern characters, unless they are intended for support of a particularly vigorous language with well established orthography in widespread use, they should be considered Uncommon_Use by default, unless they are clearly Technical in nature. (Note: characters encoded since the cutoff for the last MSR version have been tentatively analyzed here using the same principles as in [MSR].)
Presentation of Data
The table in Section 2, “Repertoire" lists the characters affected by this proposal.
- All characters have tags indicating their existing or proposed Identifier_Type based on the classification in [MSR] and other available evidence, including that received in review.
- The classification of characters excluded from the MSR is a bit more fine-grained than the current values for Identifier Type, which means that the proposed Identifier_Type values correspond to a combination of MSR classifications. In some cases, this leads to two values for the proposed Identifier_Type.
- Uppercase equivalents are noted as Uppercase in the comments. As uppercase characters are not used in IDNA2008, they are not independently evaluated here but share the proposed Identifier_Type of their lowercase equivalent.
- Appropriate Identifier_Type values have been proposed for any other characters not in IDNA2008.
- Comments on characters give further information, such as the name of a language for which a character can be considered in limited, technical or historical use, or a UTC document reference.
Character Classes
The table in Section 4.1, “Character Classes” presents information about a number of sets collecting characters with different attributes or properties. For each set, a count of members is given. An optional arrow (→) indicates that only a smaller subset of the given set is actually found in this document. For example, the notation 5271→0 for the set of characters tagged with Identifier_Type Limited_Use indicates that none of the characters here currently have that Identifier_Type value, which is as expected, as this document only discusses characters that are currently Recommended or Inclusion.
Contributors
The excerpt that this proposal is based on was prepared by Asmus Freytag, based on published data found in [RefLGR] and reference information from [MSR]. For details on the process and contributors to those projects, see [RefLGR-Overview], in particular, Section 1, “Overview” and Section 6, “Contributors”. Mark Davis, Michel Suignard and Roozbeh Pournader have contributed feedback and/or additional classifications to this proposal.
Notable Changes and Updates
- U+063D ؽ ARABIC LETTER FARSI YEH WITH INVERTED V — The type for this character is retained as Recommended; it is anticipated that this change will also be reflected in a future update of [MSR]. The analysis for several recently encoded characters up to Unicode Version 16.0 has been added.
- U+02BB ʻ MODIFIER LETTER TURNED COMMA — The proposed type for this character has changed to Inclusion; this reflects the fact that this character is confusable with some syntax characters.
- U+0375 ͵ GREEK LOWER NUMERAL SIGN — The proposed type for this character has been changed from Inclusion to Technical; this reflects the fact that it is claimed to be in current use to mark text as numeric.
- U+0F3E ༾ TIBETAN SIGN YAR TSHES and U+0F3F ༿ TIBETAN SIGN MAR TSHES — The proposed type for these characters has changed to Technical; this reflects their use in converting numbers to astrological signs.
- U+3099 ゙ COMBINING KATAKANA-HIRAGANA VOICED SOUND MARK and U+309A ゚ COMBINING KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK — The proposed type for these characters has been changed to Uncommon_Use; this reflects that their use is only required in NFD.
Appendix: Unicode Sets
The following are listings of the sets of characters for each proposed Identifier_Type value. These can be used to import the values into property data or to convert them to Unicode set notations for additional comparisons. Elements of the sets are space-separated and consist either of bare hex codes for single characters or a pair of hex codes separated by HYPHEN-MINUS to indicate a range.
Review_Needed()
Proposed:Inclusion(02BB)
Proposed:Exclusion(0F7B 0F7D)
Proposed:Technical(0200-0217 02BC 02EC 030F-0311 0313-0314 0324-0325 032D 032E 0330 0335 0338 0339 0342 0375 03FC 0559 0653 06E5-06E6 0870-0888 0950 097D 0A74 0AD0 0B82 0BD0 0EAF 0F00 0F35 0F37 0F3E-0F3F 0FC6 10F9 10FA 1E00-1E01 1E18-1E1B 1E2A-1E2D 1E72-1E77 1FB0-1FB1 2D27 2D2D A717-A71F A788 A7C5-A7C6 1DF00-1DF1E 1DF25-1DF2A)
Proposed:Obsolete(0138 0345 037B-037D 03FC-03FF 049C-049D 04A6-04A7 04B8-04B9 0514-0523 0526-0529 052E-052F 063B-063C 063E-063F 06AC 077E-077F 08B5 093D 0960-0963 0971 09BD 09E0-09E3 0ABD 0AE0-0AE3 0B3D 0B60-0B61 0C3D 0C60-0C61 0CBD 0CE0-0CE3 0CF1-0CF2 0D3A 0D3D 0D4C 0D4E 0D60-0D61 0F6A 0F82-0F83 0F86-0F8F 1050-1059 17DC 1F00-1F15 1F18-1F1D 1F20-1F45 1F48-1F4D 1F50-1F57 1F59 1F5B 1F5D 1F5F-1F70 1F72 1F74 1F76 1F78 1F7A 1F7C 1F80-1FB4 1FB6-1FBA 1FBC 1FC2-1FC4 1FC6-1FC8 1FCA 1FCC 1FD0-1FD2 1FD6-1FDB 1FE0-1FE2 1FE4-1FEB 1FF2-1FF4 1FF6-1FF8 1FFA 1FFC A67F A7C0-A7C1 A7C4 A7C7-A7CA A7D0-A7D1 A7D3 A7D5-A7D9 1B11F-1B122 1B132 1B150-1B152 1B155 1B164-1B167 1E08F)
Proposed:Uncommon_Use(048A-048F 04C3-04CA 04CD-04CE 04F6-04F7 04FA-04FF 0510-0513 05EF 0889-088E 08B2 08B6-08BA 08C3-08C8 09FE 0A01 0AFA-0AFF 0B55 0C01 0C04 0C3C 0C5D 0C80 0CDD 0CF3 0D00 0D54-0D56 0E86 0E89 0E8C 0E8E-0E93 0E98 0EA0 0EA8-0EA9 0EAC 0EBA 0ECE 0F6A 0F6B-0F6C 0F82-0F83 0FAE-0FAF 0FB0 0FC6 1050-1059 1065-106D 106E-1070 1071 1072-1074 108E 109A-109B 109C-109D 10F9 10FA 10FD 10FE-10FF 1380-138F 2DA0-2DA6 2DA8-2DAE 2DB0-2DB6 2DB8-2DBE 2DC0-2DC6 2DC8-2DCE 2DD0-2DD6 2DD8-2DDE A792-A793 A7C2-A7C3 A9E7-A9EF A9F0-A9FE AA60-AA76 AA7A AA7C-AA7D AA7E-AA7F AB11-AB16 AB20-AB26 AB28-AB2E AB66-AB67 1133B)
Repertoire
Repertoire Summary
Number of elements in repertoire | 769 |
---|---|
Longest code point sequence | 1 |
Repertoire by Code Point
The following table lists the repertoire by code point (or code point sequence). The data in the Script and Name column are extracted from the Unicode character database. Where a comment in the original LGR is equal to the character name, it has been suppressed.
See also the legend provided below the table.
Code Point |
Glyph | Script | Name | Ref | Tags | Comment |
---|---|---|---|---|---|---|
U+0027 | ' | Common | APOSTROPHE | Inclusion | Not IDN2008 | |
U+002E | . | Common | FULL STOP | Inclusion | Not IDN2008 | |
U+003A | : | Common | COLON | Inclusion | Not IDN2008 | |
U+005F | _ | Common | LOW LINE | Recommended | Not IDN2008; why is this not “Inclusion”? | |
U+0138 | ĸ | Latin | LATIN SMALL LETTER KRA | [100] | Proposed:Obsolete | Greenlandic |
U+0200 | Ȁ | Latin | LATIN CAPITAL LETTER A WITH DOUBLE GRAVE | Proposed:Technical | Not IDNA2008; Uppercase | |
U+0201 | ȁ | Latin | LATIN SMALL LETTER A WITH DOUBLE GRAVE | [100] | Proposed:Technical | tone in poetry |
U+0202 | Ȃ | Latin | LATIN CAPITAL LETTER A WITH INVERTED BREVE | Proposed:Technical | Not IDNA2008; Uppercase | |
U+0203 | ȃ | Latin | LATIN SMALL LETTER A WITH INVERTED BREVE | [100] | Proposed:Technical | tone in poetry |
U+0204 | Ȅ | Latin | LATIN CAPITAL LETTER E WITH DOUBLE GRAVE | Proposed:Technical | Not IDNA2008; Uppercase | |
U+0205 | ȅ | Latin | LATIN SMALL LETTER E WITH DOUBLE GRAVE | [100] | Proposed:Technical | tone in poetry |
U+0206 | Ȇ | Latin | LATIN CAPITAL LETTER E WITH INVERTED BREVE | Proposed:Technical | Not IDNA2008; Uppercase | |
U+0207 | ȇ | Latin | LATIN SMALL LETTER E WITH INVERTED BREVE | [100] | Proposed:Technical | tone in poetry |
U+0208 | Ȉ | Latin | LATIN CAPITAL LETTER I WITH DOUBLE GRAVE | Proposed:Technical | Not IDNA2008; Uppercase | |
U+0209 | ȉ | Latin | LATIN SMALL LETTER I WITH DOUBLE GRAVE | [100] | Proposed:Technical | tone in poetry |
U+020A | Ȋ | Latin | LATIN CAPITAL LETTER I WITH INVERTED BREVE | Proposed:Technical | Not IDNA2008; Uppercase | |
U+020B | ȋ | Latin | LATIN SMALL LETTER I WITH INVERTED BREVE | [100] | Proposed:Technical | tone in poetry |
U+020C | Ȍ | Latin | LATIN CAPITAL LETTER O WITH DOUBLE GRAVE | Proposed:Technical | Not IDNA2008; Uppercase | |
U+020D | ȍ | Latin | LATIN SMALL LETTER O WITH DOUBLE GRAVE | [100] | Proposed:Technical | tone in poetry |
U+020E | Ȏ | Latin | LATIN CAPITAL LETTER O WITH INVERTED BREVE | Proposed:Technical | Not IDNA2008; Uppercase | |
U+020F | ȏ | Latin | LATIN SMALL LETTER O WITH INVERTED BREVE | [100] | Proposed:Technical | tone in poetry |
U+0210 | Ȑ | Latin | LATIN CAPITAL LETTER R WITH DOUBLE GRAVE | Proposed:Technical | Not IDNA2008; Uppercase | |
U+0211 | ȑ | Latin | LATIN SMALL LETTER R WITH DOUBLE GRAVE | [100] | Proposed:Technical | tone in poetry |
U+0212 | Ȓ | Latin | LATIN CAPITAL LETTER R WITH INVERTED BREVE | Proposed:Technical | Not IDNA2008; Uppercase | |
U+0213 | ȓ | Latin | LATIN SMALL LETTER R WITH INVERTED BREVE | [100] | Proposed:Technical | tone in poetry |
U+0214 | Ȕ | Latin | LATIN CAPITAL LETTER U WITH DOUBLE GRAVE | Proposed:Technical | Not IDNA2008; Uppercase | |
U+0215 | ȕ | Latin | LATIN SMALL LETTER U WITH DOUBLE GRAVE | [100] | Proposed:Technical | tone in poetry |
U+0216 | Ȗ | Latin | LATIN CAPITAL LETTER U WITH INVERTED BREVE | Proposed:Technical | Not IDNA2008; Uppercase | |
U+0217 | ȗ | Latin | LATIN SMALL LETTER U WITH INVERTED BREVE | [100] | Proposed:Technical | tone in poetry |
U+02BB | ʻ | Common | MODIFIER LETTER TURNED COMMA | [100] | Proposed:Inclusion | excluded from domain names because of similarity to punctuation |
U+02BC | ʼ | Common | MODIFIER LETTER APOSTROPHE | [100] | Proposed:Technical | excluded from domain names because of similarity to punctuation |
U+02EC | ˬ | Common | MODIFIER LETTER VOICING | [100] | Proposed:Technical | |
U+030F | ̏ | Inherited | COMBINING DOUBLE GRAVE ACCENT | [100] | Proposed:Technical | Serbian and Croatian poetics |
U+0310 | ̐ | Inherited | COMBINING CANDRABINDU | [100] | Proposed:Technical | |
U+0311 | ̑ | Inherited | COMBINING INVERTED BREVE | [100] | Proposed:Technical | Serbian and Croatian poetics |
U+0313 | ̓ | Inherited | COMBINING COMMA ABOVE | [100] | Proposed:Technical | |
U+0314 | ̔ | Inherited | COMBINING REVERSED COMMA ABOVE | [100] | Proposed:Technical | |
U+0324 | ̤ | Inherited | COMBINING DIAERESIS BELOW | [100] | Proposed:Technical | |
U+0325 | ̥ | Inherited | COMBINING RING BELOW | [100] | Proposed:Technical | |
U+032D | ̭ | Inherited | COMBINING CIRCUMFLEX ACCENT BELOW | [100] | Proposed:Technical | |
U+032E | ̮ | Inherited | COMBINING BREVE BELOW | [100] | Proposed:Technical | |
U+0330 | ̰ | Inherited | COMBINING TILDE BELOW | [100] | Proposed:Technical | |
U+0335 | ̵ | Inherited | COMBINING SHORT STROKE OVERLAY | [100] | Proposed:Technical | |
U+0338 | ̸ | Inherited | COMBINING LONG SOLIDUS OVERLAY | [100] | Proposed:Technical | |
U+0339 | ̹ | Inherited | COMBINING RIGHT HALF RING BELOW | [100] | Proposed:Technical | |
U+0342 | ͂ | Inherited | COMBINING GREEK PERISPOMENI | [100] | Proposed:Technical | |
U+0345 | ͅ | Inherited | COMBINING GREEK YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+0375 | ͵ | Greek | GREEK LOWER NUMERAL SIGN | [100] | Proposed:Technical | IDNA2008 CONTEXTO, used to mark letters as numeric |
U+037B | ͻ | Greek | GREEK SMALL REVERSED LUNATE SIGMA SYMBOL | [100] | Proposed:Obsolete | |
U+037C | ͼ | Greek | GREEK SMALL DOTTED LUNATE SIGMA SYMBOL | [100] | Proposed:Obsolete | |
U+037D | ͽ | Greek | GREEK SMALL REVERSED DOTTED LUNATE SIGMA SYMBOL | [100] | Proposed:Obsolete | |
U+03FC | ϼ | Greek | GREEK RHO WITH STROKE SYMBOL | [100] | Proposed:Obsolete, Proposed:Technical | |
U+03FD | Ͻ | Greek | GREEK CAPITAL REVERSED LUNATE SIGMA SYMBOL | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+03FE | Ͼ | Greek | GREEK CAPITAL DOTTED LUNATE SIGMA SYMBOL | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+03FF | Ͽ | Greek | GREEK CAPITAL REVERSED DOTTED LUNATE SIGMA SYMBOL | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+048A | Ҋ | Cyrillic | CYRILLIC CAPITAL LETTER SHORT I WITH TAIL | Proposed:Uncommon_Use | Not IDNA2008; Uppercase | |
U+048B | ҋ | Cyrillic | CYRILLIC SMALL LETTER SHORT I WITH TAIL | [100] | Proposed:Uncommon_Use | nearly extinct |
U+048C | Ҍ | Cyrillic | CYRILLIC CAPITAL LETTER SEMISOFT SIGN | Proposed:Uncommon_Use | Not IDNA2008; Uppercase | |
U+048D | ҍ | Cyrillic | CYRILLIC SMALL LETTER SEMISOFT SIGN | [100] | Proposed:Uncommon_Use | nearly extinct |
U+048E | Ҏ | Cyrillic | CYRILLIC CAPITAL LETTER ER WITH TICK | Proposed:Uncommon_Use | Not IDNA2008; Uppercase | |
U+048F | ҏ | Cyrillic | CYRILLIC SMALL LETTER ER WITH TICK | [100] | Proposed:Uncommon_Use | nearly extinct |
U+049C | Ҝ | Cyrillic | CYRILLIC CAPITAL LETTER KA WITH VERTICAL STROKE | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+049D | ҝ | Cyrillic | CYRILLIC SMALL LETTER KA WITH VERTICAL STROKE | [100] | Proposed:Obsolete | Azerbaijani |
U+04A6 | Ҧ | Cyrillic | CYRILLIC CAPITAL LETTER PE WITH MIDDLE HOOK | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+04A7 | ҧ | Cyrillic | CYRILLIC SMALL LETTER PE WITH MIDDLE HOOK | [100] | Proposed:Obsolete | Abkhazian |
U+04B8 | Ҹ | Cyrillic | CYRILLIC CAPITAL LETTER CHE WITH VERTICAL STROKE | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+04B9 | ҹ | Cyrillic | CYRILLIC SMALL LETTER CHE WITH VERTICAL STROKE | [100] | Proposed:Obsolete | Azerbaijani |
U+04C3 | Ӄ | Cyrillic | CYRILLIC CAPITAL LETTER KA WITH HOOK | Proposed:Uncommon_Use | Not IDNA2008; Uppercase | |
U+04C4 | ӄ | Cyrillic | CYRILLIC SMALL LETTER KA WITH HOOK | [100] | Proposed:Uncommon_Use | threatened |
U+04C5 | Ӆ | Cyrillic | CYRILLIC CAPITAL LETTER EL WITH TAIL | Proposed:Uncommon_Use | Not IDNA2008; Uppercase | |
U+04C6 | ӆ | Cyrillic | CYRILLIC SMALL LETTER EL WITH TAIL | [100] | Proposed:Uncommon_Use | nearly extinct |
U+04C7 | Ӈ | Cyrillic | CYRILLIC CAPITAL LETTER EN WITH HOOK | Proposed:Uncommon_Use | Not IDNA2008; Uppercase | |
U+04C8 | ӈ | Cyrillic | CYRILLIC SMALL LETTER EN WITH HOOK | [100] | Proposed:Uncommon_Use | threatened |
U+04C9 | Ӊ | Cyrillic | CYRILLIC CAPITAL LETTER EN WITH TAIL | Proposed:Uncommon_Use | Not IDNA2008; Uppercase | |
U+04CA | ӊ | Cyrillic | CYRILLIC SMALL LETTER EN WITH TAIL | [100] | Proposed:Uncommon_Use | nearly extinct |
U+04CD | Ӎ | Cyrillic | CYRILLIC CAPITAL LETTER EM WITH TAIL | Proposed:Uncommon_Use | Not IDNA2008; Uppercase | |
U+04CE | ӎ | Cyrillic | CYRILLIC SMALL LETTER EM WITH TAIL | [100] | Proposed:Uncommon_Use | nearly extinct |
U+04F6 | Ӷ | Cyrillic | CYRILLIC CAPITAL LETTER GHE WITH DESCENDER | Proposed:Uncommon_Use | Not IDNA2008; Uppercase | |
U+04F7 | ӷ | Cyrillic | CYRILLIC SMALL LETTER GHE WITH DESCENDER | [100] | Proposed:Uncommon_Use | educational |
U+04FA | Ӻ | Cyrillic | CYRILLIC CAPITAL LETTER GHE WITH STROKE AND HOOK | Proposed:Uncommon_Use | Not IDNA2008; Uppercase | |
U+04FB | ӻ | Cyrillic | CYRILLIC SMALL LETTER GHE WITH STROKE AND HOOK | [100] | Proposed:Uncommon_Use | nearly extinct |
U+04FC | Ӽ | Cyrillic | CYRILLIC CAPITAL LETTER HA WITH HOOK | Proposed:Uncommon_Use | Not IDNA2008; Uppercase | |
U+04FD | ӽ | Cyrillic | CYRILLIC SMALL LETTER HA WITH HOOK | [100] | Proposed:Uncommon_Use | nearly extinct |
U+04FE | Ӿ | Cyrillic | CYRILLIC CAPITAL LETTER HA WITH STROKE | Proposed:Uncommon_Use | Not IDNA2008; Uppercase | |
U+04FF | ӿ | Cyrillic | CYRILLIC SMALL LETTER HA WITH STROKE | [100] | Proposed:Uncommon_Use | nearly extinct |
U+0510 | Ԑ | Cyrillic | CYRILLIC CAPITAL LETTER REVERSED ZE | Proposed:Uncommon_Use | Not IDNA2008; Uppercase | |
U+0511 | ԑ | Cyrillic | CYRILLIC SMALL LETTER REVERSED ZE | [100] | Proposed:Uncommon_Use | threatened |
U+0512 | Ԓ | Cyrillic | CYRILLIC CAPITAL LETTER EL WITH HOOK | Proposed:Uncommon_Use | Not IDNA2008; Uppercase | |
U+0513 | ԓ | Cyrillic | CYRILLIC SMALL LETTER EL WITH HOOK | [100] | Proposed:Uncommon_Use | threatened |
U+0514 | Ԕ | Cyrillic | CYRILLIC CAPITAL LETTER LHA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+0515 | ԕ | Cyrillic | CYRILLIC SMALL LETTER LHA | [100] | Proposed:Obsolete | Mordvin |
U+0516 | Ԗ | Cyrillic | CYRILLIC CAPITAL LETTER RHA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+0517 | ԗ | Cyrillic | CYRILLIC SMALL LETTER RHA | [100] | Proposed:Obsolete | Mordvin |
U+0518 | Ԙ | Cyrillic | CYRILLIC CAPITAL LETTER YAE | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+0519 | ԙ | Cyrillic | CYRILLIC SMALL LETTER YAE | [100] | Proposed:Obsolete | Mordvin |
U+051A | Ԛ | Cyrillic | CYRILLIC CAPITAL LETTER QA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+051B | ԛ | Cyrillic | CYRILLIC SMALL LETTER QA | [100] | Proposed:Obsolete | Abkhaz, Kurdish |
U+051C | Ԝ | Cyrillic | CYRILLIC CAPITAL LETTER WE | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+051D | ԝ | Cyrillic | CYRILLIC SMALL LETTER WE | [100] | Proposed:Obsolete | Abkhaz, Kurdish |
U+051E | Ԟ | Cyrillic | CYRILLIC CAPITAL LETTER ALEUT KA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+051F | ԟ | Cyrillic | CYRILLIC SMALL LETTER ALEUT KA | [100] | Proposed:Obsolete | Aleut |
U+0520 | Ԡ | Cyrillic | CYRILLIC CAPITAL LETTER EL WITH MIDDLE HOOK | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+0521 | ԡ | Cyrillic | CYRILLIC SMALL LETTER EL WITH MIDDLE HOOK | [100] | Proposed:Obsolete | Abkhaz, Chuvash |
U+0522 | Ԣ | Cyrillic | CYRILLIC CAPITAL LETTER EN WITH MIDDLE HOOK | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+0523 | ԣ | Cyrillic | CYRILLIC SMALL LETTER EN WITH MIDDLE HOOK | [100] | Proposed:Obsolete | Chuvash |
U+0526 | Ԧ | Cyrillic | CYRILLIC CAPITAL LETTER SHHA WITH DESCENDER | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+0527 | ԧ | Cyrillic | CYRILLIC SMALL LETTER SHHA WITH DESCENDER | [100] | Proposed:Obsolete | Azerbaijani |
U+0528 | Ԩ | Cyrillic | CYRILLIC CAPITAL LETTER EN WITH LEFT HOOK | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+0529 | ԩ | Cyrillic | CYRILLIC SMALL LETTER EN WITH LEFT HOOK | [100] | Proposed:Obsolete | (Orok) (7.0) |
U+052E | Ԯ | Cyrillic | CYRILLIC CAPITAL LETTER EL WITH DESCENDER | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+052F | ԯ | Cyrillic | CYRILLIC SMALL LETTER EL WITH DESCENDER | [100] | Proposed:Obsolete | (Khanty, Nenets) (7.0) |
U+0559 | ՙ | Armenian | ARMENIAN MODIFIER LETTER LEFT HALF RING | [100] | Proposed:Technical | |
U+058A | ֊ | Armenian | ARMENIAN HYPHEN | Inclusion | NOT IDNA2008 | |
U+05EF | ׯ | Hebrew | HEBREW YOD TRIANGLE | [100] | Proposed:Uncommon_Use | 11.0 16/305 |
U+05F3 | ׳ | Hebrew | HEBREW PUNCTUATION GERESH | [100] | Inclusion | IDNA2008 CONTEXTO |
U+05F4 | ״ | Hebrew | HEBREW PUNCTUATION GERSHAYIM | [100] | Inclusion | IDNA2008 CONTEXTO |
U+063B | ػ | Arabic | ARABIC LETTER KEHEH WITH TWO DOTS ABOVE | [100] | Proposed:Obsolete | historic |
U+063C | ؼ | Arabic | ARABIC LETTER KEHEH WITH THREE DOTS BELOW | [100] | Proposed:Obsolete | historic |
U+063D | ؽ | Arabic | ARABIC LETTER FARSI YEH WITH INVERTED V | [100], [AZ] | Recommended | Azerbaijani |
U+063E | ؾ | Arabic | ARABIC LETTER FARSI YEH WITH TWO DOTS ABOVE | [100] | Proposed:Obsolete | |
U+063F | ؿ | Arabic | ARABIC LETTER FARSI YEH WITH THREE DOTS ABOVE | [100] | Proposed:Obsolete | historic |
U+0653 | ٓ | Inherited | ARABIC MADDAH ABOVE | [100] | Proposed:Technical | |
U+06AC | ڬ | Arabic | ARABIC LETTER KAF WITH DOT ABOVE | [100] | Proposed:Obsolete | (old Malay-Jawi)*use 0762 instead |
U+06E5 | ۥ | Arabic | ARABIC SMALL WAW | [100] | Proposed:Technical | religious annotation |
U+06E6 | ۦ | Arabic | ARABIC SMALL YEH | [100] | Proposed:Technical | religious annotation |
U+06FD | ۽ | Arabic | ARABIC SIGN SINDHI AMPERSAND | [100] | Inclusion | |
U+06FE | ۾ | Arabic | ARABIC SIGN SINDHI POSTPOSITION MEN | [100] | Inclusion | |
U+077E | ݾ | Arabic | ARABIC LETTER SEEN WITH INVERTED V | [100] | Proposed:Obsolete | early Persian |
U+077F | ݿ | Arabic | ARABIC LETTER KAF WITH TWO DOTS ABOVE | [100] | Proposed:Obsolete | early Persian |
U+0870 | ࡰ | Arabic | ARABIC LETTER ALEF WITH ATTACHED FATHA | Proposed:Technical | 14.0 religious use | |
U+0871 | ࡱ | Arabic | ARABIC LETTER ALEF WITH ATTACHED TOP RIGHT FATHA | Proposed:Technical | 14.0 religious use | |
U+0872 | ࡲ | Arabic | ARABIC LETTER ALEF WITH RIGHT MIDDLE STROKE | Proposed:Technical | 14.0 religious use | |
U+0873 | ࡳ | Arabic | ARABIC LETTER ALEF WITH LEFT MIDDLE STROKE | Proposed:Technical | 14.0 religious use | |
U+0874 | ࡴ | Arabic | ARABIC LETTER ALEF WITH ATTACHED KASRA | Proposed:Technical | 14.0 religious use | |
U+0875 | ࡵ | Arabic | ARABIC LETTER ALEF WITH ATTACHED BOTTOM RIGHT KASRA | Proposed:Technical | 14.0 religious use | |
U+0876 | ࡶ | Arabic | ARABIC LETTER ALEF WITH ATTACHED ROUND DOT ABOVE | Proposed:Technical | 14.0 religious use | |
U+0877 | ࡷ | Arabic | ARABIC LETTER ALEF WITH ATTACHED RIGHT ROUND DOT | Proposed:Technical | 14.0 religious use | |
U+0878 | ࡸ | Arabic | ARABIC LETTER ALEF WITH ATTACHED LEFT ROUND DOT | Proposed:Technical | 14.0 religious use | |
U+0879 | ࡹ | Arabic | ARABIC LETTER ALEF WITH ATTACHED ROUND DOT BELOW | Proposed:Technical | 14.0 religious use | |
U+087A | ࡺ | Arabic | ARABIC LETTER ALEF WITH DOT ABOVE | Proposed:Technical | 14.0 religious use | |
U+087B | ࡻ | Arabic | ARABIC LETTER ALEF WITH ATTACHED TOP RIGHT FATHA AND DOT ABOVE | Proposed:Technical | 14.0 religious use | |
U+087C | ࡼ | Arabic | ARABIC LETTER ALEF WITH RIGHT MIDDLE STROKE AND DOT ABOVE | Proposed:Technical | 14.0 religious use | |
U+087D | ࡽ | Arabic | ARABIC LETTER ALEF WITH ATTACHED BOTTOM RIGHT KASRA AND DOT ABOVE | Proposed:Technical | 14.0 religious use | |
U+087E | ࡾ | Arabic | ARABIC LETTER ALEF WITH ATTACHED TOP RIGHT FATHA AND LEFT RING | Proposed:Technical | 14.0 religious use | |
U+087F | ࡿ | Arabic | ARABIC LETTER ALEF WITH RIGHT MIDDLE STROKE AND LEFT RING | Proposed:Technical | 14.0 religious use | |
U+0880 | ࢀ | Arabic | ARABIC LETTER ALEF WITH ATTACHED BOTTOM RIGHT KASRA AND LEFT RING | Proposed:Technical | 14.0 religious use | |
U+0881 | ࢁ | Arabic | ARABIC LETTER ALEF WITH ATTACHED RIGHT HAMZA | Proposed:Technical | 14.0 religious use | |
U+0882 | ࢂ | Arabic | ARABIC LETTER ALEF WITH ATTACHED LEFT HAMZA | Proposed:Technical | 14.0 religious use | |
U+0883 | ࢃ | Arabic | ARABIC TATWEEL WITH OVERSTRUCK HAMZA | Proposed:Technical | 14.0 religious use | |
U+0884 | ࢄ | Arabic | ARABIC TATWEEL WITH OVERSTRUCK WAW | Proposed:Technical | 14.0 religious use | |
U+0885 | ࢅ | Arabic | ARABIC TATWEEL WITH TWO DOTS BELOW | Proposed:Technical | 14.0 religious use | |
U+0886 | ࢆ | Arabic | ARABIC LETTER THIN YEH | Proposed:Technical | 14.0 religious use | |
U+0887 | ࢇ | Arabic | ARABIC BASELINE ROUND DOT | Proposed:Technical | 14.0 religious use | |
U+0888 | ࢈ | Arabic | ARABIC RAISED ROUND DOT | Proposed:Technical | 14.0 religious use | |
U+0889 | ࢉ | Arabic | ARABIC LETTER NOON WITH INVERTED SMALL V | Proposed:Uncommon_Use | ||
U+088A | ࢊ | Arabic | ARABIC LETTER HAH WITH INVERTED SMALL V BELOW | Proposed:Uncommon_Use | Bosnian | |
U+088B | ࢋ | Arabic | ARABIC LETTER TAH WITH DOT BELOW | Proposed:Uncommon_Use | Pegon | |
U+088C | ࢌ | Arabic | ARABIC LETTER TAH WITH THREE DOTS BELOW | Proposed:Uncommon_Use | Pegon | |
U+088D | ࢍ | Arabic | ARABIC LETTER KEHEH WITH TWO DOTS VERTICALLY BELOW | Proposed:Uncommon_Use | Pegon | |
U+088E | ࢎ | Arabic | ARABIC VERTICAL TAIL | Proposed:Uncommon_Use | historic abbreviation mark | |
U+08B2 | ࢲ | Arabic | ARABIC LETTER ZAIN WITH INVERTED V ABOVE | [100] | Proposed:Uncommon_Use | (Berber) (7.0) |
U+08B5 | ࢵ | Arabic | ARABIC LETTER QAF WITH DOT BELOW AND NO DOTS ABOVE | Proposed:Obsolete | early Arabic | |
U+08B6 | ࢶ | Arabic | ARABIC LETTER BEH WITH SMALL MEEM ABOVE | [100] | Proposed:Uncommon_Use | (Bravanese) (9.0 13/178) |
U+08B7 | ࢷ | Arabic | ARABIC LETTER PEH WITH SMALL MEEM ABOVE | [100] | Proposed:Uncommon_Use | (Bravanese) (9.0 13/178) |
U+08B8 | ࢸ | Arabic | ARABIC LETTER TEH WITH SMALL TEH ABOVE | [100] | Proposed:Uncommon_Use | (Bravanese) (9.0 13/178) |
U+08B9 | ࢹ | Arabic | ARABIC LETTER REH WITH SMALL NOON ABOVE | [100] | Proposed:Uncommon_Use | (Bravanese) (9.0 13/178) |
U+08BA | ࢺ | Arabic | ARABIC LETTER YEH WITH TWO DOTS BELOW AND SMALL NOON ABOVE | [100] | Proposed:Uncommon_Use | (Bravanese) (9.0 13/178) |
U+08C3 | ࣃ | Arabic | ARABIC LETTER GHAIN WITH THREE DOTS ABOVE | Proposed:Uncommon_Use | ||
U+08C4 | ࣄ | Arabic | ARABIC LETTER AFRICAN QAF WITH THREE DOTS ABOVE | Proposed:Uncommon_Use | ||
U+08C5 | ࣅ | Arabic | ARABIC LETTER JEEM WITH THREE DOTS ABOVE | Proposed:Uncommon_Use | ||
U+08C6 | ࣆ | Arabic | ARABIC LETTER JEEM WITH THREE DOTS BELOW | Proposed:Uncommon_Use | ||
U+08C7 | ࣇ | Arabic | ARABIC LETTER LAM WITH SMALL ARABIC LETTER TAH ABOVE | Proposed:Uncommon_Use | Punjabi 19/112 | |
U+08C8 | ࣈ | Arabic | ARABIC LETTER GRAF | Proposed:Uncommon_Use | Balti | |
U+093D | ऽ | Devanagari | DEVANAGARI SIGN AVAGRAHA | [100] | Proposed:Obsolete | Sanskrit |
U+0950 | ॐ | Devanagari | DEVANAGARI OM | [100] | Proposed:Technical | |
U+0960 | ॠ | Devanagari | DEVANAGARI LETTER VOCALIC RR | [100] | Proposed:Obsolete | Sanskrit |
U+0961 | ॡ | Devanagari | DEVANAGARI LETTER VOCALIC LL | [100] | Proposed:Obsolete | Sanskrit |
U+0962 | ॢ | Devanagari | DEVANAGARI VOWEL SIGN VOCALIC L | [100] | Proposed:Obsolete | Sanskrit |
U+0963 | ॣ | Devanagari | DEVANAGARI VOWEL SIGN VOCALIC LL | [100] | Proposed:Obsolete | Sanskrit |
U+0971 | ॱ | Devanagari | DEVANAGARI SIGN HIGH SPACING DOT | [100] | Proposed:Obsolete | Sanskrit |
U+097D | ॽ | Devanagari | DEVANAGARI LETTER GLOTTAL STOP | [100] | Proposed:Technical | Limbu |
U+09BD | ঽ | Bengali | BENGALI SIGN AVAGRAHA | [100] | Proposed:Obsolete | Sanskrit |
U+09E0 | ৠ | Bengali | BENGALI LETTER VOCALIC RR | [100] | Proposed:Obsolete | Sanskrit |
U+09E1 | ৡ | Bengali | BENGALI LETTER VOCALIC LL | [100] | Proposed:Obsolete | Sanskrit |
U+09E2 | ৢ | Bengali | BENGALI VOWEL SIGN VOCALIC L | [100] | Proposed:Obsolete | Sanskrit |
U+09E3 | ৣ | Bengali | BENGALI VOWEL SIGN VOCALIC LL | [100] | Proposed:Obsolete | Sanskrit |
U+09FE | ৾ | Bengali | BENGALI SANDHI MARK | [100] | Proposed:Uncommon_Use | (Sanskrit) (11.0 16/322) |
U+0A01 | ਁ | Gurmukhi | GURMUKHI SIGN ADAK BINDI | [100] | Proposed:Uncommon_Use | |
U+0A74 | ੴ | Gurmukhi | GURMUKHI EK ONKAR | [100] | Proposed:Technical | |
U+0ABD | ઽ | Gujarati | GUJARATI SIGN AVAGRAHA | [100] | Proposed:Obsolete | Sanskrit |
U+0AD0 | ૐ | Gujarati | GUJARATI OM | [100] | Proposed:Technical | |
U+0AE0 | ૠ | Gujarati | GUJARATI LETTER VOCALIC RR | [100] | Proposed:Obsolete | Sanskrit |
U+0AE1 | ૡ | Gujarati | GUJARATI LETTER VOCALIC LL | [100] | Proposed:Obsolete | Sanskrit |
U+0AE2 | ૢ | Gujarati | GUJARATI VOWEL SIGN VOCALIC L | [100] | Proposed:Obsolete | Sanskrit |
U+0AE3 | ૣ | Gujarati | GUJARATI VOWEL SIGN VOCALIC LL | [100] | Proposed:Obsolete | Sanskrit |
U+0AFA | ૺ | Gujarati | GUJARATI SIGN SUKUN | [100] | Proposed:Uncommon_Use | (Arabic transliteration) (10.0 13/143) |
U+0AFB | ૻ | Gujarati | GUJARATI SIGN SHADDA | [100] | Proposed:Uncommon_Use | (Arabic transliteration) (10.0 13/143) |
U+0AFC | ૼ | Gujarati | GUJARATI SIGN MADDAH | [100] | Proposed:Uncommon_Use | (Arabic transliteration) (10.0 13/143) |
U+0AFD | ૽ | Gujarati | GUJARATI SIGN THREE-DOT NUKTA ABOVE | [100] | Proposed:Uncommon_Use | (Arabic transliteration) (10.0 13/143) |
U+0AFE | ૾ | Gujarati | GUJARATI SIGN CIRCLE NUKTA ABOVE | [100] | Proposed:Uncommon_Use | (Arabic transliteration) (10.0 13/143) |
U+0AFF | ૿ | Gujarati | GUJARATI SIGN TWO-CIRCLE NUKTA ABOVE | [100] | Proposed:Uncommon_Use | (Arabic transliteration) (10.0 13/143) |
U+0B3D | ଽ | Oriya | ORIYA SIGN AVAGRAHA | [100] | Proposed:Obsolete | Sanskrit |
U+0B55 | ୕ | Oriya | ORIYA SIGN OVERLINE | Proposed:Uncommon_Use | Kuvi 19/005 | |
U+0B60 | ୠ | Oriya | ORIYA LETTER VOCALIC RR | [100] | Proposed:Obsolete | Sanskrit |
U+0B61 | ୡ | Oriya | ORIYA LETTER VOCALIC LL | [100] | Proposed:Obsolete | Sanskrit |
U+0B82 | ஂ | Tamil | TAMIL SIGN ANUSVARA | [100] | Proposed:Technical | not used in Tamil |
U+0BD0 | ௐ | Tamil | TAMIL OM | [100] | Proposed:Technical | |
U+0C01 | ఁ | Telugu | TELUGU SIGN CANDRABINDU | [100] | Proposed:Uncommon_Use | |
U+0C04 | ఄ | Telugu | TELUGU SIGN COMBINING ANUSVARA ABOVE | [100] | Proposed:Uncommon_Use | (Prakrit) (11.0 16/285) |
U+0C3C | ఼ | Telugu | TELUGU SIGN NUKTA | Proposed:Uncommon_Use | 19/401 | |
U+0C3D | ఽ | Telugu | TELUGU SIGN AVAGRAHA | [100] | Proposed:Obsolete | Sanskrit |
U+0C5D | ౝ | Telugu | TELUGU LETTER NAKAARA POLLU | Proposed:Uncommon_Use | 11/409 | |
U+0C60 | ౠ | Telugu | TELUGU LETTER VOCALIC RR | [100] | Proposed:Obsolete | Sanskrit |
U+0C61 | ౡ | Telugu | TELUGU LETTER VOCALIC LL | [100] | Proposed:Obsolete | Sanskrit |
U+0C80 | ಀ | Kannada | KANNADA SIGN SPACING CANDRABINDU | [100] | Proposed:Uncommon_Use | (Badaga) (9.0 14/153) |
U+0CBD | ಽ | Kannada | KANNADA SIGN AVAGRAHA | [100] | Proposed:Obsolete | Sanskrit |
U+0CDD | ೝ | Kannada | KANNADA LETTER NAKAARA POLLU | Proposed:Uncommon_Use | 13/228 | |
U+0CE0 | ೠ | Kannada | KANNADA LETTER VOCALIC RR | [100] | Proposed:Obsolete | Sanskrit |
U+0CE1 | ೡ | Kannada | KANNADA LETTER VOCALIC LL | [100] | Proposed:Obsolete | Sanskrit |
U+0CE2 | ೢ | Kannada | KANNADA VOWEL SIGN VOCALIC L | [100] | Proposed:Obsolete | Sanskrit |
U+0CE3 | ೣ | Kannada | KANNADA VOWEL SIGN VOCALIC LL | [100] | Proposed:Obsolete | Sanskrit |
U+0CF1 | ೱ | Kannada | KANNADA SIGN JIHVAMULIYA | [100] | Proposed:Obsolete | Sankrit |
U+0CF2 | ೲ | Kannada | KANNADA SIGN UPADHMANIYA | [100] | Proposed:Obsolete | Sankrit |
U+0CF3 | ೳ | Kannada | KANNADA SIGN COMBINING ANUSVARA ABOVE RIGHT | Proposed:Uncommon_Use | ||
U+0D00 | ഀ | Malayalam | MALAYALAM SIGN COMBINING ANUSVARA ABOVE | [100] | Proposed:Uncommon_Use | (Prakrit) (10.0 14/003) |
U+0D3A | ഺ | Malayalam | MALAYALAM LETTER TTTA | [100] | Proposed:Obsolete | Historic |
U+0D3D | ഽ | Malayalam | MALAYALAM SIGN AVAGRAHA | [100] | Proposed:Obsolete | Sanskrit |
U+0D4C | ൌ | Malayalam | MALAYALAM VOWEL SIGN AU | [100] | Proposed:Obsolete | Archaic |
U+0D4E | ൎ | Malayalam | MALAYALAM LETTER DOT REPH | [100] | Proposed:Obsolete | Historic |
U+0D54 | ൔ | Malayalam | MALAYALAM LETTER CHILLU M | [100] | Proposed:Uncommon_Use | (Chillu) (9.0 14/013) |
U+0D55 | ൕ | Malayalam | MALAYALAM LETTER CHILLU Y | [100] | Proposed:Uncommon_Use | (Chillu) (9.0 14/013) |
U+0D56 | ൖ | Malayalam | MALAYALAM LETTER CHILLU LLL | [100] | Proposed:Uncommon_Use | (Chillu) (9.0 14/013) |
U+0D60 | ൠ | Malayalam | MALAYALAM LETTER VOCALIC RR | [100] | Proposed:Obsolete | Sanskrit |
U+0D61 | ൡ | Malayalam | MALAYALAM LETTER VOCALIC LL | [100] | Proposed:Obsolete | Sanskrit |
U+0E86 | ຆ | Lao | LAO LETTER PALI GHA | [100] | Proposed:Uncommon_Use | Pali/Sanskrit |
U+0E89 | ຉ | Lao | LAO LETTER PALI CHA | [100] | Proposed:Uncommon_Use | Pali/Sanskrit |
U+0E8C | ຌ | Lao | LAO LETTER PALI JHA | [100] | Proposed:Uncommon_Use | Pali/Sanskrit |
U+0E8E | ຎ | Lao | LAO LETTER PALI NYA | [100] | Proposed:Uncommon_Use | (Pali/Sanskrit) (12.0 17/106) |
U+0E8F | ຏ | Lao | LAO LETTER PALI TTA | [100] | Proposed:Uncommon_Use | (Pali/Sanskrit) (12.0 17/106) |
U+0E90 | ຐ | Lao | LAO LETTER PALI TTHA | [100] | Proposed:Uncommon_Use | (Pali/Sanskrit) (12.0 17/106) |
U+0E91 | ຑ | Lao | LAO LETTER PALI DDA | [100] | Proposed:Uncommon_Use | (Pali/Sanskrit) (12.0 17/106) |
U+0E92 | ຒ | Lao | LAO LETTER PALI DDHA | [100] | Proposed:Uncommon_Use | (Pali/Sanskrit) (12.0 17/106) |
U+0E93 | ຓ | Lao | LAO LETTER PALI NNA | [100] | Proposed:Uncommon_Use | (Pali/Sanskrit) (12.0 17/106) |
U+0E98 | ຘ | Lao | LAO LETTER PALI DHA | [100] | Proposed:Uncommon_Use | Pali/Sanskrit |
U+0EA0 | ຠ | Lao | LAO LETTER PALI BHA | [100] | Proposed:Uncommon_Use | Pali/Sanskrit |
U+0EA8 | ຨ | Lao | LAO LETTER SANSKRIT SHA | [100] | Proposed:Uncommon_Use | (Pali/Sanskrit) (12.0 17/106) |
U+0EA9 | ຩ | Lao | LAO LETTER SANSKRIT SSA | [100] | Proposed:Uncommon_Use | (Pali/Sanskrit) (12.0 17/106) |
U+0EAC | ຬ | Lao | LAO LETTER PALI LLA | [100] | Proposed:Uncommon_Use | Pali/Sanskrit |
U+0EAF | ຯ | Lao | LAO ELLIPSIS | [100] | Proposed:Technical | |
U+0EBA | ຺ | Lao | LAO SIGN PALI VIRAMA | [100] | Proposed:Uncommon_Use | Pali/Sanskrit |
U+0ECE | ໎ | Lao | LAO YAMAKKAN | Proposed:Uncommon_Use | Pali | |
U+0F00 | ༀ | Tibetan | TIBETAN SYLLABLE OM | [100] | Proposed:Technical | |
U+0F35 | ༵ | Tibetan | TIBETAN MARK NGAS BZUNG NYI ZLA | [100] | Proposed:Technical | honorific, emphasis |
U+0F37 | ༷ | Tibetan | TIBETAN MARK NGAS BZUNG SGOR RTAGS | [100] | Proposed:Technical | emphasis |
U+0F3E | ༾ | Tibetan | TIBETAN SIGN YAR TSHES | [100] | Proposed:Technical | almanacs |
U+0F3F | ༿ | Tibetan | TIBETAN SIGN MAR TSHES | [100] | Proposed:Technical | almanacs |
U+0F6A | ཪ | Tibetan | TIBETAN LETTER FIXED-FORM RA | [100] | Proposed:Obsolete, Proposed:Uncommon_Use | Sanskrit |
U+0F6B | ཫ | Tibetan | TIBETAN LETTER KKA | [100] | Proposed:Uncommon_Use | Balti |
U+0F6C | ཬ | Tibetan | TIBETAN LETTER RRA | [100] | Proposed:Uncommon_Use | Balti |
U+0F7B | ཻ | Tibetan | TIBETAN VOWEL SIGN EE | [100] | Proposed:Exclusion | homoglyph (digraph of 0F7A 0F7A) |
U+0F7D | ཽ | Tibetan | TIBETAN VOWEL SIGN OO | [100] | Proposed:Exclusion | homoglyph (digraph of 0F7C 0F7C) |
U+0F82 | ྂ | Tibetan | TIBETAN SIGN NYI ZLA NAA DA | [100] | Proposed:Obsolete, Proposed:Uncommon_Use | Sanskrit |
U+0F83 | ྃ | Tibetan | TIBETAN SIGN SNA LDAN | [100] | Proposed:Obsolete, Proposed:Uncommon_Use | Sanskrit |
U+0F86 | ྆ | Tibetan | TIBETAN SIGN LCI RTAGS | [100] | Proposed:Obsolete | historic |
U+0F87 | ྇ | Tibetan | TIBETAN SIGN YANG RTAGS | [100] | Proposed:Obsolete | historic |
U+0F88 | ྈ | Tibetan | TIBETAN SIGN LCE TSA CAN | [100] | Proposed:Obsolete | historic |
U+0F89 | ྉ | Tibetan | TIBETAN SIGN MCHU CAN | [100] | Proposed:Obsolete | historic |
U+0F8A | ྊ | Tibetan | TIBETAN SIGN GRU CAN RGYINGS | [100] | Proposed:Obsolete | historic |
U+0F8B | ྋ | Tibetan | TIBETAN SIGN GRU MED RGYINGS | [100] | Proposed:Obsolete | historic |
U+0F8C | ྌ | Tibetan | TIBETAN SIGN INVERTED MCHU CAN | [100] | Proposed:Obsolete | historic |
U+0F8D | ྍ | Tibetan | TIBETAN SUBJOINED SIGN LCE TSA CAN | [100] | Proposed:Obsolete | historic |
U+0F8E | ྎ | Tibetan | TIBETAN SUBJOINED SIGN MCHU CAN | [100] | Proposed:Obsolete | historic |
U+0F8F | ྏ | Tibetan | TIBETAN SUBJOINED SIGN INVERTED MCHU CAN | [100] | Proposed:Obsolete | historic |
U+0FAE | ྮ | Tibetan | TIBETAN SUBJOINED LETTER ZHA | [100] | Proposed:Uncommon_Use | |
U+0FAF | ྯ | Tibetan | TIBETAN SUBJOINED LETTER ZA | [100] | Proposed:Uncommon_Use | |
U+0FB0 | ྰ | Tibetan | TIBETAN SUBJOINED LETTER -A | [100] | Proposed:Uncommon_Use | |
U+0FC6 | ࿆ | Tibetan | TIBETAN SYMBOL PADMA GDAN | [100] | Proposed:Technical, Proposed:Uncommon_Use | |
U+1050 | ၐ | Myanmar | MYANMAR LETTER SHA | [100] | Proposed:Obsolete, Proposed:Uncommon_Use | (Pali) (Sanskrit) |
U+1051 | ၑ | Myanmar | MYANMAR LETTER SSA | [100] | Proposed:Obsolete, Proposed:Uncommon_Use | (Pali) (Sanskrit) |
U+1052 | ၒ | Myanmar | MYANMAR LETTER VOCALIC R | [100] | Proposed:Obsolete, Proposed:Uncommon_Use | (Pali) (Sanskrit) |
U+1053 | ၓ | Myanmar | MYANMAR LETTER VOCALIC RR | [100] | Proposed:Obsolete, Proposed:Uncommon_Use | (Pali) (Sanskrit) |
U+1054 | ၔ | Myanmar | MYANMAR LETTER VOCALIC L | [100] | Proposed:Obsolete, Proposed:Uncommon_Use | (Pali) (Sanskrit) |
U+1055 | ၕ | Myanmar | MYANMAR LETTER VOCALIC LL | [100] | Proposed:Obsolete, Proposed:Uncommon_Use | (Pali) (Sanskrit) |
U+1056 | ၖ | Myanmar | MYANMAR VOWEL SIGN VOCALIC R | [100] | Proposed:Obsolete, Proposed:Uncommon_Use | (Pali) (Sanskrit) |
U+1057 | ၗ | Myanmar | MYANMAR VOWEL SIGN VOCALIC RR | [100] | Proposed:Obsolete, Proposed:Uncommon_Use | (Pali) (Sanskrit) |
U+1058 | ၘ | Myanmar | MYANMAR VOWEL SIGN VOCALIC L | [100] | Proposed:Obsolete, Proposed:Uncommon_Use | (Pali) (Sanskrit) |
U+1059 | ၙ | Myanmar | MYANMAR VOWEL SIGN VOCALIC LL | [100] | Proposed:Obsolete, Proposed:Uncommon_Use | (Pali) (Sanskrit) |
U+1065 | ၥ | Myanmar | MYANMAR LETTER WESTERN PWO KAREN THA | [100] | Proposed:Uncommon_Use | Western Pwo Karen |
U+1066 | ၦ | Myanmar | MYANMAR LETTER WESTERN PWO KAREN PWA | [100] | Proposed:Uncommon_Use | Western Pwo Karen |
U+1067 | ၧ | Myanmar | MYANMAR VOWEL SIGN WESTERN PWO KAREN EU | [100] | Proposed:Uncommon_Use | Western Pwo Karen |
U+1068 | ၨ | Myanmar | MYANMAR VOWEL SIGN WESTERN PWO KAREN UE | [100] | Proposed:Uncommon_Use | Western Pwo Karen |
U+1069 | ၩ | Myanmar | MYANMAR SIGN WESTERN PWO KAREN TONE-1 | [100] | Proposed:Uncommon_Use | Western Pwo Karen |
U+106A | ၪ | Myanmar | MYANMAR SIGN WESTERN PWO KAREN TONE-2 | [100] | Proposed:Uncommon_Use | Western Pwo Karen |
U+106B | ၫ | Myanmar | MYANMAR SIGN WESTERN PWO KAREN TONE-3 | [100] | Proposed:Uncommon_Use | Western Pwo Karen |
U+106C | ၬ | Myanmar | MYANMAR SIGN WESTERN PWO KAREN TONE-4 | [100] | Proposed:Uncommon_Use | Western Pwo Karen |
U+106D | ၭ | Myanmar | MYANMAR SIGN WESTERN PWO KAREN TONE-5 | [100] | Proposed:Uncommon_Use | Western Pwo Karen |
U+106E | ၮ | Myanmar | MYANMAR LETTER EASTERN PWO KAREN NNA | [100] | Proposed:Uncommon_Use | Eastern Pwo Karen |
U+106F | ၯ | Myanmar | MYANMAR LETTER EASTERN PWO KAREN YWA | [100] | Proposed:Uncommon_Use | Eastern Pwo Karen |
U+1070 | ၰ | Myanmar | MYANMAR LETTER EASTERN PWO KAREN GHWA | [100] | Proposed:Uncommon_Use | Eastern Pwo Karen |
U+1071 | ၱ | Myanmar | MYANMAR VOWEL SIGN GEBA KAREN I | [100] | Proposed:Uncommon_Use | Geba Karen |
U+1072 | ၲ | Myanmar | MYANMAR VOWEL SIGN KAYAH OE | [100] | Proposed:Uncommon_Use | Kayah |
U+1073 | ၳ | Myanmar | MYANMAR VOWEL SIGN KAYAH U | [100] | Proposed:Uncommon_Use | Kayah |
U+1074 | ၴ | Myanmar | MYANMAR VOWEL SIGN KAYAH EE | [100] | Proposed:Uncommon_Use | Kayah |
U+108E | ႎ | Myanmar | MYANMAR LETTER RUMAI PALAUNG FA | [100] | Proposed:Uncommon_Use | Rumai Palaung |
U+109A | ႚ | Myanmar | MYANMAR SIGN KHAMTI TONE-1 | [100] | Proposed:Uncommon_Use | Kamti Shan |
U+109B | ႛ | Myanmar | MYANMAR SIGN KHAMTI TONE-3 | [100] | Proposed:Uncommon_Use | Kamti Shan |
U+109C | ႜ | Myanmar | MYANMAR VOWEL SIGN AITON A | [100] | Proposed:Uncommon_Use | Aiton, Phake |
U+109D | ႝ | Myanmar | MYANMAR VOWEL SIGN AITON AI | [100] | Proposed:Uncommon_Use | Aiton, Phake |
U+10F9 | ჹ | Georgian | GEORGIAN LETTER TURNED GAN | [100] | Proposed:Technical, Proposed:Uncommon_Use | educational |
U+10FA | ჺ | Georgian | GEORGIAN LETTER AIN | [100] | Proposed:Technical, Proposed:Uncommon_Use | threatened |
U+10FD | ჽ | Georgian | GEORGIAN LETTER AEN | [100] | Proposed:Uncommon_Use | Ossetian, Abkhaz |
U+10FE | ჾ | Georgian | GEORGIAN LETTER HARD SIGN | [100] | Proposed:Uncommon_Use | Ossetian, Abkhaz |
U+10FF | ჿ | Georgian | GEORGIAN LETTER LABIAL SIGN | [100] | Proposed:Uncommon_Use | Ossetian, Abkhaz |
U+1380 | ᎀ | Ethiopic | ETHIOPIC SYLLABLE SEBATBEIT MWA | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+1381 | ᎁ | Ethiopic | ETHIOPIC SYLLABLE MWI | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+1382 | ᎂ | Ethiopic | ETHIOPIC SYLLABLE MWEE | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+1383 | ᎃ | Ethiopic | ETHIOPIC SYLLABLE MWE | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+1384 | ᎄ | Ethiopic | ETHIOPIC SYLLABLE SEBATBEIT BWA | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+1385 | ᎅ | Ethiopic | ETHIOPIC SYLLABLE BWI | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+1386 | ᎆ | Ethiopic | ETHIOPIC SYLLABLE BWEE | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+1387 | ᎇ | Ethiopic | ETHIOPIC SYLLABLE BWE | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+1388 | ᎈ | Ethiopic | ETHIOPIC SYLLABLE SEBATBEIT FWA | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+1389 | ᎉ | Ethiopic | ETHIOPIC SYLLABLE FWI | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+138A | ᎊ | Ethiopic | ETHIOPIC SYLLABLE FWEE | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+138B | ᎋ | Ethiopic | ETHIOPIC SYLLABLE FWE | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+138C | ᎌ | Ethiopic | ETHIOPIC SYLLABLE SEBATBEIT PWA | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+138D | ᎍ | Ethiopic | ETHIOPIC SYLLABLE PWI | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+138E | ᎎ | Ethiopic | ETHIOPIC SYLLABLE PWEE | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+138F | ᎏ | Ethiopic | ETHIOPIC SYLLABLE PWE | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+17DC | ៜ | Khmer | KHMER SIGN AVAKRAHASANYA | [100] | Proposed:Obsolete | Sanskrit |
U+1E00 | Ḁ | Latin | LATIN CAPITAL LETTER A WITH RING BELOW | Proposed:Technical | Not IDNA2008; Uppercase | |
U+1E01 | ḁ | Latin | LATIN SMALL LETTER A WITH RING BELOW | [100] | Proposed:Technical | |
U+1E18 | Ḙ | Latin | LATIN CAPITAL LETTER E WITH CIRCUMFLEX BELOW | Proposed:Technical | Not IDNA2008; Uppercase | |
U+1E19 | ḙ | Latin | LATIN SMALL LETTER E WITH CIRCUMFLEX BELOW | [100] | Proposed:Technical | |
U+1E1A | Ḛ | Latin | LATIN CAPITAL LETTER E WITH TILDE BELOW | Proposed:Technical | Not IDNA2008; Uppercase | |
U+1E1B | ḛ | Latin | LATIN SMALL LETTER E WITH TILDE BELOW | [100] | Proposed:Technical | |
U+1E2A | Ḫ | Latin | LATIN CAPITAL LETTER H WITH BREVE BELOW | Proposed:Technical | Not IDNA2008; Uppercase | |
U+1E2B | ḫ | Latin | LATIN SMALL LETTER H WITH BREVE BELOW | [100] | Proposed:Technical | Semitic transliteration |
U+1E2C | Ḭ | Latin | LATIN CAPITAL LETTER I WITH TILDE BELOW | Proposed:Technical | Not IDNA2008; Uppercase | |
U+1E2D | ḭ | Latin | LATIN SMALL LETTER I WITH TILDE BELOW | [100] | Proposed:Technical | |
U+1E72 | Ṳ | Latin | LATIN CAPITAL LETTER U WITH DIAERESIS BELOW | Proposed:Technical | Not IDNA2008; Uppercase | |
U+1E73 | ṳ | Latin | LATIN SMALL LETTER U WITH DIAERESIS BELOW | [100] | Proposed:Technical | |
U+1E74 | Ṵ | Latin | LATIN CAPITAL LETTER U WITH TILDE BELOW | Proposed:Technical | Not IDNA2008; Uppercase | |
U+1E75 | ṵ | Latin | LATIN SMALL LETTER U WITH TILDE BELOW | [100] | Proposed:Technical | |
U+1E76 | Ṷ | Latin | LATIN CAPITAL LETTER U WITH CIRCUMFLEX BELOW | Proposed:Technical | Not IDNA2008; Uppercase | |
U+1E77 | ṷ | Latin | LATIN SMALL LETTER U WITH CIRCUMFLEX BELOW | [100] | Proposed:Technical | |
U+1F00 | ἀ | Greek | GREEK SMALL LETTER ALPHA WITH PSILI | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F01 | ἁ | Greek | GREEK SMALL LETTER ALPHA WITH DASIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F02 | ἂ | Greek | GREEK SMALL LETTER ALPHA WITH PSILI AND VARIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F03 | ἃ | Greek | GREEK SMALL LETTER ALPHA WITH DASIA AND VARIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F04 | ἄ | Greek | GREEK SMALL LETTER ALPHA WITH PSILI AND OXIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F05 | ἅ | Greek | GREEK SMALL LETTER ALPHA WITH DASIA AND OXIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F06 | ἆ | Greek | GREEK SMALL LETTER ALPHA WITH PSILI AND PERISPOMENI | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F07 | ἇ | Greek | GREEK SMALL LETTER ALPHA WITH DASIA AND PERISPOMENI | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F08 | Ἀ | Greek | GREEK CAPITAL LETTER ALPHA WITH PSILI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F09 | Ἁ | Greek | GREEK CAPITAL LETTER ALPHA WITH DASIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F0A | Ἂ | Greek | GREEK CAPITAL LETTER ALPHA WITH PSILI AND VARIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F0B | Ἃ | Greek | GREEK CAPITAL LETTER ALPHA WITH DASIA AND VARIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F0C | Ἄ | Greek | GREEK CAPITAL LETTER ALPHA WITH PSILI AND OXIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F0D | Ἅ | Greek | GREEK CAPITAL LETTER ALPHA WITH DASIA AND OXIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F0E | Ἆ | Greek | GREEK CAPITAL LETTER ALPHA WITH PSILI AND PERISPOMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F0F | Ἇ | Greek | GREEK CAPITAL LETTER ALPHA WITH DASIA AND PERISPOMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F10 | ἐ | Greek | GREEK SMALL LETTER EPSILON WITH PSILI | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F11 | ἑ | Greek | GREEK SMALL LETTER EPSILON WITH DASIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F12 | ἒ | Greek | GREEK SMALL LETTER EPSILON WITH PSILI AND VARIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F13 | ἓ | Greek | GREEK SMALL LETTER EPSILON WITH DASIA AND VARIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F14 | ἔ | Greek | GREEK SMALL LETTER EPSILON WITH PSILI AND OXIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F15 | ἕ | Greek | GREEK SMALL LETTER EPSILON WITH DASIA AND OXIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F18 | Ἐ | Greek | GREEK CAPITAL LETTER EPSILON WITH PSILI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F19 | Ἑ | Greek | GREEK CAPITAL LETTER EPSILON WITH DASIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F1A | Ἒ | Greek | GREEK CAPITAL LETTER EPSILON WITH PSILI AND VARIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F1B | Ἓ | Greek | GREEK CAPITAL LETTER EPSILON WITH DASIA AND VARIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F1C | Ἔ | Greek | GREEK CAPITAL LETTER EPSILON WITH PSILI AND OXIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F1D | Ἕ | Greek | GREEK CAPITAL LETTER EPSILON WITH DASIA AND OXIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F20 | ἠ | Greek | GREEK SMALL LETTER ETA WITH PSILI | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F21 | ἡ | Greek | GREEK SMALL LETTER ETA WITH DASIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F22 | ἢ | Greek | GREEK SMALL LETTER ETA WITH PSILI AND VARIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F23 | ἣ | Greek | GREEK SMALL LETTER ETA WITH DASIA AND VARIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F24 | ἤ | Greek | GREEK SMALL LETTER ETA WITH PSILI AND OXIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F25 | ἥ | Greek | GREEK SMALL LETTER ETA WITH DASIA AND OXIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F26 | ἦ | Greek | GREEK SMALL LETTER ETA WITH PSILI AND PERISPOMENI | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F27 | ἧ | Greek | GREEK SMALL LETTER ETA WITH DASIA AND PERISPOMENI | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F28 | Ἠ | Greek | GREEK CAPITAL LETTER ETA WITH PSILI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F29 | Ἡ | Greek | GREEK CAPITAL LETTER ETA WITH DASIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F2A | Ἢ | Greek | GREEK CAPITAL LETTER ETA WITH PSILI AND VARIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F2B | Ἣ | Greek | GREEK CAPITAL LETTER ETA WITH DASIA AND VARIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F2C | Ἤ | Greek | GREEK CAPITAL LETTER ETA WITH PSILI AND OXIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F2D | Ἥ | Greek | GREEK CAPITAL LETTER ETA WITH DASIA AND OXIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F2E | Ἦ | Greek | GREEK CAPITAL LETTER ETA WITH PSILI AND PERISPOMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F2F | Ἧ | Greek | GREEK CAPITAL LETTER ETA WITH DASIA AND PERISPOMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F30 | ἰ | Greek | GREEK SMALL LETTER IOTA WITH PSILI | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F31 | ἱ | Greek | GREEK SMALL LETTER IOTA WITH DASIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F32 | ἲ | Greek | GREEK SMALL LETTER IOTA WITH PSILI AND VARIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F33 | ἳ | Greek | GREEK SMALL LETTER IOTA WITH DASIA AND VARIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F34 | ἴ | Greek | GREEK SMALL LETTER IOTA WITH PSILI AND OXIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F35 | ἵ | Greek | GREEK SMALL LETTER IOTA WITH DASIA AND OXIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F36 | ἶ | Greek | GREEK SMALL LETTER IOTA WITH PSILI AND PERISPOMENI | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F37 | ἷ | Greek | GREEK SMALL LETTER IOTA WITH DASIA AND PERISPOMENI | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F38 | Ἰ | Greek | GREEK CAPITAL LETTER IOTA WITH PSILI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F39 | Ἱ | Greek | GREEK CAPITAL LETTER IOTA WITH DASIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F3A | Ἲ | Greek | GREEK CAPITAL LETTER IOTA WITH PSILI AND VARIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F3B | Ἳ | Greek | GREEK CAPITAL LETTER IOTA WITH DASIA AND VARIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F3C | Ἴ | Greek | GREEK CAPITAL LETTER IOTA WITH PSILI AND OXIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F3D | Ἵ | Greek | GREEK CAPITAL LETTER IOTA WITH DASIA AND OXIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F3E | Ἶ | Greek | GREEK CAPITAL LETTER IOTA WITH PSILI AND PERISPOMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F3F | Ἷ | Greek | GREEK CAPITAL LETTER IOTA WITH DASIA AND PERISPOMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F40 | ὀ | Greek | GREEK SMALL LETTER OMICRON WITH PSILI | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F41 | ὁ | Greek | GREEK SMALL LETTER OMICRON WITH DASIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F42 | ὂ | Greek | GREEK SMALL LETTER OMICRON WITH PSILI AND VARIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F43 | ὃ | Greek | GREEK SMALL LETTER OMICRON WITH DASIA AND VARIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F44 | ὄ | Greek | GREEK SMALL LETTER OMICRON WITH PSILI AND OXIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F45 | ὅ | Greek | GREEK SMALL LETTER OMICRON WITH DASIA AND OXIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F48 | Ὀ | Greek | GREEK CAPITAL LETTER OMICRON WITH PSILI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F49 | Ὁ | Greek | GREEK CAPITAL LETTER OMICRON WITH DASIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F4A | Ὂ | Greek | GREEK CAPITAL LETTER OMICRON WITH PSILI AND VARIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F4B | Ὃ | Greek | GREEK CAPITAL LETTER OMICRON WITH DASIA AND VARIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F4C | Ὄ | Greek | GREEK CAPITAL LETTER OMICRON WITH PSILI AND OXIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F4D | Ὅ | Greek | GREEK CAPITAL LETTER OMICRON WITH DASIA AND OXIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F50 | ὐ | Greek | GREEK SMALL LETTER UPSILON WITH PSILI | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F51 | ὑ | Greek | GREEK SMALL LETTER UPSILON WITH DASIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F52 | ὒ | Greek | GREEK SMALL LETTER UPSILON WITH PSILI AND VARIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F53 | ὓ | Greek | GREEK SMALL LETTER UPSILON WITH DASIA AND VARIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F54 | ὔ | Greek | GREEK SMALL LETTER UPSILON WITH PSILI AND OXIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F55 | ὕ | Greek | GREEK SMALL LETTER UPSILON WITH DASIA AND OXIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F56 | ὖ | Greek | GREEK SMALL LETTER UPSILON WITH PSILI AND PERISPOMENI | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F57 | ὗ | Greek | GREEK SMALL LETTER UPSILON WITH DASIA AND PERISPOMENI | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F59 | Ὑ | Greek | GREEK CAPITAL LETTER UPSILON WITH DASIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F5B | Ὓ | Greek | GREEK CAPITAL LETTER UPSILON WITH DASIA AND VARIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F5D | Ὕ | Greek | GREEK CAPITAL LETTER UPSILON WITH DASIA AND OXIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F5F | Ὗ | Greek | GREEK CAPITAL LETTER UPSILON WITH DASIA AND PERISPOMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F60 | ὠ | Greek | GREEK SMALL LETTER OMEGA WITH PSILI | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F61 | ὡ | Greek | GREEK SMALL LETTER OMEGA WITH DASIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F62 | ὢ | Greek | GREEK SMALL LETTER OMEGA WITH PSILI AND VARIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F63 | ὣ | Greek | GREEK SMALL LETTER OMEGA WITH DASIA AND VARIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F64 | ὤ | Greek | GREEK SMALL LETTER OMEGA WITH PSILI AND OXIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F65 | ὥ | Greek | GREEK SMALL LETTER OMEGA WITH DASIA AND OXIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F66 | ὦ | Greek | GREEK SMALL LETTER OMEGA WITH PSILI AND PERISPOMENI | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F67 | ὧ | Greek | GREEK SMALL LETTER OMEGA WITH DASIA AND PERISPOMENI | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F68 | Ὠ | Greek | GREEK CAPITAL LETTER OMEGA WITH PSILI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F69 | Ὡ | Greek | GREEK CAPITAL LETTER OMEGA WITH DASIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F6A | Ὢ | Greek | GREEK CAPITAL LETTER OMEGA WITH PSILI AND VARIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F6B | Ὣ | Greek | GREEK CAPITAL LETTER OMEGA WITH DASIA AND VARIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F6C | Ὤ | Greek | GREEK CAPITAL LETTER OMEGA WITH PSILI AND OXIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F6D | Ὥ | Greek | GREEK CAPITAL LETTER OMEGA WITH DASIA AND OXIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F6E | Ὦ | Greek | GREEK CAPITAL LETTER OMEGA WITH PSILI AND PERISPOMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F6F | Ὧ | Greek | GREEK CAPITAL LETTER OMEGA WITH DASIA AND PERISPOMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F70 | ὰ | Greek | GREEK SMALL LETTER ALPHA WITH VARIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F72 | ὲ | Greek | GREEK SMALL LETTER EPSILON WITH VARIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F74 | ὴ | Greek | GREEK SMALL LETTER ETA WITH VARIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F76 | ὶ | Greek | GREEK SMALL LETTER IOTA WITH VARIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F78 | ὸ | Greek | GREEK SMALL LETTER OMICRON WITH VARIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F7A | ὺ | Greek | GREEK SMALL LETTER UPSILON WITH VARIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F7C | ὼ | Greek | GREEK SMALL LETTER OMEGA WITH VARIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1F80 | ᾀ | Greek | GREEK SMALL LETTER ALPHA WITH PSILI AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1F81 | ᾁ | Greek | GREEK SMALL LETTER ALPHA WITH DASIA AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1F82 | ᾂ | Greek | GREEK SMALL LETTER ALPHA WITH PSILI AND VARIA AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1F83 | ᾃ | Greek | GREEK SMALL LETTER ALPHA WITH DASIA AND VARIA AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1F84 | ᾄ | Greek | GREEK SMALL LETTER ALPHA WITH PSILI AND OXIA AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1F85 | ᾅ | Greek | GREEK SMALL LETTER ALPHA WITH DASIA AND OXIA AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1F86 | ᾆ | Greek | GREEK SMALL LETTER ALPHA WITH PSILI AND PERISPOMENI AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1F87 | ᾇ | Greek | GREEK SMALL LETTER ALPHA WITH DASIA AND PERISPOMENI AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1F88 | ᾈ | Greek | GREEK CAPITAL LETTER ALPHA WITH PSILI AND PROSGEGRAMMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F89 | ᾉ | Greek | GREEK CAPITAL LETTER ALPHA WITH DASIA AND PROSGEGRAMMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F8A | ᾊ | Greek | GREEK CAPITAL LETTER ALPHA WITH PSILI AND VARIA AND PROSGEGRAMMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F8B | ᾋ | Greek | GREEK CAPITAL LETTER ALPHA WITH DASIA AND VARIA AND PROSGEGRAMMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F8C | ᾌ | Greek | GREEK CAPITAL LETTER ALPHA WITH PSILI AND OXIA AND PROSGEGRAMMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F8D | ᾍ | Greek | GREEK CAPITAL LETTER ALPHA WITH DASIA AND OXIA AND PROSGEGRAMMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F8E | ᾎ | Greek | GREEK CAPITAL LETTER ALPHA WITH PSILI AND PERISPOMENI AND PROSGEGRAMMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F8F | ᾏ | Greek | GREEK CAPITAL LETTER ALPHA WITH DASIA AND PERISPOMENI AND PROSGEGRAMMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F90 | ᾐ | Greek | GREEK SMALL LETTER ETA WITH PSILI AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1F91 | ᾑ | Greek | GREEK SMALL LETTER ETA WITH DASIA AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1F92 | ᾒ | Greek | GREEK SMALL LETTER ETA WITH PSILI AND VARIA AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1F93 | ᾓ | Greek | GREEK SMALL LETTER ETA WITH DASIA AND VARIA AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1F94 | ᾔ | Greek | GREEK SMALL LETTER ETA WITH PSILI AND OXIA AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1F95 | ᾕ | Greek | GREEK SMALL LETTER ETA WITH DASIA AND OXIA AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1F96 | ᾖ | Greek | GREEK SMALL LETTER ETA WITH PSILI AND PERISPOMENI AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1F97 | ᾗ | Greek | GREEK SMALL LETTER ETA WITH DASIA AND PERISPOMENI AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1F98 | ᾘ | Greek | GREEK CAPITAL LETTER ETA WITH PSILI AND PROSGEGRAMMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F99 | ᾙ | Greek | GREEK CAPITAL LETTER ETA WITH DASIA AND PROSGEGRAMMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F9A | ᾚ | Greek | GREEK CAPITAL LETTER ETA WITH PSILI AND VARIA AND PROSGEGRAMMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F9B | ᾛ | Greek | GREEK CAPITAL LETTER ETA WITH DASIA AND VARIA AND PROSGEGRAMMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F9C | ᾜ | Greek | GREEK CAPITAL LETTER ETA WITH PSILI AND OXIA AND PROSGEGRAMMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F9D | ᾝ | Greek | GREEK CAPITAL LETTER ETA WITH DASIA AND OXIA AND PROSGEGRAMMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F9E | ᾞ | Greek | GREEK CAPITAL LETTER ETA WITH PSILI AND PERISPOMENI AND PROSGEGRAMMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1F9F | ᾟ | Greek | GREEK CAPITAL LETTER ETA WITH DASIA AND PERISPOMENI AND PROSGEGRAMMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1FA0 | ᾠ | Greek | GREEK SMALL LETTER OMEGA WITH PSILI AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1FA1 | ᾡ | Greek | GREEK SMALL LETTER OMEGA WITH DASIA AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1FA2 | ᾢ | Greek | GREEK SMALL LETTER OMEGA WITH PSILI AND VARIA AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1FA3 | ᾣ | Greek | GREEK SMALL LETTER OMEGA WITH DASIA AND VARIA AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1FA4 | ᾤ | Greek | GREEK SMALL LETTER OMEGA WITH PSILI AND OXIA AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1FA5 | ᾥ | Greek | GREEK SMALL LETTER OMEGA WITH DASIA AND OXIA AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1FA6 | ᾦ | Greek | GREEK SMALL LETTER OMEGA WITH PSILI AND PERISPOMENI AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1FA7 | ᾧ | Greek | GREEK SMALL LETTER OMEGA WITH DASIA AND PERISPOMENI AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1FA8 | ᾨ | Greek | GREEK CAPITAL LETTER OMEGA WITH PSILI AND PROSGEGRAMMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1FA9 | ᾩ | Greek | GREEK CAPITAL LETTER OMEGA WITH DASIA AND PROSGEGRAMMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1FAA | ᾪ | Greek | GREEK CAPITAL LETTER OMEGA WITH PSILI AND VARIA AND PROSGEGRAMMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1FAB | ᾫ | Greek | GREEK CAPITAL LETTER OMEGA WITH DASIA AND VARIA AND PROSGEGRAMMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1FAC | ᾬ | Greek | GREEK CAPITAL LETTER OMEGA WITH PSILI AND OXIA AND PROSGEGRAMMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1FAD | ᾭ | Greek | GREEK CAPITAL LETTER OMEGA WITH DASIA AND OXIA AND PROSGEGRAMMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1FAE | ᾮ | Greek | GREEK CAPITAL LETTER OMEGA WITH PSILI AND PERISPOMENI AND PROSGEGRAMMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1FAF | ᾯ | Greek | GREEK CAPITAL LETTER OMEGA WITH DASIA AND PERISPOMENI AND PROSGEGRAMMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1FB0 | ᾰ | Greek | GREEK SMALL LETTER ALPHA WITH VRACHY | [100] | Proposed:Obsolete, Proposed:Technical | |
U+1FB1 | ᾱ | Greek | GREEK SMALL LETTER ALPHA WITH MACRON | [100] | Proposed:Obsolete, Proposed:Technical | |
U+1FB2 | ᾲ | Greek | GREEK SMALL LETTER ALPHA WITH VARIA AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA208 | |
U+1FB3 | ᾳ | Greek | GREEK SMALL LETTER ALPHA WITH YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA208 | |
U+1FB4 | ᾴ | Greek | GREEK SMALL LETTER ALPHA WITH OXIA AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA208 | |
U+1FB6 | ᾶ | Greek | GREEK SMALL LETTER ALPHA WITH PERISPOMENI | [100] | Proposed:Obsolete | polytoniko orthography |
U+1FB7 | ᾷ | Greek | GREEK SMALL LETTER ALPHA WITH PERISPOMENI AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1FB8 | Ᾰ | Greek | GREEK CAPITAL LETTER ALPHA WITH VRACHY | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1FB9 | Ᾱ | Greek | GREEK CAPITAL LETTER ALPHA WITH MACRON | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1FBA | Ὰ | Greek | GREEK CAPITAL LETTER ALPHA WITH VARIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1FBC | ᾼ | Greek | GREEK CAPITAL LETTER ALPHA WITH PROSGEGRAMMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1FC2 | ῂ | Greek | GREEK SMALL LETTER ETA WITH VARIA AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1FC3 | ῃ | Greek | GREEK SMALL LETTER ETA WITH YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1FC4 | ῄ | Greek | GREEK SMALL LETTER ETA WITH OXIA AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1FC6 | ῆ | Greek | GREEK SMALL LETTER ETA WITH PERISPOMENI | [100] | Proposed:Obsolete | polytoniko orthography |
U+1FC7 | ῇ | Greek | GREEK SMALL LETTER ETA WITH PERISPOMENI AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1FC8 | Ὲ | Greek | GREEK CAPITAL LETTER EPSILON WITH VARIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1FCA | Ὴ | Greek | GREEK CAPITAL LETTER ETA WITH VARIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1FCC | ῌ | Greek | GREEK CAPITAL LETTER ETA WITH PROSGEGRAMMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1FD0 | ῐ | Greek | GREEK SMALL LETTER IOTA WITH VRACHY | [100] | Proposed:Obsolete | polytoniko orthography |
U+1FD1 | ῑ | Greek | GREEK SMALL LETTER IOTA WITH MACRON | [100] | Proposed:Obsolete | polytoniko orthography |
U+1FD2 | ῒ | Greek | GREEK SMALL LETTER IOTA WITH DIALYTIKA AND VARIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1FD6 | ῖ | Greek | GREEK SMALL LETTER IOTA WITH PERISPOMENI | [100] | Proposed:Obsolete | polytoniko orthography |
U+1FD7 | ῗ | Greek | GREEK SMALL LETTER IOTA WITH DIALYTIKA AND PERISPOMENI | [100] | Proposed:Obsolete | polytoniko orthography |
U+1FD8 | Ῐ | Greek | GREEK CAPITAL LETTER IOTA WITH VRACHY | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1FD9 | Ῑ | Greek | GREEK CAPITAL LETTER IOTA WITH MACRON | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1FDA | Ὶ | Greek | GREEK CAPITAL LETTER IOTA WITH VARIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1FDB | Ί | Greek | GREEK CAPITAL LETTER IOTA WITH OXIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1FE0 | ῠ | Greek | GREEK SMALL LETTER UPSILON WITH VRACHY | [100] | Proposed:Obsolete | polytoniko orthography |
U+1FE1 | ῡ | Greek | GREEK SMALL LETTER UPSILON WITH MACRON | [100] | Proposed:Obsolete | polytoniko orthography |
U+1FE2 | ῢ | Greek | GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND VARIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1FE4 | ῤ | Greek | GREEK SMALL LETTER RHO WITH PSILI | [100] | Proposed:Obsolete | polytoniko orthography |
U+1FE5 | ῥ | Greek | GREEK SMALL LETTER RHO WITH DASIA | [100] | Proposed:Obsolete | polytoniko orthography |
U+1FE6 | ῦ | Greek | GREEK SMALL LETTER UPSILON WITH PERISPOMENI | [100] | Proposed:Obsolete | polytoniko orthography |
U+1FE7 | ῧ | Greek | GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND PERISPOMENI | [100] | Proposed:Obsolete | polytoniko orthography |
U+1FE8 | Ῠ | Greek | GREEK CAPITAL LETTER UPSILON WITH VRACHY | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1FE9 | Ῡ | Greek | GREEK CAPITAL LETTER UPSILON WITH MACRON | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1FEA | Ὺ | Greek | GREEK CAPITAL LETTER UPSILON WITH VARIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1FEB | Ύ | Greek | GREEK CAPITAL LETTER UPSILON WITH OXIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1FF2 | ῲ | Greek | GREEK SMALL LETTER OMEGA WITH VARIA AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1FF3 | ῳ | Greek | GREEK SMALL LETTER OMEGA WITH YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1FF4 | ῴ | Greek | GREEK SMALL LETTER OMEGA WITH OXIA AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1FF6 | ῶ | Greek | GREEK SMALL LETTER OMEGA WITH PERISPOMENI | [100] | Proposed:Obsolete | polytoniko orthography |
U+1FF7 | ῷ | Greek | GREEK SMALL LETTER OMEGA WITH PERISPOMENI AND YPOGEGRAMMENI | Proposed:Obsolete | NOT IDNA2008 | |
U+1FF8 | Ὸ | Greek | GREEK CAPITAL LETTER OMICRON WITH VARIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1FFA | Ὼ | Greek | GREEK CAPITAL LETTER OMEGA WITH VARIA | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+1FFC | ῼ | Greek | GREEK CAPITAL LETTER OMEGA WITH PROSGEGRAMMENI | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+2010 | ‐ | Common | HYPHEN | Inclusion | NOT IDNA2008 | |
U+2019 | ’ | Common | RIGHT SINGLE QUOTATION MARK | Inclusion | NOT IDNA2008 | |
U+2027 | ‧ | Common | HYPHENATION POINT | Inclusion | NOT IDNA2008 | |
U+2D27 | ⴧ | Georgian | GEORGIAN SMALL LETTER YN | [100] | Proposed:Technical | Khutsuri |
U+2D2D | ⴭ | Georgian | GEORGIAN SMALL LETTER AEN | [100] | Proposed:Technical | Khutsuri |
U+2DA0 | ⶠ | Ethiopic | ETHIOPIC SYLLABLE SSA | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DA1 | ⶡ | Ethiopic | ETHIOPIC SYLLABLE SSU | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DA2 | ⶢ | Ethiopic | ETHIOPIC SYLLABLE SSI | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DA3 | ⶣ | Ethiopic | ETHIOPIC SYLLABLE SSAA | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DA4 | ⶤ | Ethiopic | ETHIOPIC SYLLABLE SSEE | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DA5 | ⶥ | Ethiopic | ETHIOPIC SYLLABLE SSE | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DA6 | ⶦ | Ethiopic | ETHIOPIC SYLLABLE SSO | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DA8 | ⶨ | Ethiopic | ETHIOPIC SYLLABLE CCA | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DA9 | ⶩ | Ethiopic | ETHIOPIC SYLLABLE CCU | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DAA | ⶪ | Ethiopic | ETHIOPIC SYLLABLE CCI | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DAB | ⶫ | Ethiopic | ETHIOPIC SYLLABLE CCAA | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DAC | ⶬ | Ethiopic | ETHIOPIC SYLLABLE CCEE | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DAD | ⶭ | Ethiopic | ETHIOPIC SYLLABLE CCE | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DAE | ⶮ | Ethiopic | ETHIOPIC SYLLABLE CCO | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DB0 | ⶰ | Ethiopic | ETHIOPIC SYLLABLE ZZA | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DB1 | ⶱ | Ethiopic | ETHIOPIC SYLLABLE ZZU | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DB2 | ⶲ | Ethiopic | ETHIOPIC SYLLABLE ZZI | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DB3 | ⶳ | Ethiopic | ETHIOPIC SYLLABLE ZZAA | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DB4 | ⶴ | Ethiopic | ETHIOPIC SYLLABLE ZZEE | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DB5 | ⶵ | Ethiopic | ETHIOPIC SYLLABLE ZZE | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DB6 | ⶶ | Ethiopic | ETHIOPIC SYLLABLE ZZO | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DB8 | ⶸ | Ethiopic | ETHIOPIC SYLLABLE CCHA | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DB9 | ⶹ | Ethiopic | ETHIOPIC SYLLABLE CCHU | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DBA | ⶺ | Ethiopic | ETHIOPIC SYLLABLE CCHI | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DBB | ⶻ | Ethiopic | ETHIOPIC SYLLABLE CCHAA | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DBC | ⶼ | Ethiopic | ETHIOPIC SYLLABLE CCHEE | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DBD | ⶽ | Ethiopic | ETHIOPIC SYLLABLE CCHE | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DBE | ⶾ | Ethiopic | ETHIOPIC SYLLABLE CCHO | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DC0 | ⷀ | Ethiopic | ETHIOPIC SYLLABLE QYA | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DC1 | ⷁ | Ethiopic | ETHIOPIC SYLLABLE QYU | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DC2 | ⷂ | Ethiopic | ETHIOPIC SYLLABLE QYI | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DC3 | ⷃ | Ethiopic | ETHIOPIC SYLLABLE QYAA | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DC4 | ⷄ | Ethiopic | ETHIOPIC SYLLABLE QYEE | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DC5 | ⷅ | Ethiopic | ETHIOPIC SYLLABLE QYE | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DC6 | ⷆ | Ethiopic | ETHIOPIC SYLLABLE QYO | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DC8 | ⷈ | Ethiopic | ETHIOPIC SYLLABLE KYA | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DC9 | ⷉ | Ethiopic | ETHIOPIC SYLLABLE KYU | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DCA | ⷊ | Ethiopic | ETHIOPIC SYLLABLE KYI | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DCB | ⷋ | Ethiopic | ETHIOPIC SYLLABLE KYAA | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DCC | ⷌ | Ethiopic | ETHIOPIC SYLLABLE KYEE | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DCD | ⷍ | Ethiopic | ETHIOPIC SYLLABLE KYE | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DCE | ⷎ | Ethiopic | ETHIOPIC SYLLABLE KYO | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DD0 | ⷐ | Ethiopic | ETHIOPIC SYLLABLE XYA | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DD1 | ⷑ | Ethiopic | ETHIOPIC SYLLABLE XYU | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DD2 | ⷒ | Ethiopic | ETHIOPIC SYLLABLE XYI | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DD3 | ⷓ | Ethiopic | ETHIOPIC SYLLABLE XYAA | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DD4 | ⷔ | Ethiopic | ETHIOPIC SYLLABLE XYEE | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DD5 | ⷕ | Ethiopic | ETHIOPIC SYLLABLE XYE | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DD6 | ⷖ | Ethiopic | ETHIOPIC SYLLABLE XYO | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DD8 | ⷘ | Ethiopic | ETHIOPIC SYLLABLE GYA | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DD9 | ⷙ | Ethiopic | ETHIOPIC SYLLABLE GYU | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DDA | ⷚ | Ethiopic | ETHIOPIC SYLLABLE GYI | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DDB | ⷛ | Ethiopic | ETHIOPIC SYLLABLE GYAA | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DDC | ⷜ | Ethiopic | ETHIOPIC SYLLABLE GYEE | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DDD | ⷝ | Ethiopic | ETHIOPIC SYLLABLE GYE | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+2DDE | ⷞ | Ethiopic | ETHIOPIC SYLLABLE GYO | [100] | Proposed:Uncommon_Use | Sebatbeit |
U+3099 | ゙ | Inherited | COMBINING KATAKANA-HIRAGANA VOICED SOUND MARK | [100] | Proposed:Uncommon_Use | required only for NFD |
U+309A | ゚ | Inherited | COMBINING KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK | [100] | Proposed:Uncommon_Use | required only for NFD |
U+A67F | ꙿ | Cyrillic | CYRILLIC PAYEROK | [100] | Proposed:Obsolete | |
U+A717 | ꜗ | Common | MODIFIER LETTER DOT VERTICAL BAR | [100] | Proposed:Technical | no modifiers |
U+A718 | ꜘ | Common | MODIFIER LETTER DOT SLASH | [100] | Proposed:Technical | no modifiers |
U+A719 | ꜙ | Common | MODIFIER LETTER DOT HORIZONTAL BAR | [100] | Proposed:Technical | no modifiers |
U+A71A | ꜚ | Common | MODIFIER LETTER LOWER RIGHT CORNER ANGLE | [100] | Proposed:Technical | no modifiers |
U+A71B | ꜛ | Common | MODIFIER LETTER RAISED UP ARROW | [100] | Proposed:Technical | no modifiers |
U+A71C | ꜜ | Common | MODIFIER LETTER RAISED DOWN ARROW | [100] | Proposed:Technical | no modifiers |
U+A71D | ꜝ | Common | MODIFIER LETTER RAISED EXCLAMATION MARK | [100] | Proposed:Technical | no modifiers |
U+A71E | ꜞ | Common | MODIFIER LETTER RAISED INVERTED EXCLAMATION MARK | [100] | Proposed:Technical | no modifiers |
U+A71F | ꜟ | Common | MODIFIER LETTER LOW INVERTED EXCLAMATION MARK | [100] | Proposed:Technical | no modifiers |
U+A788 | ꞈ | Common | MODIFIER LETTER LOW CIRCUMFLEX ACCENT | [100] | Proposed:Technical | |
U+A792 | Ꞓ | Latin | LATIN CAPITAL LETTER C WITH BAR | Proposed:Uncommon_Use | Not IDNA2008; Uppercase | |
U+A793 | ꞓ | Latin | LATIN SMALL LETTER C WITH BAR | [100] | Proposed:Uncommon_Use | Nanai |
U+A7C0 | Ꟁ | Latin | LATIN CAPITAL LETTER OLD POLISH O | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+A7C1 | ꟁ | Latin | LATIN SMALL LETTER OLD POLISH O | Proposed:Obsolete | Old Polish | |
U+A7C2 | Ꟃ | Latin | LATIN CAPITAL LETTER ANGLICANA W | Proposed:Uncommon_Use | Not IDNA2008; Uppercase | |
U+A7C3 | ꟃ | Latin | LATIN SMALL LETTER ANGLICANA W | [100] | Proposed:Uncommon_Use | (medieval English/Cornish) (12.0 17/238) |
U+A7C4 | Ꞔ | Latin | LATIN CAPITAL LETTER C WITH PALATAL HOOK | Proposed:Obsolete | NOT IN IDNA2008; this is the uppercase of Obsolete A794 | |
U+A7C5 | Ʂ | Latin | LATIN CAPITAL LETTER S WITH HOOK | Proposed:Technical | NOT in IDNA2008; this is the uppercase of Technical 0282 | |
U+A7C6 | Ᶎ | Latin | LATIN CAPITAL LETTER Z WITH PALATAL HOOK | Proposed:Technical | NOT in IDNA2008; this is the uppercase of Technical 1D8E | |
U+A7C7 | Ꟈ | Latin | LATIN CAPITAL LETTER D WITH SHORT STROKE OVERLAY | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+A7C8 | ꟈ | Latin | LATIN SMALL LETTER D WITH SHORT STROKE OVERLAY | Proposed:Obsolete | Gaullish | |
U+A7C9 | Ꟊ | Latin | LATIN CAPITAL LETTER S WITH SHORT STROKE OVERLAY | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+A7CA | ꟊ | Latin | LATIN SMALL LETTER S WITH SHORT STROKE OVERLAY | Proposed:Obsolete | Gaullish | |
U+A7D0 | Ꟑ | Latin | LATIN CAPITAL LETTER CLOSED INSULAR G | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+A7D1 | ꟑ | Latin | LATIN SMALL LETTER CLOSED INSULAR G | Proposed:Obsolete | Ormulum | |
U+A7D3 | ꟓ | Latin | LATIN SMALL LETTER DOUBLE THORN | Proposed:Obsolete | Ormulum | |
U+A7D5 | ꟕ | Latin | LATIN SMALL LETTER DOUBLE WYNN | Proposed:Obsolete | Ormulum | |
U+A7D6 | Ꟗ | Latin | LATIN CAPITAL LETTER MIDDLE SCOTS S | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+A7D7 | ꟗ | Latin | LATIN SMALL LETTER MIDDLE SCOTS S | Proposed:Obsolete | Middle Scots | |
U+A7D8 | Ꟙ | Latin | LATIN CAPITAL LETTER SIGMOID S | Proposed:Obsolete | Not IDNA2008; Uppercase | |
U+A7D9 | ꟙ | Latin | LATIN SMALL LETTER SIGMOID S | Proposed:Obsolete | Middle Cornish, English, Scots | |
U+A9E7 | ꧧ | Myanmar | MYANMAR LETTER TAI LAING NYA | [100] | Proposed:Uncommon_Use | (Tai Laing) (7.0 11/130R) |
U+A9E8 | ꧨ | Myanmar | MYANMAR LETTER TAI LAING FA | [100] | Proposed:Uncommon_Use | (Tai Laing) (7.0 11/130R) |
U+A9E9 | ꧩ | Myanmar | MYANMAR LETTER TAI LAING GA | [100] | Proposed:Uncommon_Use | (Tai Laing) (7.0 11/130R) |
U+A9EA | ꧪ | Myanmar | MYANMAR LETTER TAI LAING GHA | [100] | Proposed:Uncommon_Use | (Tai Laing) (7.0 11/130R) |
U+A9EB | ꧫ | Myanmar | MYANMAR LETTER TAI LAING JA | [100] | Proposed:Uncommon_Use | (Tai Laing) (7.0 11/130R) |
U+A9EC | ꧬ | Myanmar | MYANMAR LETTER TAI LAING JHA | [100] | Proposed:Uncommon_Use | (Tai Laing) (7.0 11/130R) |
U+A9ED | ꧭ | Myanmar | MYANMAR LETTER TAI LAING DDA | [100] | Proposed:Uncommon_Use | (Tai Laing) (7.0 11/130R) |
U+A9EE | ꧮ | Myanmar | MYANMAR LETTER TAI LAING DDHA | [100] | Proposed:Uncommon_Use | (Tai Laing) (7.0 11/130R) |
U+A9EF | ꧯ | Myanmar | MYANMAR LETTER TAI LAING NNA | [100] | Proposed:Uncommon_Use | (Tai Laing) (7.0 11/130R) |
U+A9F0 | ꧰ | Myanmar | MYANMAR TAI LAING DIGIT ZERO | Proposed:Uncommon_Use | (Tai Laing) (7.0 11/130R) | |
U+A9F1 | ꧱ | Myanmar | MYANMAR TAI LAING DIGIT ONE | Proposed:Uncommon_Use | (Tai Laing) (7.0 11/130R) | |
U+A9F2 | ꧲ | Myanmar | MYANMAR TAI LAING DIGIT TWO | Proposed:Uncommon_Use | (Tai Laing) (7.0 11/130R) | |
U+A9F3 | ꧳ | Myanmar | MYANMAR TAI LAING DIGIT THREE | Proposed:Uncommon_Use | (Tai Laing) (7.0 11/130R) | |
U+A9F4 | ꧴ | Myanmar | MYANMAR TAI LAING DIGIT FOUR | Proposed:Uncommon_Use | (Tai Laing) (7.0 11/130R) | |
U+A9F5 | ꧵ | Myanmar | MYANMAR TAI LAING DIGIT FIVE | Proposed:Uncommon_Use | (Tai Laing) (7.0 11/130R) | |
U+A9F6 | ꧶ | Myanmar | MYANMAR TAI LAING DIGIT SIX | Proposed:Uncommon_Use | (Tai Laing) (7.0 11/130R) | |
U+A9F7 | ꧷ | Myanmar | MYANMAR TAI LAING DIGIT SEVEN | Proposed:Uncommon_Use | (Tai Laing) (7.0 11/130R) | |
U+A9F8 | ꧸ | Myanmar | MYANMAR TAI LAING DIGIT EIGHT | Proposed:Uncommon_Use | (Tai Laing) (7.0 11/130R) | |
U+A9F9 | ꧹ | Myanmar | MYANMAR TAI LAING DIGIT NINE | Proposed:Uncommon_Use | (Tai Laing) (7.0 11/130R) | |
U+A9FA | ꧺ | Myanmar | MYANMAR LETTER TAI LAING LLA | [100] | Proposed:Uncommon_Use | (Tai Laing) (7.0 11/130R) |
U+A9FB | ꧻ | Myanmar | MYANMAR LETTER TAI LAING DA | [100] | Proposed:Uncommon_Use | (Tai Laing) (7.0 11/130R) |
U+A9FC | ꧼ | Myanmar | MYANMAR LETTER TAI LAING DHA | [100] | Proposed:Uncommon_Use | (Tai Laing) (7.0 11/130R) |
U+A9FD | ꧽ | Myanmar | MYANMAR LETTER TAI LAING BA | [100] | Proposed:Uncommon_Use | (Tai Laing) (7.0 11/130R) |
U+A9FE | ꧾ | Myanmar | MYANMAR LETTER TAI LAING BHA | [100] | Proposed:Uncommon_Use | (Tai Laing) (7.0 11/130R) |
U+AA60 | ꩠ | Myanmar | MYANMAR LETTER KHAMTI GA | [100] | Proposed:Uncommon_Use | Khamti Shan |
U+AA61 | ꩡ | Myanmar | MYANMAR LETTER KHAMTI CA | [100] | Proposed:Uncommon_Use | Khamti Shan |
U+AA62 | ꩢ | Myanmar | MYANMAR LETTER KHAMTI CHA | [100] | Proposed:Uncommon_Use | Khamti Shan |
U+AA63 | ꩣ | Myanmar | MYANMAR LETTER KHAMTI JA | [100] | Proposed:Uncommon_Use | Khamti Shan |
U+AA64 | ꩤ | Myanmar | MYANMAR LETTER KHAMTI JHA | [100] | Proposed:Uncommon_Use | Khamti Shan |
U+AA65 | ꩥ | Myanmar | MYANMAR LETTER KHAMTI NYA | [100] | Proposed:Uncommon_Use | Khamti Shan |
U+AA66 | ꩦ | Myanmar | MYANMAR LETTER KHAMTI TTA | [100] | Proposed:Uncommon_Use | Khamti Shan |
U+AA67 | ꩧ | Myanmar | MYANMAR LETTER KHAMTI TTHA | [100] | Proposed:Uncommon_Use | Khamti Shan |
U+AA68 | ꩨ | Myanmar | MYANMAR LETTER KHAMTI DDA | [100] | Proposed:Uncommon_Use | Khamti Shan |
U+AA69 | ꩩ | Myanmar | MYANMAR LETTER KHAMTI DDHA | [100] | Proposed:Uncommon_Use | Khamti Shan |
U+AA6A | ꩪ | Myanmar | MYANMAR LETTER KHAMTI DHA | [100] | Proposed:Uncommon_Use | Khamti Shan |
U+AA6B | ꩫ | Myanmar | MYANMAR LETTER KHAMTI NA | [100] | Proposed:Uncommon_Use | Khamti Shan |
U+AA6C | ꩬ | Myanmar | MYANMAR LETTER KHAMTI SA | [100] | Proposed:Uncommon_Use | Khamti Shan |
U+AA6D | ꩭ | Myanmar | MYANMAR LETTER KHAMTI HA | [100] | Proposed:Uncommon_Use | Khamti Shan |
U+AA6E | ꩮ | Myanmar | MYANMAR LETTER KHAMTI LLA | [100] | Proposed:Uncommon_Use | Khamti Shan |
U+AA6F | ꩯ | Myanmar | MYANMAR LETTER KHAMTI FA | [100] | Proposed:Uncommon_Use | Khamti Shan |
U+AA70 | ꩰ | Myanmar | MYANMAR MODIFIER LETTER KHAMTI REDUPLICATION | [100] | Proposed:Uncommon_Use | Khamti Shan |
U+AA71 | ꩱ | Myanmar | MYANMAR LETTER KHAMTI XA | [100] | Proposed:Uncommon_Use | Khamti Shan |
U+AA72 | ꩲ | Myanmar | MYANMAR LETTER KHAMTI ZA | [100] | Proposed:Uncommon_Use | Khamti Shan |
U+AA73 | ꩳ | Myanmar | MYANMAR LETTER KHAMTI RA | [100] | Proposed:Uncommon_Use | Khamti Shan |
U+AA74 | ꩴ | Myanmar | MYANMAR LOGOGRAM KHAMTI OAY | [100] | Proposed:Uncommon_Use | Khamti Shan |
U+AA75 | ꩵ | Myanmar | MYANMAR LOGOGRAM KHAMTI QN | [100] | Proposed:Uncommon_Use | Khamti Shan |
U+AA76 | ꩶ | Myanmar | MYANMAR LOGOGRAM KHAMTI HM | [100] | Proposed:Uncommon_Use | Khamti Shan |
U+AA7A | ꩺ | Myanmar | MYANMAR LETTER AITON RA | [100] | Proposed:Uncommon_Use | Aiton |
U+AA7C | ꩼ | Myanmar | MYANMAR SIGN TAI LAING TONE-2 | [100] | Proposed:Uncommon_Use | (Tai Laing) (7.0 11/130R) |
U+AA7D | ꩽ | Myanmar | MYANMAR SIGN TAI LAING TONE-5 | [100] | Proposed:Uncommon_Use | (Tai Laing) (7.0 11/130R) |
U+AA7E | ꩾ | Myanmar | MYANMAR LETTER SHWE PALAUNG CHA | [100] | Proposed:Uncommon_Use | (Shwe Palaung) (7.0 11/130R) |
U+AA7F | ꩿ | Myanmar | MYANMAR LETTER SHWE PALAUNG SHA | [100] | Proposed:Uncommon_Use | (Shwe Palaung) (7.0 11/130R) |
U+AB11 | ꬑ | Ethiopic | ETHIOPIC SYLLABLE DZU | [100] | Proposed:Uncommon_Use | Gamo-Gofa-Dawro |
U+AB12 | ꬒ | Ethiopic | ETHIOPIC SYLLABLE DZI | [100] | Proposed:Uncommon_Use | Gamo-Gofa-Dawro |
U+AB13 | ꬓ | Ethiopic | ETHIOPIC SYLLABLE DZAA | [100] | Proposed:Uncommon_Use | Gamo-Gofa-Dawro |
U+AB14 | ꬔ | Ethiopic | ETHIOPIC SYLLABLE DZEE | [100] | Proposed:Uncommon_Use | Gamo-Gofa-Dawro |
U+AB15 | ꬕ | Ethiopic | ETHIOPIC SYLLABLE DZE | [100] | Proposed:Uncommon_Use | Gamo-Gofa-Dawro |
U+AB16 | ꬖ | Ethiopic | ETHIOPIC SYLLABLE DZO | [100] | Proposed:Uncommon_Use | Gamo-Gofa-Dawro |
U+AB20 | ꬠ | Ethiopic | ETHIOPIC SYLLABLE CCHHA | [100] | Proposed:Uncommon_Use | Gumuz |
U+AB21 | ꬡ | Ethiopic | ETHIOPIC SYLLABLE CCHHU | [100] | Proposed:Uncommon_Use | Gumuz |
U+AB22 | ꬢ | Ethiopic | ETHIOPIC SYLLABLE CCHHI | [100] | Proposed:Uncommon_Use | Gumuz |
U+AB23 | ꬣ | Ethiopic | ETHIOPIC SYLLABLE CCHHAA | [100] | Proposed:Uncommon_Use | Gumuz |
U+AB24 | ꬤ | Ethiopic | ETHIOPIC SYLLABLE CCHHEE | [100] | Proposed:Uncommon_Use | Gumuz |
U+AB25 | ꬥ | Ethiopic | ETHIOPIC SYLLABLE CCHHE | [100] | Proposed:Uncommon_Use | Gumuz |
U+AB26 | ꬦ | Ethiopic | ETHIOPIC SYLLABLE CCHHO | [100] | Proposed:Uncommon_Use | Gumuz |
U+AB28 | ꬨ | Ethiopic | ETHIOPIC SYLLABLE BBA | [100] | Proposed:Uncommon_Use | Gumuz |
U+AB29 | ꬩ | Ethiopic | ETHIOPIC SYLLABLE BBU | [100] | Proposed:Uncommon_Use | Gumuz |
U+AB2A | ꬪ | Ethiopic | ETHIOPIC SYLLABLE BBI | [100] | Proposed:Uncommon_Use | Gumuz |
U+AB2B | ꬫ | Ethiopic | ETHIOPIC SYLLABLE BBAA | [100] | Proposed:Uncommon_Use | Gumuz |
U+AB2C | ꬬ | Ethiopic | ETHIOPIC SYLLABLE BBEE | [100] | Proposed:Uncommon_Use | Gumuz |
U+AB2D | ꬭ | Ethiopic | ETHIOPIC SYLLABLE BBE | [100] | Proposed:Uncommon_Use | Gumuz |
U+AB2E | ꬮ | Ethiopic | ETHIOPIC SYLLABLE BBO | [100] | Proposed:Uncommon_Use | Gumuz |
U+AB66 | ꭦ | Latin | LATIN SMALL LETTER DZ DIGRAPH WITH RETROFLEX HOOK | [100] | Proposed:Uncommon_Use | (Sinology) (12.0 17/299 17/367) |
U+AB67 | ꭧ | Latin | LATIN SMALL LETTER TS DIGRAPH WITH RETROFLEX HOOK | [100] | Proposed:Uncommon_Use | (Sinology) (12.0 17/299 17/367) |
U+1133B | 𑌻 | Inherited | COMBINING BINDU BELOW | Proposed:Uncommon_Use | Grantha | |
U+1B11F | 𛄟 | Hiragana | HIRAGANA LETTER ARCHAIC WU | Proposed:Obsolete | ||
U+1B120 | 𛄠 | Katakana | KATAKANA LETTER ARCHAIC YI | Proposed:Obsolete | ||
U+1B121 | 𛄡 | Katakana | KATAKANA LETTER ARCHAIC YE | Proposed:Obsolete | ||
U+1B122 | 𛄢 | Katakana | KATAKANA LETTER ARCHAIC WU | Proposed:Obsolete | ||
U+1B132 | 𛄲 | Hiragana | HIRAGANA LETTER SMALL KO | Proposed:Obsolete | ||
U+1B150 | 𛅐 | Hiragana | HIRAGANA LETTER SMALL WI | [100] | Proposed:Obsolete | 12.0 16/354 16/385R |
U+1B151 | 𛅑 | Hiragana | HIRAGANA LETTER SMALL WE | [100] | Proposed:Obsolete | 12.0 16/354 16/385R |
U+1B152 | 𛅒 | Hiragana | HIRAGANA LETTER SMALL WO | [100] | Proposed:Obsolete | 12.0 16/354 16/385R |
U+1B155 | 𛅕 | Katakana | KATAKANA LETTER SMALL KO | Proposed:Obsolete | ||
U+1B164 | 𛅤 | Katakana | KATAKANA LETTER SMALL WI | [100] | Proposed:Obsolete | 12.0 16/354 16/385R |
U+1B165 | 𛅥 | Katakana | KATAKANA LETTER SMALL WE | [100] | Proposed:Obsolete | 12.0 16/354 16/385R |
U+1B166 | 𛅦 | Katakana | KATAKANA LETTER SMALL WO | [100] | Proposed:Obsolete | 12.0 16/354 16/385R |
U+1B167 | 𛅧 | Katakana | KATAKANA LETTER SMALL N | [100] | Proposed:Obsolete | 12.0 16/354 16/385R |
U+1DF00 | 𝼀 | Latin | LATIN SMALL LETTER FENG DIGRAPH WITH TRILL | Proposed:Technical | IPA | |
U+1DF01 | 𝼁 | Latin | LATIN SMALL LETTER REVERSED SCRIPT G | Proposed:Technical | IPA | |
U+1DF02 | 𝼂 | Latin | LATIN LETTER SMALL CAPITAL TURNED G | Proposed:Technical | IPA | |
U+1DF03 | 𝼃 | Latin | LATIN SMALL LETTER REVERSED K | Proposed:Technical | IPA | |
U+1DF04 | 𝼄 | Latin | LATIN LETTER SMALL CAPITAL L WITH BELT | Proposed:Technical | IPA | |
U+1DF05 | 𝼅 | Latin | LATIN SMALL LETTER LEZH WITH RETROFLEX HOOK | Proposed:Technical | IPA | |
U+1DF06 | 𝼆 | Latin | LATIN SMALL LETTER TURNED Y WITH BELT | Proposed:Technical | IPA | |
U+1DF07 | 𝼇 | Latin | LATIN SMALL LETTER REVERSED ENG | Proposed:Technical | IPA | |
U+1DF08 | 𝼈 | Latin | LATIN SMALL LETTER TURNED R WITH LONG LEG AND RETROFLEX HOOK | Proposed:Technical | IPA | |
U+1DF09 | 𝼉 | Latin | LATIN SMALL LETTER T WITH HOOK AND RETROFLEX HOOK | Proposed:Technical | IPA | |
U+1DF0A | 𝼊 | Latin | LATIN LETTER RETROFLEX CLICK WITH RETROFLEX HOOK | Proposed:Technical | IPA | |
U+1DF0B | 𝼋 | Latin | LATIN SMALL LETTER ESH WITH DOUBLE BAR | Proposed:Technical | IPA | |
U+1DF0C | 𝼌 | Latin | LATIN SMALL LETTER ESH WITH DOUBLE BAR AND CURL | Proposed:Technical | IPA | |
U+1DF0D | 𝼍 | Latin | LATIN SMALL LETTER TURNED T WITH CURL | Proposed:Technical | IPA | |
U+1DF0E | 𝼎 | Latin | LATIN LETTER INVERTED GLOTTAL STOP WITH CURL | Proposed:Technical | IPA | |
U+1DF0F | 𝼏 | Latin | LATIN LETTER STRETCHED C WITH CURL | Proposed:Technical | IPA | |
U+1DF10 | 𝼐 | Latin | LATIN LETTER SMALL CAPITAL TURNED K | Proposed:Technical | IPA | |
U+1DF11 | 𝼑 | Latin | LATIN SMALL LETTER L WITH FISHHOOK | Proposed:Technical | IPA | |
U+1DF12 | 𝼒 | Latin | LATIN SMALL LETTER DEZH DIGRAPH WITH PALATAL HOOK | Proposed:Technical | IPA | |
U+1DF13 | 𝼓 | Latin | LATIN SMALL LETTER L WITH BELT AND PALATAL HOOK | Proposed:Technical | IPA | |
U+1DF14 | 𝼔 | Latin | LATIN SMALL LETTER ENG WITH PALATAL HOOK | Proposed:Technical | IPA | |
U+1DF15 | 𝼕 | Latin | LATIN SMALL LETTER TURNED R WITH PALATAL HOOK | Proposed:Technical | IPA | |
U+1DF16 | 𝼖 | Latin | LATIN SMALL LETTER R WITH FISHHOOK AND PALATAL HOOK | Proposed:Technical | IPA | |
U+1DF17 | 𝼗 | Latin | LATIN SMALL LETTER TESH DIGRAPH WITH PALATAL HOOK | Proposed:Technical | IPA | |
U+1DF18 | 𝼘 | Latin | LATIN SMALL LETTER EZH WITH PALATAL HOOK | Proposed:Technical | IPA | |
U+1DF19 | 𝼙 | Latin | LATIN SMALL LETTER DEZH DIGRAPH WITH RETROFLEX HOOK | Proposed:Technical | IPA | |
U+1DF1A | 𝼚 | Latin | LATIN SMALL LETTER I WITH STROKE AND RETROFLEX HOOK | Proposed:Technical | IPA | |
U+1DF1B | 𝼛 | Latin | LATIN SMALL LETTER O WITH RETROFLEX HOOK | Proposed:Technical | IPA | |
U+1DF1C | 𝼜 | Latin | LATIN SMALL LETTER TESH DIGRAPH WITH RETROFLEX HOOK | Proposed:Technical | IPA | |
U+1DF1D | 𝼝 | Latin | LATIN SMALL LETTER C WITH RETROFLEX HOOK | Proposed:Technical | IPA | |
U+1DF1E | 𝼞 | Latin | LATIN SMALL LETTER S WITH CURL | Proposed:Technical | IPA | |
U+1DF25 | 𝼥 | Latin | LATIN SMALL LETTER D WITH MID-HEIGHT LEFT HOOK | Proposed:Technical | Malayalam transliteration | |
U+1DF26 | 𝼦 | Latin | LATIN SMALL LETTER L WITH MID-HEIGHT LEFT HOOK | Proposed:Technical | Malayalam transliteration | |
U+1DF27 | 𝼧 | Latin | LATIN SMALL LETTER N WITH MID-HEIGHT LEFT HOOK | Proposed:Technical | Malayalam transliteration | |
U+1DF28 | 𝼨 | Latin | LATIN SMALL LETTER R WITH MID-HEIGHT LEFT HOOK | Proposed:Technical | Malayalam transliteration | |
U+1DF29 | 𝼩 | Latin | LATIN SMALL LETTER S WITH MID-HEIGHT LEFT HOOK | Proposed:Technical | Malayalam transliteration | |
U+1DF2A | 𝼪 | Latin | LATIN SMALL LETTER T WITH MID-HEIGHT LEFT HOOK | Proposed:Technical | Malayalam transliteration | |
U+1E08F | 𞂏 | Cyrillic | COMBINING CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I | Proposed:Obsolete |
Legend
- Code Point
- A code point or code point sequence.
- Glyph
- The shape displayed depends on the fonts available to your browser.
- Script
- Shows the script property value from the Unicode Character Database. Combining marks may have the value Inherited and code points used with more than one script may have the value Common.
- Name
- Shows the character or sequence name from the Unicode Character Database.
- Ref
- Links to the references associated with the code point or sequence, if any.
- Tags
- LGR-defined tag values. Any tags matching the Unicode script property are suppressed in this view.
- Comment
- The comment as given in the XML file. However, if the comment for this row consists only of the code point or sequence name, it is suppressed in this view. By convention, comments starting with “=” denote an alias. If present, the symbol ⍟ marks a default item shared among a set of LGRs.
Variants
This LGR does not specify any variants.
Classes, Rules and Actions
Character Classes
Number of named classes | 25 |
---|---|
Implicit (except script) | 17 |
The following table lists all named and implicit classes with their definition and a list of their members intersected with the current repertoire (for larger classes, this list is elided).
Name | Definition | Count | Members or Ranges | Ref | Comment |
---|---|---|---|---|---|
Digits | Prop=gc:Nd | 760→10 | {A9F0-A9F9} | Any character matching Unicode property General_Category:Decimal_Number | |
Uppercase | Prop=gc:Lu | 1858→125 | {0200 0202 0204 0206 0208 020A 020C 020E 0210 0212 0214 0216 03FD-03FF 048A 048C 048E 049C 04A6 04B8 04C3 04C5 04C7 04C9 04CD 04F6 04FA 04FC 04FE 0510 0512 ...} | Any character matching Unicode property General_Category:Uppercase_Letter | |
numeric | explicit | 2 | {0F3E-0F3F} | Characters classed as “numeric” in the [MSR] | |
context-other | explicit | 3 | {0375 05F3-05F4} | Characters classed as “context-other” in the [MSR] | |
punctuation | explicit | 5 | {02BB 06FD-06FE 3099-309A} | Characters classed as “punctuation” in the [MSR] | |
duplicate | explicit | 2 | {0F7B 0F7D} | Characters classed as “duplicate” in the [MSR] | |
modifier | explicit | 9 | {A717-A71F} | Characters classed as “modifier” in the [MSR] | |
poetic | explicit | 12 | {0201 0203 0205 0207 0209 020B 020D 020F 0211 0213 0215 0217} | Characters classed as “poetic” in the [MSR] | |
religious_use | explicit | 8 | {0653 0950 0A74 0AD0 0BD0 0F00 2D27 2D2D} | Characters classed as “numeric” in the [MSR] | |
tentatively-religious_use | explicit | 25 | {0870-0888} | Characters tentatively classed as “religious_use” | |
symbol | explicit | 10 | {03FC 0950 0A74 0AD0 0BD0 0EAF 0F00 0F35 0F37 0FC6} | Characters classed as “symbol” in the [MSR] | |
technical | explicit | 46 | {0201 0203 0205 0207 0209 020B 020D 020F 0211 0213 0215 0217 02BC 02EC 030F-0311 0313-0314 0324-0325 032D-032E 0330 0335 0338-0339 0342 0559 06E5-06E6 097D 0B82 10F9-10FA 1E01 1E19 1E1B 1E2B 1E2D 1E73 1E75 1E77 1FB0-1FB1 A788} | Characters classed as “technical” in the [MSR] | |
tentatively-technical | explicit | 40 | {0375 0F3E-0F3F 1DF00-1DF1E 1DF25-1DF2A} | Characters tentatively classed as “technical” | |
uppercase-technical | explicit | 22 | {0200 0202 0204 0206 0208 020A 020C 020E 0210 0212 0214 0216 1E00 1E18 1E1A 1E2A 1E2C 1E72 1E74 1E76 A7C5-A7C6} | Uppercase equivalents of lowercase “technical” | |
historic | explicit | 74 | {1F00-1F07 1F10-1F15 1F20-1F27 1F30-1F37 1F40-1F45 1F50-1F57 1F60-1F67 1F70 1F72 1F74 1F76 1F78 1F7A 1F7C 1FB6 1FC6 1FD0-1FD2 1FD6-1FD7 1FE0-1FE2 1FE4-1FE7 1FF6} | Characters classed as “historic” in the [MSR] | |
tentatively-historic | explicit | 1 | {08B5} | Characters tentatively classed as “obsolete” | |
obsolete | explicit | 93 | {0138 037B-037D 03FC 049D 04A7 04B9 0515 0517 0519 051B 051D 051F 0521 0523 0527 0529 052F 063B-063C 063E-063F 06AC 077E-077F 093D 0960-0963 0971 09BD 09E0-09E3 0ABD 0AE0-0AE3 0B3D 0B60-0B61 0C3D 0C60-0C61 0CBD 0CE0-0CE3 0CF1-0CF2 0D3A 0D3D 0D4C 0D4E 0D60-0D61 0F6A 0F82-0F83 0F86-0F8F 1050-1059 17DC A67F 1B150-1B152 1B164-1B167} | Characters classed as “obsolete” in the [MSR] | |
tentatively-obsolete | explicit | 15 | {A7C1 A7C8 A7CA A7D1 A7D3 A7D5 A7D7 A7D9 1B11F-1B122 1B132 1B155 1E08F} | Characters tentatively classed as “obsolete” | |
uppercase-obsolete | explicit | 26 | {03FD-03FF 049C 04A6 04B8 0510 0512 0514 0516 0518 051A 051C 051E 0520 0522 0526 0528 052E A7C0 A7C4 A7C7 A7C9 A7D0 A7D6 A7D8} | Uppercase equivalents of lowercase “obsolete” | |
polytoniko | explicit | 203 | {0345 1F00-1F15 1F18-1F1D 1F20-1F45 1F48-1F4D 1F50-1F57 1F59 1F5B 1F5D 1F5F-1F70 1F72 1F74 1F76 1F78 1F7A 1F7C 1F80-1FB4 1FB6-1FBA 1FBC 1FC2-1FC4 1FC6-1FC8 1FCA 1FCC 1FD0-1FD2 1FD6-1FDB 1FE0-1FE2 1FE4-1FEB 1FF2-1FF4 1FF6-1FF8 1FFA 1FFC} | All characters in the Extended Greek Block, plus YPOGEGRAMMENI | |
uncommon_use | explicit | 237 | {048B 048D 048F 04C4 04C6 04C8 04CA 04CE 04F7 04FB 04FD 04FF 0511 0513 05EF 08B2 08B6-08BA 09FE 0A01 0AFA-0AFF 0C01 0C04 0C80 0D00 0D54-0D56 0E8E-0E93 0EA8-0EA9 0EAC 0F6A-0F6C 0F82-0F83 0FAE-0FB0 0FC6 1050-1059 1065-1074 108E 109A-109D 10F9-10FA 10FD-10FF 1380-138F 2DA0-2DA6 2DA8-2DAE 2DB0-2DB6 2DB8-2DBE 2DC0-2DC6 2DC8-2DCE 2DD0-2DD6 2DD8-2DDE A7C3 A9E7-A9FE AA60-AA76 AA7A AA7C-AA7F AB11-AB16 AB20-AB26 AB28-AB2E AB66-AB67} | Characters classed as “uncommon_use” in the [MSR] | |
limited_use | explicit | 2 | {A793 1133B} | Characters classed as “limited_use” in the [MSR] | |
tentatively-limited_use | explicit | 27 | {0889-088E 08C3-08C8 0B55 0C3C 0C5D 0CDD 0CF3 0E86 0E89 0E8C 0E98 0EA0 0EAC 0EBA 0ECE 3099-309A} | Characters tentatively classed as “limited_use” | |
uppercase-uncommon_use | explicit | 14 | {048A 048C 048E 04C3 04C5 04C7 04C9 04CD 04F6 04FA 04FC 04FE A792 A7C2} | Uppercase equivalents of lowercase “uncommon_use” | |
all-classifications | combined = [[:numeric:] ∪ [:context-other:] ∪ [:punctuation:] ∪ [:duplicate:] ∪ [:modifier:] ∪ [:poetic:] ∪ [:religious_use:] ∪ [:tentatively-religious_use:] ∪ [:symbol:] ∪ [:technical:] ∪ [:tentatively-technical:] ∪ [:uppercase-technical:] ∪ [:historic:] ∪ [:tentatively-historic:] ∪ [:obsolete:] ∪ [:tentatively-obsolete:] ∪ [:uppercase-obsolete:] ∪ [:polytoniko:] ∪ [:uncommon_use:] ∪ [:limited_use:] ∪ [:tentatively-limited_use:] ∪ [:uppercase-uncommon_use:]] |
760 | {0138 0200-0217 02BB-02BC 02EC 030F-0311 0313-0314 0324-0325 032D-032E 0330 0335 0338-0339 0342 0345 0375 037B-037D 03FC-03FF 048A-048F 049C-049D 04A6-04A7 ...} | ||
implicit | Tag=Default_Ignorable | 396→0 | {} | Any character tagged as Default_Ignorable | |
implicit | Tag=Deprecated | 15→0 | {} | Any character tagged as Deprecated | |
implicit | Tag=Exclusion | 21647→0 | {} | Any character tagged as Exclusion | |
implicit | Tag=Inclusion | 12→11 | {0027 002E 003A 058A 05F3-05F4 06FD-06FE 2010 2019 2027} | Any character tagged as Inclusion | |
implicit | Tag=Limited_Use | 5271→0 | {} | Any character tagged as Limited_Use | |
implicit | Tag=MSR-non-Han | 2494→0 | {} | Any character tagged as MSR-non-Han | |
implicit | Tag=Obsolete | 1620→0 | {} | Any character tagged as Obsolete | |
implicit | Tag=Recommended | 2 | {005F 063D} | Any character tagged as Recommended | |
implicit | Tag=RefLGR | 2332→0 | {} | Any character tagged as RefLGR | |
implicit | Tag=RefLGRBySequence | 13→0 | {} | Any character tagged as RefLGRBySequence | |
implicit | Tag=Technical | 1624→0 | {} | Any character tagged as Technical | |
implicit | Tag=Uncommon_Use | 346→0 | {} | Any character tagged as Uncommon_Use | |
implicit | Tag=Proposed:Exclusion | 2 | {0F7B 0F7D} | Any character tagged as Proposed:Exclusion | |
implicit | Tag=Proposed:Inclusion | 1 | {02BB} | The character tagged as Proposed:Inclusion | |
implicit | Tag=Proposed:Obsolete | 336 | {0138 0345 037B-037D 03FC-03FF 049C-049D 04A6-04A7 04B8-04B9 0514-0523 0526-0529 052E-052F 063B-063C 063E-063F 06AC 077E-077F 08B5 093D 0960-0963 0971 09BD ...} | Any character tagged as Proposed:Obsolete | |
implicit | Tag=Proposed:Technical | 155 | {0200-0217 02BC 02EC 030F-0311 0313-0314 0324-0325 032D-032E 0330 0335 0338-0339 0342 0375 03FC 0559 0653 06E5-06E6 0870-0888 0950 097D 0A74 0AD0 0B82 0BD0 ...} | Any character tagged as Proposed:Technical | |
implicit | Tag=Proposed:Uncommon_Use | 281 | {048A-048F 04C3-04CA 04CD-04CE 04F6-04F7 04FA-04FF 0510-0513 05EF 0889-088E 08B2 08B6-08BA 08C3-08C8 09FE 0A01 0AFA-0AFF 0B55 0C01 0C04 0C3C 0C5D 0C80 0CDD ...} | Any character tagged as Proposed:Uncommon_Use |
Legend
- Members or Ranges
- Lists the members of the class as code points (xxx) or as ranges of code points (xxx-yyy). Any class too numerous to list in full is elided with "...".
- m→n
- Indicates a set for which only n of its m members fall inside the repertoire.
- Tag=ttt
- A named or implicit class defined by all code points that share the given tag value (ttt).
- Prop=ppp:vvv
- A named class defined by reference to value vvv of Unicode property ppp.
- Explicit
- A named class defined by explicitly listing all its members.
- Implicit
- An anonymous class implicitly defined based on tag value and for which there is no named equivalent.
- Combined
- A named class defined by set operations on other classes using the following syntax:
- [: :] - named or implicit character set
- Reference to a named character set [:name:] or an implicit character set [:tag:]. A leading “^” before name or tag indicates the set complement.
- ∪, ∩, ∖, ∆ - set operators
- Sets may be combined by set operators (∪ = union, ∩ = intersection, ∖ = difference, ∆ = symmetric difference).
Note: The following named classes are defined but not used in this LGR: Digits, Uppercase, all-classifications.
Whole label evaluation and context rules
The LGR does not define any rules.
Actions
The LGR does not define any actions.
Table of References
The following lists the references cited for specific code points, variants, classes, rules or actions in this LGR.
[Avagraha] | Wikipedia, “Avagraha”, https://en.wikipedia.org/wiki/Avagraha |
[Azerbaijani-Encoding-Proposal] | ARABIC LETTER FARSI YEH WITH INVERTED V was encoded based on: M. Everson, R. Pournader, and E. Sarbar “Proposal to encode eight Arabic characters for Persian and Azerbaijani in the UCS”, https://www.unicode.org/L2/L2006/06345r-n3180r-fa-az.pdf |
[EGIDS] | Lewis and Simons, EGIDS: Expanded Graded Intergenerational Disruption Scale,” documented in [SIL-Ethnologue] and summarized here: https://en.wikipedia.org/wiki/Expanded_Graded_Intergenerational_Disruption_Scale_(EGIDS) |
[Greek-IDN-Case-Study] | “Study of the issues present in the registration of IDN TLDs in Greek characters”, Greek Case Study Team, ICANN IDN Variant Issues Project, http://archive.icann.org/en/topics/new-gtlds/greek-vip-issues-report-07oct11-en.pdf. |
[Greek-ccTLD] | FORTH-ICS .gr/.ελ , Registration of Domain Names in Greek Characters, https://grweb.ics.forth.gr/public/domains/idn?lang=en(The list of characters is found at https://grweb.ics.forth.gr/public/assets/docs/en/acceptable-greek-chars-29d9874ecbeb8ca27e43d49cf1fd7af6.pdf) |
[MSR] | ICANN, “Maximal Starting Repertoire”, https://www.icann.org/resources/pages/msr-2015-06-21-en |
[Proposal-Arabic] | “Proposal for Arabic Script Root Zone LGR”, https://www.icann.org/en/system/files/files/arabic-lgr-proposal-18nov15-en.pdf |
[Proposal-Greek] | Greek Generation Panel, “Proposal for a Greek Script Root Zone Label Generation Ruleset (LGR)”, 1 February 2022, https://www.icann.org/en/system/files/files/proposal-greek-lgr-01feb22-en.pdf |
[RefLGR] | ICANN, “Second-Level Reference Label Generation Rules”, https://www.icann.org/resources/pages/second-level-lgr-2015-06-21-en |
[RefLGR-Overview] | ICANN, “Reference Label Generation Rules (LGR) for the Second Level — Overview and Summary”, https://www.icann.org/sites/default/files/packages/lgr/lgr-second-level-overview-summary-25oct24-en.pdf |
[RZ-LGR] | ICANN, “Root Zone Label Generation Rules”, https://www.icann.org/resources/pages/root-zone-lgr-2015-06-21-en |
[SIL-Ethnologue] | David M. Eberhard, Gary F. Simons & Charles D. Fennig (eds.). 2021. Ethnologue: Languages of the World, Twenty fourth edition. Dallas, Texas: SIL International. Online version available as https://www.ethnologue.com |
[100] | ICANN, Maximal Starting Repertoire - MSR-5:Annotated Repertoire Tables, Non-CJK, Date: 2021-06-24, available as https://www.icann.org/en/system/files/files/msr-5-non-cjk-24jun21-en.pdf Code points cited are shown as excluded from the MSR |
[KK] | Kazakh Arabic alphabet, Wikipedia: Kazakh alphabets, https://en.wikipedia.org/wiki/Kazakh_alphabets Note: this alphabet cited as in official use in the Ili Kazakh Autonomous Prefecture of the Xinjiang Uyghur Autonomous Region in China. |
[KY] | Kyrgyz Arabic alphabet, Wikipedia: Kyrgyz Alphabets, https://en.wikipedia.org/wiki/Kyrgyz_alphabets (Note: this alphabet cited as in official use in Afghanistan, Pakistan and the People's Republic of China China) in the Kizilsu Kyrgyz Autonomous Prefecture, the Ili Kazakh Autonomous Prefecture of the Xinjiang Uyghur Autonomous Region. |
[AZ] | Azerbaijani Arabic alphabet, Wikipedia: Azerbaijani alphabet, https://en.wikipedia.org/wiki/Azerbaijani_alphabet Note: cited as in use with the Southern Azerbaijani language in Iran. The page uses U+063D throughout for samples, but does not list it in precomposed form in the alphabet |
[UG] | Uyghur Arabic alphabet, Wikipedia: Uyghur alphabets, https://en.wikipedia.org/wiki/Uyghur_alphabets Note: this alphabet cited as official and in widespread use in Xinjiang province of China. |