
ALPHA REVIEW Unicode® 18.0.0

The next version of the Unicode Standard will be Version 18.0.0, planned for release on September 15, 2026. Unicode 18.0.0 will include the publication of one new Unicode Standard Annex and one new version-synchronized Unicode Technical Standard, as well as updates to several other Unicode Standard Annexes and Unicode Technical Standards.

This version will include significant new repertoire additions. The planned repertoire adds a total of 13,048 new characters. Note, however, that during the alpha review period, the repertoire is not yet frozen. The main goal of the alpha review is to ensure that all repertoire, including character names and glyphs, is correct and appropriate for the final release.

An alpha version of the 18.0.0 Unicode Character Database files is available for public review. We strongly encourage implementers to review the summary description, download the alpha 18.0.0 Unicode Character Database files, and test their programs with the new data, well before the end of the alpha period. It is especially important to review the Notable Issues for Alpha Reviewers.

We encourage users to check the code charts carefully to verify correctness of the new characters added to Unicode 18.0.0 and to ensure that there are no regressions in glyph shapes for previously encoded characters.

Unicode 18.0.0 summary description (draft)
Readme.txt for alpha charts
Single-block delta charts with yellow highlighting for new characters
Full code charts — single download (180 MB)
Unihan radical-stroke index — single download (35 MB)
Emoji 18.0 list for alpha review (See PRI #537)
Unicode character database (UCD)

Unicode Standard Annexes (proposed updates)

If an annex is not listed, no proposed update is available for review yet. This situation may occur when no significant change is planned for that annex for a particular release.

Related Unicode Technical Standards (proposed updates)

In addition to the Unicode Standard proper, several Unicode Technical Standards have significant text and data file updates that are correlated with the new additions for Unicode 18.0.0. Review of that text and data is also encouraged during the alpha review period.

Note: Links to data files are inactivated if the data files have not yet been prepared during the alpha review period. Links to the Unicode Technical Standards point to the latest published version, if no proposed update has been posted yet for Unicode 18.0.0.

Review and Feedback

For guidance on how to focus your review, see the section Notable Issues for Alpha Reviewers.

Any feedback should be reported using the contact form. Comments on the Unicode Standard Version 18.0.0 or the Unicode Character Database data files should refer to the alpha review Public Review Issue #536. Comments on specific Version 18.0.0 UAXes and UTSes should refer to the respective Public Review Issue Numbers for each document, where available.

The comment period ends March 31, 2026. All substantive technical comments must be received by that date to be considered at the April UTC meeting. Editorial comments (typos, etc.) may still be submitted after that date for consideration in further editorial work.

Note: All alpha files may be updated, replaced, or superseded by other files at any time. The alpha files will be discarded once Unicode 18.0.0 is final. It is inappropriate to cite these files as other than a work in progress. No products or implementations should be released based on the alpha UCD data files—use only the final, approved Version 18.0.0 data files, expected on September 15, 2026.

The Unicode Consortium provides early access to updated versions of the data files and text to give reviewers and developers as much time as possible to ensure a problem-free adoption of Version 18.0.0.

One of the main purposes of the alpha review period is to verify and correct the preliminary character property assignments in the Unicode Character Database. Reviewers should check for property changes to existing Unicode 17.0.0 characters, as well as the property values for the new Unicode 18.0.0 character additions.
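As a starting point for that kind of check, the following minimal sketch compares two UnicodeData.txt-style texts and reports which code points were added and which existing code points have different field values. The function names are hypothetical, and the sketch assumes the usual semicolon-separated layout with the code point in the first field. It treats range markers (the First/Last line pairs) as ordinary lines and does not look at any other UCD file.

```python
def parse_unicode_data(text):
    """Map each code point to a tuple of its remaining fields."""
    records = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        fields = line.split(";")
        records[int(fields[0], 16)] = tuple(fields[1:])
    return records


def diff_unicode_data(old_text, new_text):
    """Return (added, changed) code points between two versions."""
    old = parse_unicode_data(old_text)
    new = parse_unicode_data(new_text)
    added = sorted(cp for cp in new if cp not in old)
    changed = sorted(cp for cp in new if cp in old and new[cp] != old[cp])
    return added, changed
```

To use it, read the released 17.0.0 file and the alpha 18.0.0 file into strings and pass them as old_text and new_text. Anything reported in changed is a property change to an existing character and deserves a closer look.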

Notable Issues for Alpha Reviewers

The focus of this alpha review is the new repertoire planned for encoding. Reviewers should concentrate on the new repertoire shown in the code charts, verifying that it is appropriate and complete, and that new character names and glyphs are correct.

This list of notable issues briefly mentions other aspects of the 18.0 release, to provide more context. Further details regarding updates to annexes, UTSes, and data files will be provided during beta review.

Changes to Unicode Standard Annexes

Some of the Unicode Standard Annexes have modifications for Unicode 18.0.0, often in coordination with changes to character properties.

See the Modifications section of each Annex for details of the relevant changes.

Core Specification Update

The alpha review draft core specification is available as per-chapter web pages.

Reviewers should carefully check for inadvertent changes in the text, in particular in glyph examples.

The text still contains a number of editor's notes, indicating both general information for reviewers and spots in the text that are not yet complete for Unicode 18.0. Please use those notes as guidance, as there is no need for repeated feedback reports regarding omissions or defects that the editors already know about and are actively working on.

Script-specific Issues

There are four new scripts encoded in Unicode 18.0. Jurchen and Seal are both large ideographic scripts. Seal, in particular, which is encoded in one large block containing 11,328 characters, should be checked carefully for any potential errors.

Numeric Property Issues

  • There is one new set of decimal digits added in Unicode 18.0, for the newly encoded script Chisoi. Implementations of numeric values and numeric formatting should take this new set into account.
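One way to see which digit sets an implementation needs to handle is to derive them from the data. The sketch below is illustrative only: it scans UnicodeData.txt-style text for characters with General_Category Nd and returns the code point of each zero digit that starts a complete, contiguous run of the values 0 through 9. The function name is hypothetical, and the field positions (General_Category in the third field, decimal value in the seventh) are assumed from the standard UnicodeData.txt layout.

```python
def decimal_digit_sets(text):
    """Return the code points of the zero digits that start a full 0-9 run."""
    digits = {}
    for line in text.splitlines():
        fields = line.strip().split(";")
        # Only decimal digits (gc=Nd) carry a value in the decimal field.
        if len(fields) < 7 or fields[2] != "Nd" or not fields[6]:
            continue
        digits[int(fields[0], 16)] = int(fields[6])
    zeros = [cp for cp, value in sorted(digits.items()) if value == 0]
    # Keep a zero only if the nine following code points have values 1..9.
    return [z for z in zeros if all(digits.get(z + i) == i for i in range(10))]
```

Running this on the 17.0.0 and the alpha 18.0.0 files and comparing the two results should surface any newly added digit set without hard-coding its code points.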

Unihan-related Issues

All Unihan properties should be reviewed carefully.

See UAX #38 for details on changes to Unihan, especially Section 4.2, Listing by Date of Addition to the Unicode Standard, and Section 4.3, Listing by Location within Unihan.zip.

Standardized Variation Sequences

  • Nine variation sequences have been added, mostly for historical forms of some mathematical operators. These complement other symbolic additions for mathematical operators and other symbols to support representation of historical mathematical treatises and manuscripts.
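Reviewers who want to list the added sequences themselves could compare the old and new variation sequence data. The following sketch assumes the StandardizedVariants.txt line layout (base and variation selector in the first field, a description in the second, and comments after `#`); the function name is hypothetical.

```python
def parse_variation_sequences(text):
    """Map (base, selector) code point pairs to their descriptions."""
    sequences = {}
    for line in text.splitlines():
        data = line.split("#", 1)[0].strip()  # drop trailing comments
        if not data:
            continue
        fields = [field.strip() for field in data.split(";")]
        base, selector = (int(cp, 16) for cp in fields[0].split())
        sequences[(base, selector)] = fields[1]
    return sequences
```

Given the parsed results for both versions, `set(new) - set(old)` yields the sequences added in the alpha data.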

Code Charts

As always, careful review of the updated code charts for Version 18.0.0 is advised. Particular issues to take note of include:

Collation-related Issues

The Default Unicode Collation Element Table (DUCET) has not yet been updated for the Unicode 18.0.0 repertoire. A notice and data links will be added to this page if that table becomes available during the alpha review period.

IDNA-related Issues

The listing of the IDNA2008_Category property in Idna2008.txt has been updated to reflect the planned Unicode 18.0.0 repertoire. Implementers concerned with the stability of IDNA 2008 should check that data carefully to verify it meets their expectations.
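A small lookup helper can make such a check easier. The sketch below assumes that Idna2008.txt follows the common UCD convention of one code point or `start..end` range per line, followed by a semicolon, the property value, and an optional `#` comment. That layout is an assumption and should be confirmed against the file header; the function names are hypothetical.

```python
def parse_range_file(text):
    """Return a list of (start, end, value) tuples from range-style lines."""
    entries = []
    for line in text.splitlines():
        data = line.split("#", 1)[0].strip()  # drop trailing comments
        if not data:
            continue
        fields = [field.strip() for field in data.split(";")]
        first, _, last = fields[0].partition("..")
        start = int(first, 16)
        end = int(last, 16) if last else start
        entries.append((start, end, fields[1]))
    return entries


def lookup(entries, code_point):
    """Return the value covering code_point, or None if it is not listed."""
    for start, end, value in entries:
        if start <= code_point <= end:
            return value
    return None
```

Looking up the same code points in the parsed 17.0.0 and alpha 18.0.0 data and comparing the results is one way to spot unexpected category changes.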

New Data Files

There are two new data files in the UCD for Version 18.0.0:

  • JurchenSources.txt
  • SealSources.txt

General Issues

For current proposed updates to the particular UAXes, see Proposed Updates for Standard Annexes. Particular issues in the UAXes may also be the focus of specific Public Review Issues. Each proposed textual change in a UAX is highlighted, so that you can focus your review on those sections if you have limited time. The changes are also listed in detail in the Modifications sections (linked from the table of contents of each document), and are summarized in UAX changes, so you can check on those areas that might be of most interest.

Some links between alpha documents and the proposed updates for UAXes will not work correctly during the alpha review period. This is a known problem which does not need to be reported, as such links point to the eventual final names or revision numbers for the released versions.

Note that all links to versioned data files on this alpha review page use "/Public/18.0.0/" paths. During alpha review, those links are redirected to the actual development directory at "/Public/draft/". Once Unicode 18.0.0 has been released, the same links will point to the final permalinks for Unicode 18.0.0 data. This keeps the alpha review page for Unicode 18.0.0 from pointing to draft data directories that will later be used for the development of future releases.

Stability

Certain character properties for newly assigned characters cannot be changed after the formal release of each version of the standard, because of the Character Encoding Stability Policy. Such character property values need special attention during the alpha and beta review process, as they cannot be corrected after publication. These include:

  • Any property affecting Unicode Normalization, including Decomposition_Mapping, Canonical_Combining_Class, and Composition_Exclusion.
  • The determination of whether a character is included in identifiers (XID_Start, XID_Continue).
  • Case foldings.
  • There are also strong constraints on additions and changes to case mappings.
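To focus review on the first item in this list, the newly added characters can be filtered for the ones that carry normalization-relevant data. The sketch below is a rough aid, not a complete check: it inspects only the Canonical_Combining_Class and Decomposition_Mapping fields of UnicodeData.txt-style lines (assumed to be the fourth and sixth fields) and does not cover composition exclusions, identifier properties, or case folding. The function name and the new_code_points parameter are hypothetical.

```python
def normalization_sensitive(text, new_code_points):
    """List (code point, ccc, decomposition) for new characters that have a
    nonzero combining class or a non-empty decomposition mapping."""
    flagged = []
    for line in text.splitlines():
        fields = line.strip().split(";")
        if len(fields) < 6:
            continue
        code_point = int(fields[0], 16)
        if code_point not in new_code_points:
            continue
        ccc = int(fields[3])
        decomposition = fields[5]
        if ccc != 0 or decomposition:
            flagged.append((code_point, ccc, decomposition))
    return flagged
```

The set of new code points can come from any comparison of the 17.0.0 and alpha 18.0.0 data. Every entry returned here has values that cannot be corrected after release, so each one merits individual verification.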

 

