Unicode® 17.0.0 (Draft)
STATUS: This is a preliminary draft page for an upcoming release. Some details may be missing or incorrect, and some links may be wrong or broken. During the alpha review period, errors are expected and feedback is not necessary. During the beta review period, feedback on errors will be helpful and appreciated.
This page summarizes the important changes for the Unicode Standard, Version 17.0.0.
This version supersedes all previous versions of the Unicode Standard.
A. Summary
Unicode 17.0 adds NNNN characters,
for a total of 15N,NNN characters.
The new additions include NN new scripts:
New Data Files for Unicode 17.0
Synchronization
Several other important Unicode specifications have been updated for Version 17.0.
The following four Unicode Technical Standards are versioned in
synchrony with the Unicode Standard, because their data files cover the same repertoire.
All have been updated to Version 17.0:
Some of the changes in Version 17.0 and associated Unicode Technical Standards
may require modifications
to implementations. For more information, see the migration and modification sections of
UTS #10, UTS #39, UTS #46, and UTS #51.
See Sections D through H below for additional details regarding the changes in this version of
the Unicode Standard, its associated annexes, and the other synchronized Unicode specifications.
See the following resource links for general information about Unicode versions
and other information about the Unicode Standard and other publications
of the Unicode Consortium.
B. Technical Overview
Version 17.0 of the Unicode Standard consists of:
- The core specification
- The code charts (delta and archival) for this version
- The Unicode Standard Annexes
- The Unicode Character Database (UCD)
The core specification gives the general principles,
requirements for conformance, and guidelines for implementers. The
code charts show representative glyphs for all the Unicode
characters. The Unicode Standard Annexes supply detailed normative
information about particular aspects of the standard. The Unicode
Character Database supplies normative and informative data for
implementers to allow them to implement the Unicode Standard.
Core Specification
The core specification for Version 17.0 is available for browsing online
as per-chapter web pages. Because the full table of contents for the core specification is provided,
with interactive links, no separate bookmarks page is provided for this release,
nor are separate chapter links provided directly in this summary page
for the Unicode Standard.
Anchors for chapters, sections, tables, and figures in the core specification are
shown with the convention of a "#" in the left margin of the heading or caption.
Those anchors can be clicked on to provide custom bookmarks to any particular
portion of the text, down to the level of subsections. Numbering of sections has
been extended down to the subsection level, as well, to improve referenceabiity
of precise content.
The HTML version of the core specification is authoritative. However, for
convenience of reference, an archival
version of core specification is also available as
a single pdf. (NN MB)
Code Charts
Several sets of code charts are available. They serve different
purposes:
Chart Type |
Description |
Latest Code Charts |
These charts are always the most current
code charts available, and may be updated at any time. The charts are organized by
scripts and blocks for easy reference.
An online index by character name
is also provided. |
Delta Code Charts |
These charts show the
new blocks and any blocks in which characters were added
specifically for Unicode 17.0.0. The new characters and any major updates to the
representative glyphs are visually highlighted in these charts. |
Archival Code Charts |
These charts contain
the entire set of characters, names and representative glyphs at the time of publication of Unicode 17.0.0.
|
The delta and archival code charts are a stable part of this release of the Unicode Standard. They will never be updated.
Han Radical-Stroke Indices
There are a number of radical-stroke indices available to assist in
the lookup of Han ideographs in the code charts.
Index Type |
Description |
Interactive |
An interactive CJK character lookup page that supports lookup
either by code point or by radical and stroke values. |
IICore (3.8 MB) |
A static radical-stroke index PDF file limited to only the
IICore repertoire. (This RS index is seldom updated.) |
Unihan Core 2020 (8.2 MB) |
A static radical-stroke index PDF file limited to only the
Unihan
Core 2020 repertoire. (This RS index is seldom updated.) |
Complete (NN MB) |
A static radical-stroke index PDF file that covers the entire
CJK ideograph repertoire for Unicode 17.0. |
Complete |
A static data file that corresponds to the complete radical-stroke index for Unicode 17.0. |
The complete radical-stroke index is a stable part of this release of the Unicode Standard. It will never be updated.
Unicode Standard Annexes
STATUS: During the alpha review and beta review periods,
links to individual UAXes (or UTSes) point to the proposed update for that document, if any.
If no proposed update has been posted for the document, links point to the last published
version of the document, for reference.
Links to the individual Unicode Standard Annexes for this
version are available in
Section I, List of Components below. The summary list of significant changes
in the content of each Unicode Standard Annex for Version 17.0 can be found
in Section G, Changes in the Unicode Standard Annexes below.
Unicode Character Database
Data files
for Version 17.0 of
the Unicode Character Database are available. The ReadMe.txt in that directory provides a roadmap
to the functions of the various subdirectories.
Detailed documentation about the data files can be found in
UAX #44, Unicode Character Database.
Zipped versions of the UCD
for bulk download are available, as well.
Version References
Version 17.0.0 of the Unicode Standard
should be referenced as:
The Unicode Consortium. The Unicode Standard, Version 17.0.0, (South San Francisco: The Unicode Consortium,
2025. ISBN 978-1-936213-xx-x)
https://www.unicode.org/versions/Unicode17.0.0/
The terms “Version 17.0” or “Unicode 17.0” are abbreviations for the full version reference, Version 17.0.0.
The citation and permalink for the latest published version of the Unicode Standard is:
The Unicode Consortium. The Unicode Standard.
https://www.unicode.org/versions/latest/
A complete specification of the contributory files for Unicode
17.0 is found below in Section I, List of Components.
For examples of how to cite particular portions of the Unicode Standard, see also the Reference Examples.
Errata
Errata incorporated into Unicode 17.0 are listed by date in
a separate table. For corrigenda and errata after the release of Unicode 17.0, see the list of current
Updates and Errata.
C. Stability Policy Update
No significant updates to the Character Encoding Stability Policies have occurred in the interval since the last release of the Unicode Standard.
D. Textual Changes and Character Additions
Changes in the Unicode Standard Annexes are listed in Section G.
Character Assignment Overview
NNNN characters have been added.
Most character additions are in new blocks, but there are also character additions to a number of existing blocks. For details, see the delta code charts.
New Blocks
The following blocks are newly defined in Version 17.0:
Range |
Block Name |
NNNN..NNNN |
TBD |
E. Conformance Changes
There are no new conformance requirements for the core specification in Unicode 17.0.
F. Changes in the Unicode Character Database
The detailed listing of all changes to the contributory data files of the Unicode Character Database
for Version 17.0 can be found in
UAX #44, Unicode Character Database.
The changes listed there include character additions and property revisions to existing characters that will affect implementations.
Some of the important impacts on implementations migrating from earlier versions of the standard are highlighted in
Section M.
G. Changes in the Unicode Standard Annexes
In Version 17.0, some of the Unicode Standard Annexes have had significant revisions. The most important of these changes are listed below. For the full details of all changes, see the Modifications section
of each UAX, linked directly from the following list of UAXes.
Unicode Standard Annex |
Changes |
UAX #9 Unicode Bidirectional Algorithm
|
No significant changes in this version. |
UAX
#11 East Asian Width |
No significant changes in this version. |
UAX
#14 Unicode Line Breaking Algorithm |
No significant changes in this version. |
UAX
#15 Unicode Normalization Forms
|
No significant changes in this version. |
UAX
#24 Unicode Script Property
|
No significant changes in this version. |
UAX
#29 Unicode Text Segmentation |
No significant changes in this version. |
UAX
#31 Unicode Identifiers and Syntax
|
No significant changes in this version. |
UAX
#34 Unicode Named Character Sequences |
No significant changes in this version. |
UAX
#38 Unicode Han Database (Unihan) |
No significant changes in this version. |
UAX
#41 Common References for Unicode Standard Annexes |
All references were updated for Unicode 17.0. |
UAX
#42 Unicode Character Database in XML |
New code point attributes, values, and patterns were added for Unicode 17.0. |
UAX
#44
Unicode Character Database |
The documentation was updated to describe the changes to the UCD for
Version 17.0. |
UAX #45
U-Source Ideographs |
No significant changes in this version. |
UAX #50
Unicode Vertical Text Layout |
No significant changes in this version. |
UAX #53
Unicode Arabic Mark Rendering |
No significant changes in this version. |
UAX #57
Unicode Egyptian Hieroglyph Database (Unikemet) |
No significant changes in this version. |
H. Changes in Synchronized Unicode Technical Standards
There are also significant revisions in the Unicode Technical Standards whose
versions are synchronized with the Unicode Standard. The most important of these changes are listed below.
For the full details of all changes, see the Modifications section
of each UTS, linked directly from the following list of UTSes.
Unicode Technical Standard |
Changes |
UTS #10 Unicode Collation Algorithm |
No significant changes in this version. |
UTS #39 Unicode Security Mechanisms |
No significant changes in this version. |
UTS #46 Unicode IDNA Compatibility Processing |
No significant changes in this version. |
UTS #51 Unicode Emoji |
No significant changes in this version. |
I. List of Components
This section lists the components of Version 17.0.0 of the Unicode
Standard. The version numbering and the role of each component are
explained in
Versions of The Unicode Standard.
M. Implications for Migration
There are a significant number of changes in Unicode 17.0 which may impact implementations upgrading
to Version 17.0 from earlier versions of the standard. The most important of these are listed
and explained here, to help focus on the issues most likely to cause unexpected trouble during upgrades.
Core Specification Changes
TBD
Script-related Changes
There are NN new scripts encoded in Unicode 17.0. Some of these scripts,
such as xxx, have complex layout.
TBD
General Character Property Issues
Normalization Behavior
TBD
Segmentation
TBD
Numeric Property Issues
TBD
CJK/Unihan Changes
See UAX
#38, Unicode Han Database (Unihan) for further details on these changes,
especially TBD:
Standardized Variation Sequences
UTS #46 (IDNA) Changes
Changes to Code Charts
Collation-related Changes
TBD
Emoji Changes
For details about emoji changes, see the Unicode 17.0 emoji charts and Emoji Recently Added, v17.0.