Publications may be listed more than once under different
headings.
[Bidi] |
UAX #9: Unicode Bidirectional
Algorithm
http://www.unicode.org/reports/tr9/ |
[Blocks] |
Blocks data file
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/Blocks.txt
For the 5.1.0 version, see:
http://www.unicode.org/Public/5.1.0/ucd/Blocks.txt |
[Boundaries] |
UAX #29: Unicode Text Segmentation
http://www.unicode.org/reports/tr29/ |
[Charts] |
Online Code Charts
http://www.unicode.org/charts/
An index to character names with links to the corresponding chart is
found at
http://www.unicode.org/charts/charindex.html
|
[Charts14] |
Charts for the test files
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/auxiliary/LineBreakTest.html
For the 5.1.0 version, see:
http://www.unicode.org/Public/5.1.0/ucd/auxiliary/LineBreakTest.html |
[Charts15] |
Normalization Charts
http://www.unicode.org/reports/tr15/charts |
[Charts29] |
Charts for the test files
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/auxiliary/GraphemeBreakTest.html
http://www.unicode.org/Public/UNIDATA/auxiliary/WordBreakTest.html
http://www.unicode.org/Public/UNIDATA/auxiliary/SentenceBreakTest.html
For the 5.1.0 version, see:
http://www.unicode.org/Public/5.1.0/ucd/auxiliary/GraphemeBreakTest.html
http://www.unicode.org/Public/5.1.0/ucd/auxiliary/WordBreakTest.html
http://www.unicode.org/Public/5.1.0/ucd/auxiliary/SentenceBreakTest.html |
[CLDR] |
Common Locale Data Repository
http://www.unicode.org/cldr/ |
[Code9] |
Reference code implementing Unicode Bidirectional
Algorithm
For the original verified C/C++ reference implementation, see:
http://www.unicode.org/reports/tr9/BidiReferenceCpp/
For the original verified Java reference implementation, see:
http://www.unicode.org/reports/tr9/BidiReferenceJava/
For updates to the C/C++ sample code, see:
http://www.unicode.org/Public/PROGRAMS/BidiReferenceCpp/ |
[Code14] |
Sample code implementing Unicode Line Breaking Algorithm using a pair table
http://www.unicode.org/Public/PROGRAMS/LineBreakSampleCpp/
Contains the code samples shown in UAX #14 together with driver
code. |
[Collation] |
UTS #10: Unicode Collation Algorithm
(UCA)
http://www.unicode.org/reports/tr10/ |
[Corrections] |
Normalization Corrections
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/NormalizationCorrections.txt
For the 5.1.0 version, see:
http://www.unicode.org/Public/5.1.0/ucd/NormalizationCorrections.txt |
[Corrigendum1] |
Corrigendum #1: UTF-8 Shortest Form
http://www.unicode.org/versions/corrigendum1.html |
[Corrigendum2] |
Corrigendum #2: Yod with Hiriq Normalization
http://www.unicode.org/versions/corrigendum2.html |
[Corrigendum3] |
Corrigendum #3: U+F951 Normalization
http://www.unicode.org/versions/corrigendum3.html |
[Corrigendum4] |
Corrigendum #4: Five CJK Canonical Mapping Errors
http://www.unicode.org/versions/corrigendum4.html |
[Corrigendum5] |
Corrigendum #5: Normalization Idempotency
http://www.unicode.org/versions/corrigendum5.html |
[Data9] |
Bidi Mirroring
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/BidiMirroring.txt
For the 5.1.0 version, see:
http://www.unicode.org/Public/5.1.0/ucd/BidiMirroring.txt |
[Data11] |
East Asian Width property data file
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/EastAsianWidth.txt
For the 5.1.0 version, see:
http://www.unicode.org/Public/5.1.0/ucd/EastAsianWidth.txt |
[Data14] |
Line Break property data file
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/LineBreak.txt
For the 5.1.0 version, see:
http://www.unicode.org/Public/5.1.0/ucd/LineBreak.txt |
[Data24] |
Scripts data file
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/Scripts.txt
For the 5.1.0 version, see:
http://www.unicode.org/Public/5.1.0/ucd/Scripts.txt |
[Data34] |
Named Sequences data file
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/NamedSequences.txt
For the 5.1.0 version, see:
http://www.unicode.org/Public/5.1.0/ucd/NamedSequences.txt |
[DataProv] |
Provisional Named Sequences data file
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/NamedSequencesProv.txt
For the 5.1.0 version, see:
http://www.unicode.org/Public/5.1.0/ucd/NamedSequencesProv.txt |
[DerivedBIDI] |
Derived Bidi Properties
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/extracted/DerivedBidiClass.txt
For the 5.1.0 version, see:
http://www.unicode.org/Public/5.1.0/ucd/extracted/DerivedBidiClass.txt |
[EAW] |
UAX #11: East Asian Width
http://www.unicode.org/reports/tr11/ |
[Errata] |
Updates and Errata
http://www.unicode.org/errata |
[Exclusions] |
Composition Exclusion Table
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/CompositionExclusions.txt
For the 5.1.0 version, see:
http://www.unicode.org/Public/5.1.0/ucd/CompositionExclusions.txt |
[FAQ] |
Unicode Frequently Asked Questions
http://www.unicode.org/faq/
For answers to common questions on technical issues. |
[Feedback] |
Reporting Form
http://www.unicode.org/reporting.html
For reporting errors and requesting information online. |
[Glossary] |
Unicode Glossary
http://www.unicode.org/glossary/
For explanations of terminology used in this and other documents. |
[HangulST] |
Hangul Syllable Types
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/HangulSyllableType.txt
For the 5.1.0 version, see:
http://www.unicode.org/Public/5.1.0/ucd/HangulSyllableType.txt |
[LineBreak] |
UAX #14: Unicode Line Breaking Algorithm
http://www.unicode.org/reports/tr14/ |
[NormProps] |
Derived Normalization Properties
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/DerivedNormalizationProps.txt
For the 5.1.0 version, see:
http://www.unicode.org/Public/5.1.0/ucd/DerivedNormalizationProps.txt |
[Policies] |
Unicode Policies
http://www.unicode.org/policies/policies.html |
[Props] |
Property Data:
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/auxiliary/GraphemeBreakProperty.txt
http://www.unicode.org/Public/UNIDATA/auxiliary/WordBreakProperty.txt
http://www.unicode.org/Public/UNIDATA/auxiliary/SentenceBreakProperty.txt
For the 5.1.0 version, see:
http://www.unicode.org/Public/5.1.0/ucd/auxiliary/GraphemeBreakProperty.txt
http://www.unicode.org/Public/5.1.0/ucd/auxiliary/WordBreakProperty.txt
http://www.unicode.org/Public/5.1.0/ucd/auxiliary/SentenceBreakProperty.txt |
[PropValue] |
Property Value Aliases data file
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/PropertyValueAliases.txt
For the 5.1.0 version, see:
http://www.unicode.org/Public/5.1.0/ucd/PropertyValueAliases.txt |
[RegEx] |
UTS #18: Unicode Regular Expressions
http://www.unicode.org/reports/tr18/ |
[Reports] |
Unicode Technical Reports
http://www.unicode.org/reports/
For information on the status and development process for technical reports, and for a list of technical reports. |
[Sample] |
Sample Normalizer code
http://www.unicode.org/reports/tr15/Normalizer.html |
[Security] |
UTR #36:
Security Considerations for the Implementation of Unicode and Related
Technology
http://www.unicode.org/reports/tr36/ |
[Stability] |
Unicode Consortium Stability Policies
http://www.unicode.org/policies/stability_policy.html |
[Tests14] |
Test data:
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/auxiliary/LineBreakTest.txt
For the 5.1.0 version, see:
http://www.unicode.org/Public/5.1.0/ucd/auxiliary/LineBreakTest.txt |
[Tests15] |
Normalization Conformance Test
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/NormalizationTest.txt
For the 5.1.0 version, see:
http://www.unicode.org/Public/5.1.0/ucd/NormalizationTest.txt |
[Tests29] |
Test data:
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/auxiliary/GraphemeBreakTest.txt
http://www.unicode.org/Public/UNIDATA/auxiliary/WordBreakTest.txt
http://www.unicode.org/Public/UNIDATA/auxiliary/SentenceBreakTest.txt
For the 5.1.0 version, see:
http://www.unicode.org/Public/5.1.0/ucd/auxiliary/GraphemeBreakTest.txt
http://www.unicode.org/Public/5.1.0/ucd/auxiliary/WordBreakTest.txt
http://www.unicode.org/Public/5.1.0/ucd/auxiliary/SentenceBreakTest.txt |
[UAX9] |
UAX #9: Unicode Bidirectional Algorithm
http://www.unicode.org/reports/tr9/ |
[UAX11] |
UAX #11: East Asian Width
http://www.unicode.org/reports/tr11/ |
[UAX14] |
UAX #14: Unicode Line Breaking Algorithm
http://www.unicode.org/reports/tr14/ |
[UAX15] |
UAX #15: Unicode Normalization Forms
http://www.unicode.org/reports/tr15/ |
[UAX24] |
UAX #24: Unicode Script Property
http://www.unicode.org/reports/tr24/ |
[UAX29] |
UAX #29: Unicode Text Segmentation
http://www.unicode.org/reports/tr29/ |
[UAX31] |
UAX #31: Unicode Identifier and Pattern Syntax
http://www.unicode.org/reports/tr31/ |
[UAX34] |
UAX #34: Unicode Named Character Sequences
http://www.unicode.org/reports/tr34/ |
[UAX38] |
UAX #38: Unicode Han Database (Unihan)
http://www.unicode.org/reports/tr38/ |
[UAX41] |
UAX #41: Common References for Unicode Standard Annexes
http://www.unicode.org/reports/tr41/ |
[UAX42] |
UAX #42: Unicode Character Database in XML
http://www.unicode.org/reports/tr42/ |
[UAX44] |
UAX #44: Unicode Character Database
http://www.unicode.org/reports/tr44/ |
[UCA] |
UTS #10: Unicode Collation Algorithm
http://www.unicode.org/reports/tr10/ |
[UCD] |
Unicode Character Database
http://www.unicode.org/ucd/
For an overview of the Unicode Character Database and a list of its
associated files, see:
http://www.unicode.org/Public/UNIDATA/UCD.html |
[UCDDoc] |
Unicode Character Database Documentation
http://www.unicode.org/Public/UNIDATA/UCD.html |
[Unicode] |
The Unicode Standard
For the latest version, see:
http://www.unicode.org/versions/latest/
For the 5.1.0 version, see:
http://www.unicode.org/versions/Unicode5.1.0/ |
[Unicode3.0] |
The Unicode Consortium. The Unicode Standard, Version 3.0
(Reading, MA, Addison-Wesley, 2000. ISBN 0-201-61633-5). |
[Unicode3.1] |
The Unicode Consortium. The Unicode
Standard, Version 3.1.0, defined by: The Unicode Standard, Version
3.0 (Reading, MA, Addison-Wesley, 2000. ISBN 0-201-61633-5), as
amended by the Unicode Standard Annex #27: Unicode 3.1
http://www.unicode.org/reports/tr27/ |
[Unicode3.2] |
The Unicode Consortium. The Unicode
Standard, Version 3.2.0, defined by: The Unicode Standard, Version 3.0
(Reading, MA, Addison-Wesley, 2000. ISBN 0-201-61633-5), as amended by
the Unicode Standard Annex #27: Unicode 3.1 and the Unicode
Standard Annex #28: Unicode 3.2
http://www.unicode.org/reports/tr28/ |
[Unicode4.0] |
The Unicode Consortium.
The Unicode Standard, Version 4.0
(Boston, MA, Addison-Wesley, 2003. ISBN 0-321-18578-1). |
[Unicode4.0.1] |
The Unicode Consortium. The Unicode Standard, Version 4.0.1, defined by:
The Unicode Standard, Version 4.0 (Boston, MA, Addison-Wesley, 2003. ISBN
0-321-18578-1), as amended by
Unicode 4.0.1
http://www.unicode.org/versions/Unicode4.0.1/ |
[Unicode4.1] |
The Unicode Consortium. The Unicode Standard, Version 4.1.0, defined by:
The Unicode Standard, Version 4.0
(Boston, MA, Addison-Wesley, 2003. ISBN 0-321-18578-1), as amended by
Unicode 4.0.1 and by
Unicode 4.1.0
http://www.unicode.org/versions/Unicode4.1.0/ |
[Unicode5.0] |
The Unicode Consortium.
The Unicode Standard, Version
5.0 (Boston, MA, Addison-Wesley, 2007. ISBN 0-321-48091-0). |
[Unicode5.1] |
The Unicode Consortium. The Unicode Standard, Version 5.1.0, defined by: The Unicode Standard, Version 5.0
(Boston, MA, Addison-Wesley, 2007. ISBN 0-321-48091-0), as amended by
Unicode 5.1.0 |
[UTC] |
Unicode Technical Committee
http://www.unicode.org/consortium/utc.html |
[UTN5] |
UTN #5: Canonical Equivalences in Applications
http://www.unicode.org/notes/tn5 |
[UTR17] |
UTR #17: Unicode Character Encoding Model
http://www.unicode.org/reports/tr17/ |
[UTR20] |
UTR #20: Unicode in XML and other Markup Languages
http://www.unicode.org/reports/tr20/ |
[UTR23] |
UTR #23: The Unicode Character Property Model
http://www.unicode.org/reports/tr23/ |
[UTR25] |
UTR #25: Unicode Support for Mathematics
http://www.unicode.org/reports/tr25/ |
[UTR33] |
UTR #33: Unicode Conformance Model
http://www.unicode.org/reports/tr33/ |
[UTR36] |
UTR #36: Unicode Security Considerations
http://www.unicode.org/reports/tr36/ |
[UTS6] |
UTS #6: A Standard Compression Scheme
for Unicode
http://www.unicode.org/reports/tr6/ |
[UTS10] |
UTS #10: Unicode Collation Algorithm
(UCA)
http://www.unicode.org/reports/tr10/ |
[UTS18] |
UTS #18: Unicode Regular Expressions
http://www.unicode.org/reports/tr18/ |
[UTS22] |
UTS #22: Unicode Character Mapping Markup Language
http://www.unicode.org/reports/tr22/ |
[UTS35] |
UTS #35: Unicode Locale Data Markup Language (LDML)
http://www.unicode.org/reports/tr35/ |
[UTS37] |
UTS #37: Unicode Ideographic Variation Database
http://www.unicode.org/reports/tr37/ |
[UTS39] |
UTS #39: Unicode Security Mechanisms
http://www.unicode.org/reports/tr39/ |
[Versions] |
Versions of the Unicode Standard
http://www.unicode.org/versions/
For information on version numbering, and citing and referencing the Unicode Standard,
the Unicode Character Database, and Unicode Technical Reports. |
[Cedar97] |
Cy Cedar, David Veintimilla, Michel Suignard, and Asmus
Freytag, Report from the Trenches: Microsoft Publisher goes Unicode. Proceedings
of the Eleventh International Unicode Conference, San Jose, CA, 1997. |
[CharLint] |
Charlint—A Character Normalization Tool
http://www.w3.org/International/charlint/ |
[CharMod] |
W3C Character Model for the World Wide Web
http://www.w3.org/TR/charmod/ |
[CharReq] |
W3C Requirements for String Identity Matching and String
Indexing
http://www.w3.org/TR/WD-charreq |
[Knuth78] |
Donald E. Knuth and Michael F. Plass, Breaking Paragraphs
into Lines, republished in Digital Typography, CSLI 78
(Stanford, California: CSLI Publications 1999). |
[Suign98] |
Michel Suignard, Worldwide Typography and How to Apply
JIS X 4051-1995 to Unicode. Proceedings of the Twelfth International
Unicode/ISO 10646 Conference, Tokyo, Japan, 1998. |
[TEX] |
Donald E. Knuth, TeX: The Program,
Volume B of Computers & Typesetting (Reading, MA,
Addison-Wesley, 1986). |