This content is not current
You are viewing a previous year’s website. To visit the current UTW site, click here.
End-to-end
Education
Internationalization
Standards
Workshop 1: Introduction to Unicode & Beyond
Michael McKenna, Tex Texin, Craig Cummings
This tutorial will give you the knowledge for correct implementation for using Unicode to process text in any language. Unicode is the text encoding standard covering every major language on the planet.
Taught by software internationalization experts, this tutorial will introduce you to the key principles of Unicode, its design and architecture, and provide you with examples of real-world implementation. Attendees will come away with a basic knowledge of Unicode and how to be more effective at processing, handling, and debugging multilingual text content.
The modules of the tutorial will cover:
- Why is the Unicode standard necessary? What problems does it solve?
- How computers work with text: Introduction to glyphs, character sets, and encodings
- Unicode Standard Specification and Related Data and Content
- Principles of Unicode’s Design
- Components of the Unicode Standard
- Encoding Forms, Behavior, Technical Reports, Database
- How to Use the Unicode Standard
- International Components for Unicode (ICU)
- Unicode Implementation Details and Recommendations
- The Unicode Consortium umbrella of Unicode, that is CLDR, ICU, and more
- Unicode Implementation Details and Recommendations
- Attributes, Compatibility, Non-spacing Characters, Directionality, Normalization, Graphemes, Complex Scripts, Surrogates, Collation, Regular Expressions and More
- Unicode and the Real World
- Support for Unicode in Software Platforms
- How Unicode is Evolving”