Enabling International Domain Names, Expanding Worldwide
Accessibility, and Reducing the Digital Divide
Major Enhancements to the Unicode Standard
Mountain View, CA, August 27, 2003 -- The Unicode® Consortium and
Addison-Wesley announce publication of Version 4.0 of the Unicode
Standard. Unicode is the fundamental specification for the
representation of text, at the core of all modern software,
programming languages, and standards, including Windows, Java, C#,
Perl, XML, HTML, DB2, Oracle, and many others.
Unicode is also central to the new internationalized domain
names, which allow everyone in the world to have URLs in their own
languages. This is yet another case where Unicode opens the door to
more of the world’s different cultures, helping to break down the
digital divide.
Version 4.0 strengthens Unicode support for worldwide
communication, software availability, and publishing. The text has
been extensively rewritten, and incorporates specifications that
were previously only available as separate documents. The clarified
specification of conformance requirements incorporates the most
highly developed character encoding model in existence, encompassing
the wide variety of types of characters needed by the world’s
languages, and permitting compatibility with all modern computer
architectures.
Record-breaking character content
Version 4.0 encodes over 96,000 characters, twice as many as
Version 3.0, and includes two record-breaking collections of encoded
characters. The largest encoded character collection for Chinese
characters in the history of computing has doubled in size yet again
to encompass over 2000 years of Chinese, Japanese, Korean, and
Vietnamese literary usage, including all the main classical
dictionaries of these languages. Version 4.0 also encodes the
largest set of characters for mathematical and technical publishing
in existence. The character repertoires of Version 4.0 and
International Standard ISO/IEC 10646 are fully synchronized.
Reducing the digital divide
To meet the needs of all linguistic communities, the Unicode
Standard and associated standards are continually being extended,
not only in terms of the addition of characters, but also in
specifying *how* those characters work, such as:
- how text sorts or matches in different languages
- how text behaves for East Asian languages (e.g. vertically) or
in Middle Eastern languages (from right to left)
- how text should upper- or lowercase
- how text breaks into lines or words
- how text behaves in Regular Expressions (a key tool used in a
vast number of web servers)
Small linguistic communities all over the world have the
opportunity to get mainstream software working right out of the box,
instead of waiting years for special adaptations that may never
come.
For more information on the scripts encoded in the Unicode
Standard, see
http://www.unicode.org/versions/Unicode4.0.0/
Version 4.0 is published by Addison-Wesley (ISBN 0-321-18578-1),
and is available from the Unicode Consortium or through the book
trade. The text and code charts of Version 4.0 are also available on
the Consortium’s Web site
www.unicode.org.
About the Unicode Consortium
The Unicode Consortium is a non-profit organization founded to
develop, extend and promote use of the Unicode Standard, which
specifies the representation of text in modern software products and
standards.
Members of the Consortium are a broad spectrum of corporations
and organizations in the computer and information technology
industry. Full members are: Adobe Systems, Apple Computer, Basis
Technology, Government of India (Ministry of Information
Technology), Government of Pakistan (National Language Authority),
HP, IBM, Justsystem, Microsoft, Oracle, PeopleSoft, RLG, SAP, Sun
Microsystems, and Sybase.
Membership in the Unicode Consortium is open to organizations and
individuals anywhere in the world who support the Unicode Standard
and wish to assist in its extension and implementation.
For additional information on Unicode, please contact the Unicode
Consortium 650-693-3921
About Addison-Wesley
Addison-Wesley (www.awprofessional.com) is the leading publisher
of quality computer science and engineering books and software for
technical professionals, developed and authored by the world's
leading technology experts. It is a unit of Pearson Technology
Group, the world's largest provider of consumer and professional
computer, information technology, engineering and reference content.
Pearson Technology Group is an operating unit of Pearson Education,
the world's leading educational publisher.
Pearson Education is part of Pearson plc (NYSE: PSO), the
international media company.
For more information on The Unicode Standard, Version 4.0, see:
http://www.awprofessional.com/titles/0321185781
Corporate Sales and Press Information:
Heather Mullane
617/848-6531
heather.mullane@aw.com