PurposeDescribe the use of Unicode as infrastructure for shareable Japanese character data support on a very large database engine. DescriptionThe original Teradata data base system was the first commercial database engine based upon the client-server model. It used cutting edge technology for the time. Heterogeneous ASCII and EBCDIC clients could share character data which was stored on the server utilizing an extended form of ASCII. As time passed, there has been continuous updating of both the hardware and the software, and significant inroads have been made in the area of internationalization, including support for Western Europe and Japan. However, much of the infrastructure remains from the original system. NCR has approved an upgrade of this infrastructure, converting aging Pascal code to C, and making Unicode support intrinsic within the database engine. These two efforts have proceeded independently, but writing Unicode support in Pascal is quite challenging. Therefore, the decision was made to implement Unicode inside, rather than throughout, the database engine, until the two projects merge. Meanwhile, the difficulties of providing Unicode support are exacerbated by an inadequate and ambiguous SQL internationalization model. Still, extensions to the SQL model, innovative use of previous Japanese support, entirely new translation and collation processing, and a new Unicode knowledgeable hash function for dividing the workload amongst the many processors, provides significant heterogeneous multilingual support. ConclusionEven in the situation where external Unicode support is not feasible, the use of internal Unicode infrastructure provides many benefits to the users of a system. The new C based parser now available should allow us to complete the transition to Unicode throughout the system. |
When the world wants to talk, it speaks Unicode |
International Unicode Conferences are organized by Global Meeting Services, Inc., (GMS).
GMS is pleased to be able to offer the International Unicode Conferences under an exclusive
license granted by the Unicode Consortium. All responsibility for conference finances and
operations is borne by GMS. The independent conference board serves solely at the pleasure
of GMS and is composed of volunteers active in Unicode and in international software
development. All inquiries regarding International Unicode Conferences should be addressed
to info@global-conference.com. Unicode and the Unicode logo are registered trademarks of Unicode, Inc. Used with permission. 26 January 1999, Webmaster |