Re: UTF-8 format

From: Markus Kuhn (Markus.Kuhn@cl.cam.ac.uk)
Date: Tue Aug 18 1998 - 04:25:01 EDT


JATIN B KHANDELWAL wrote on 1998-08-18 00:15 UTC:
> Need information on UTF-8 format

Read

 ftp://ftp.informatik.uni-erlangen.de/pub/doc/ISO/charsets/utf-8.c
 ftp://ftp.informatik.uni-erlangen.de/pub/doc/ISO/charsets/ISO-10646-UTF-8.html

> and conversion tables with respect to
> Unicode 2.0, EUC, JIS and Shift-JIS.

Once you have understood UTF-8 by reading the above documents, you will
no longer be interested in UTF-8 <-> Unicode conversion tables. The
conversion is a trivial algorithm; no table is required. For EUC, JIS and
Shift-JIS, you use the normal Unicode conversion tables for those
encodings and then apply the UTF-8 algorithm to the result.
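
To make the "trivial algorithm" concrete, here is a minimal sketch in
Python. It is not taken from the utf-8.c file referenced above. The names
utf8_encode and legacy_to_utf8 are made up for illustration. The sketch
assumes the range later fixed by RFC 3629 (at most 4 bytes, up to
U+10FFFF, no surrogates), whereas the ISO 10646 definition current at the
time allowed longer sequences. Python's bundled codecs stand in for the
"normal Unicode conversion tables".

```python
def utf8_encode(cp: int) -> bytes:
    """Encode a single Unicode code point as UTF-8 using only bit shifts.

    Assumption: RFC 3629 limits (U+0000..U+10FFFF, surrogates excluded).
    """
    if cp < 0 or cp > 0x10FFFF or 0xD800 <= cp <= 0xDFFF:
        raise ValueError(f"not a Unicode scalar value: {cp:#x}")
    if cp < 0x80:
        # 1 byte:  0xxxxxxx
        return bytes([cp])
    if cp < 0x800:
        # 2 bytes: 110xxxxx 10xxxxxx
        return bytes([0xC0 | (cp >> 6),
                      0x80 | (cp & 0x3F)])
    if cp < 0x10000:
        # 3 bytes: 1110xxxx 10xxxxxx 10xxxxxx
        return bytes([0xE0 | (cp >> 12),
                      0x80 | ((cp >> 6) & 0x3F),
                      0x80 | (cp & 0x3F)])
    # 4 bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
    return bytes([0xF0 | (cp >> 18),
                  0x80 | ((cp >> 12) & 0x3F),
                  0x80 | ((cp >> 6) & 0x3F),
                  0x80 | (cp & 0x3F)])


def legacy_to_utf8(data: bytes, legacy_codec: str) -> bytes:
    """Convert EUC/JIS/Shift-JIS bytes to UTF-8 in two steps.

    Step 1 needs a table: legacy bytes -> Unicode code points. Python's
    built-in codec (e.g. "shift_jis", "euc_jp", "iso2022_jp") is used
    here as a stand-in for that table.
    Step 2 needs no table: each code point goes through utf8_encode().
    """
    text = data.decode(legacy_codec)
    return b"".join(utf8_encode(ord(ch)) for ch in text)
```

For example, legacy_to_utf8(sjis_bytes, "shift_jis") should give the same
bytes as decoding with the Shift-JIS table and re-encoding with any
standard UTF-8 encoder. Only the first step depends on the legacy
character set.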

It is very exciting to see that UTF-8 is finally starting to fly these
days. It looks like there are signs of exponential growth.

Markus

-- 
Markus G. Kuhn, Security Group, Computer Lab, Cambridge University, UK
email: mkuhn at acm.org,  home page: <http://www.cl.cam.ac.uk/~mgk25/>



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:40 EDT