From: Bob_Hallissy@sil.org
Date: Tue Dec 17 2002 - 07:12:57 EST
On 16/12/2002 22:02:36 "Magda Danish (Unicode)" wrote:
>> I have a data in devanagri true type font i want to convert
>> this data into mangal unicode.
Sunil,
For Windows or Mac use: If you want to convert data from one encoding to
Unicode, one option is to look at the free TECkit package. There are many
non-Unicode encodings of Devanagari, so I'm unable to guess how your data
is currently encoded. TECkit is table-driven, i.e., you find or prepare a
description of the mapping between your encoding and Unicode, and then
TECkit uses that description to convert data. You may even be able to find
a mapping description already prepared as TECkit can use the XML mapping
definitions from ICU (see
http://oss.software.ibm.com/cvs/icu/charset/data/xml/) For more
information about TECkit or to download it, see
http://www.sil.org/nrsi/teckit/
Depending on the characteristics of your encoding and your desire to do a
bit of programming, you may also be able to incorporate the ICU
(International Components for Unicode) library into your own program to do
the conversion you need. See
http://oss.software.ibm.com/developerworks/opensource/icu/project/ for more
information.
NB: One of the complexities you may run into, and which will limit your
options, is that your encoding may store text in a different order than
Unicode requires. If this is the case, TECkit can do the rearrangement for
you but I'm not sure ICU will easily do that. Certainly the current
standard for XML-based descriptions of encoding mappings as given in
Unicode Technical Report 22 (see
http://www.unicode.org/unicode/reports/tr22/ ) cannot express such
mappings.
Bob
This archive was generated by hypermail 2.1.5 : Tue Dec 17 2002 - 08:39:40 EST