[Unicode]   Common Locale Data Repository : Bug Tracking Home | Site Map | Search
 
Modify

CLDR Ticket #10763(accepted data)

Opened 9 months ago

Last modified 6 months ago

Transform Devanagari from/to Grantha

Reported by: sascha Owned by: sascha
Component: translit Data Locale: und-Deva, und-Gran
Phase: dsub Review:
Weeks: Data Xpath:
Xref:

Description

Because the Grantha script has been encoded at a constant offset to Devanagari, it should be straightforward to write a CLDR transform for converting from Devanagari to Grantha and vice versa.

Here’s a Python script for converting from Devanagari to Grantha, kindly contributed by Shriramana Sharma <jamadagni@…>:

ifile = open("udhr/udhr_san.xml")
ofile = open("/tmp/udhr_san_gran.xml", "w")
for iline in ifile:
    oline = ""
    for c in iline:
        o = ord(c)
        if 0x0900 <= o <= 0x097f and o not in (0x0964, 0x0965):
            oline += chr(o + 0x10a00)
        else:
            oline += c
    ofile.write(oline)
ifile.close()
ofile.close()

In terms of CLDR-internal implementation, we could do this via a small set of (trivial) rewrite rules since the current transform mechanism doesn’t allow arithmetics on codepoints. It might be an option to add Grantha to the existing InterIndic transforms.

For test data, see https://github.com/unicode-org/udhr/pull/13 (respectively, once this pull request has been merged into the master branch, the udhr_san_gran.xml file in the Unicode UDHR repository).

Attachments

Change History

comment:1 Changed 9 months ago by mark

Fine to create a transform based on this, which is what we assume the ticket is for.

comment:2 Changed 9 months ago by mark

  • Owner changed from anybody to sascha
  • Status changed from new to accepted
  • Milestone changed from UNSCH to 33

comment:3 Changed 6 months ago by sascha

  • Milestone changed from 33 to 34
View

Add a comment

Modify Ticket

Action
as accepted
Author


E-mail address and user name can be saved in the Preferences.

 
Note: See TracTickets for help on using tickets.