Re: Unicode, SMS and year 2012

From: Cristian Secară <orice_at_secarica.ro>
Date: Sat, 28 Apr 2012 01:29:30 +0300

On Fri, 27 Apr 2012 12:26:25 -0700, Mark Davis ☕ wrote:

> Actually, if the goal is to get as many characters in as possible,
> Punycode might be the best solution. That is the encoding used for
> internationalized domains. In that form, it uses a smaller number of
> bytes per character, but a parameterization allows use of all byte
> values.

I suspect the goal of Punycode is to map a wide character set onto a
restricted one, without much concern for the resulting string length.
If the original string is in a script outside the restricted target
character set, the encoded string grows too much to be of interest
in the SMS discussion.

Just do a test: type some text in a non-Latin alphabetic script into
this page: http://demo.icu-project.org/icu-bin/idnbrowser
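
The same kind of test can be run locally. What follows is only a minimal
sketch, assuming Python 3 and its built-in "punycode" codec. That codec
applies the bare Punycode algorithm, without the IDNA preprocessing or
the "xn--" prefix that the demo page adds. The function name
compare_lengths and the sample strings are made up for illustration.
It prints the encoded size next to the UTF-8 and UTF-16 sizes so the
lengths can be compared for any given text.

```python
def compare_lengths(text: str) -> dict:
    """Return the encoded size of `text` under several encodings."""
    # Bare Punycode (RFC 3492), via Python's built-in codec.
    puny = text.encode("punycode")
    return {
        "characters": len(text),
        "punycode_bytes": len(puny),
        "utf8_bytes": len(text.encode("utf-8")),
        # UTF-16 big-endian without a BOM; equals UCS-2 for BMP text.
        "utf16_bytes": len(text.encode("utf-16-be")),
        "punycode": puny.decode("ascii"),
    }


if __name__ == "__main__":
    # Hypothetical samples: Cyrillic, Greek, and plain ASCII.
    for sample in ("привет", "γειά σου", "hello"):
        print(sample, compare_lengths(sample))
```

Note that the Punycode output consists of ASCII characters only, so how
many of them fit into one message also depends on how the SMS payload
itself is encoded.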

Cristi

-- 
Cristian Secară
http://www.secarica.ro
Received on Fri Apr 27 2012 - 17:34:36 CDT
