From: Mark Davis ☕ (mark@macchiato.com)
Date: Tue May 18 2010 - 14:08:38 CDT
The UTC is adding the Dandas to the class of sentence-terminal punctuation
(STerm). This means that they will mark the end of a sentence in the default
sentence-break algorithm. It would be useful to get feedback from people
familiar with less widely used scripts on whether there are *other*
punctuation marks that are also missing from this set, so that they can be
proposed to the UTC.
The following link lists the Unicode 5.2 characters that have the General
Category Other Punctuation (Po) but are not STerm:
http://unicode.org/cldr/utility/list-unicodeset.jsp?a=[:po:]-[:sterm:]&g=gc
For example, is U+2CF9 ( ⳹ ) COPTIC OLD NUBIAN FULL STOP a sentence-terminal
punctuation mark?
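
For anyone who wants to reproduce a rough version of that set difference
locally, here is a minimal sketch in Python. It rests on two assumptions.
Python's unicodedata module exposes the General Category but not the STerm
property, so STERM_SAMPLE below is a small, hand-picked and deliberately
incomplete stand-in for the STerm entries in PropList.txt. The module also
follows whatever Unicode version the interpreter ships with, not 5.2, so the
result will not match the link above exactly. The function names are made up
for illustration.

```python
import sys
import unicodedata

# Hypothetical, incomplete sample of STerm characters (an assumption for
# illustration). The authoritative list is the STerm section of the UCD
# file PropList.txt.
STERM_SAMPLE = {
    "\u0021",  # EXCLAMATION MARK
    "\u002E",  # FULL STOP
    "\u003F",  # QUESTION MARK
    "\u0589",  # ARMENIAN FULL STOP
    "\u061F",  # ARABIC QUESTION MARK
    "\u06D4",  # ARABIC FULL STOP
    "\u3002",  # IDEOGRAPHIC FULL STOP
}


def other_punctuation():
    """Return every character whose General_Category is Po."""
    return {
        chr(cp)
        for cp in range(sys.maxunicode + 1)
        if unicodedata.category(chr(cp)) == "Po"
    }


def po_not_sterm(sterm=STERM_SAMPLE):
    """Approximate [:po:]-[:sterm:] using the supplied STerm set."""
    return other_punctuation() - set(sterm)


candidates = po_not_sterm()
```

Replacing STERM_SAMPLE with the full STerm list parsed from PropList.txt
should bring the result much closer to what the utility page shows.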
Mark
— Il meglio è l’inimico del bene —