Re: Usage stats?

From: Richard Wordingham <richard.wordingham_at_ntlworld.com>
Date: Sat, 28 Mar 2015 23:29:21 +0000

On Sat, 28 Mar 2015 00:59:56 +0000
Richard Wordingham <richard.wordingham_at_ntlworld.com> wrote:

> On Fri, 27 Mar 2015 16:27:26 -0400
> Michael Norton <michaelanortonster_at_gmail.com> wrote:
>
> > Easy example: what's the code for [blank space] U+020 across all
> > language sets of Unicode? Is it the same ie: 100%?

I've seen a claim from a normally reliable source that U+0020 is
extremely rare in Thai or Japanese text. It does occur in Japanese
text, though quite possibly as an error for IDEOGRAPHIC SPACE.

In Thai, U+0020 is an extremely common and prescribed punctuation
mark. It is reliably used as a clause and sentence separator, and is
also used to delimit names and also numbers composed of digits. In
newspaper columns, it occurs in most lines, and in books there are
usually several to the line. The other common punctuation marks in
serious material are the abbreviation mark U+002E FULL STOP (especially
for initialisms) and the list item separator U+002C COMMA. Quotation
marks, exclamation marks and ellipses occur in fictional dialogue with
pretty much the same meaning as in English.

Richard.
_______________________________________________
Unicode mailing list
Unicode_at_unicode.org
http://unicode.org/mailman/listinfo/unicode
Received on Sat Mar 28 2015 - 18:30:53 CDT

This archive was generated by hypermail 2.2.0 : Sat Mar 28 2015 - 18:30:55 CDT