From: Mark Davis ☕ (mark@macchiato.com)
Date: Sun Feb 13 2011 - 20:26:01 CST
As people have said on this thread, it depends entirely on the data sample
in use. As it turns out, when looking at HTML pages on the web (with a
good-sized sample from work here at Google), SPACE is the most frequent
character (by a huge margin). That is even true on Chinese pages, just
because of the proportion of markup on pages.
For those interested, the most frequent Alphabetic is 'e'.
Mark
*— Il meglio è l’inimico del bene —*
On Sat, Feb 12, 2011 at 02:13, Charlie Ruland <ruland@luckymail.com> wrote:
> U+0020 SPACE is by no means ‘the most used character’ universally. For
> Chinese it is completely unnecessary, not only when writing from top to
> bottom. The same is probably true for Japanese and ‘early forms’ of
> influential W Eurasian languages such as Phoenician, Hebrew, Greek and
> Latin. And further examples from other parts of the world won’t be hard to
> find.
> Charlie
This archive was generated by hypermail 2.1.5 : Sun Feb 13 2011 - 20:31:19 CST