From: Eric Muller (emuller@adobe.com)
Date: Mon Feb 14 2011 - 14:17:11 CST
On 2/13/2011 6:26 PM, Mark Davis ☕ wrote:
> As it turns out, when looking at HTML pages on the web (with a
> good-sized sample from work here at Google), SPACE is the most
> frequent character (by a huge margin).
Are you looking at the text nodes of the HTML (after space
normalization) or at the HTML serialization? E.g., do you count the
space in '<p class="foo">'?
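To make the distinction concrete, here is a minimal sketch in Python. It is not how the sample in the quoted message was measured; the sample markup, the class name TextCollector, and both function names are made up for illustration, and collapsing whitespace runs with a regular expression is only a rough stand-in for real space normalization.

```python
import re
from html.parser import HTMLParser


class TextCollector(HTMLParser):
    """Hypothetical helper: gathers only the character data of text nodes."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.chunks = []

    def handle_data(self, data):
        # Called for text between tags; attribute values never arrive here.
        self.chunks.append(data)


def count_spaces_serialized(html):
    """Count U+0020 in the raw markup, tags and attributes included."""
    return html.count(" ")


def count_spaces_text_nodes(html):
    """Count U+0020 in text nodes after collapsing whitespace runs."""
    parser = TextCollector()
    parser.feed(html)
    parser.close()
    text = "".join(parser.chunks)
    # Assumption: approximate space normalization by collapsing runs of
    # whitespace to one space and trimming the ends.
    normalized = re.sub(r"\s+", " ", text).strip()
    return normalized.count(" ")


sample = '<p class="foo">Hello   big world</p>'
print(count_spaces_serialized(sample))
print(count_spaces_text_nodes(sample))
```

On this sample the serialized count includes the space inside the start tag and the run of three spaces, while the text-node count sees only the two spaces left after normalization, so the two measurements can differ substantially.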
Thanks,
Eric.