Re: FYI: Google blog on Unicode

From: Mark Davis ☕ (mark@macchiato.com)
Date: Fri Jan 29 2010 - 13:06:50 CST

  • Next message: spir: "Re: Transform for Hans with multiple pronunciations"

    If anyone is interested, the detection algorithm we use is
    open-sourced, available in the Chrome source code.

    Mark

    On Fri, Jan 29, 2010 at 07:06, Mark Davis ☕ <mark@macchiato.com> wrote:
    > It is encodings determined by a detection algorithm. The declarations
    > for encodings (and language) are far too unreliable to be depended on.
    > The detection algorithm itself is fairly complex, but quite fast and
    > compact.
    >
    > Mark
    >
    >
    >
    > On Thu, Jan 28, 2010 at 21:38, Simon Montagu <smontagu@smontagu.org> wrote:
    >> On 28/01/2010 10:50, Mark Davis ☕ wrote:
    >>>
    >>> There's a blog on Unicode that people may find interesting:
    >>> http://googleblog.blogspot.com/2010/01/unicode-nearing-50-of-web.html
    >>>
    >>> (The graph on Unicode is too small; until they get that fixed, I have
    >>> the large one on http://www.macchiato.com/)
    >>>
    >>> Mark
    >>
    >> What exactly is this counting? Encodings declared internally in web-pages?
    >> Encodings declared in HTTP headers? Encodings determined by auto-detection?
    >> Some combination of the above?
    >>
    >> --
    >> Simon Montagu
    >> Mozilla internationalization
    >> סיימון מונטגיו
    >>
    >>
    >



    This archive was generated by hypermail 2.1.5 : Fri Jan 29 2010 - 13:11:06 CST