Re: Nicest UTF

From: Marcin 'Qrczak' Kowalczyk (qrczak@knm.org.pl)
Date: Fri Dec 10 2004 - 17:47:03 CST

    John Cowan <jcowan@reutershealth.com> writes:

    >> > The XML/HTML core syntax is defined with fixed behavior of some
    >> > individual characters like '&', '<', quotation marks, and with special
    >> > behavior for spaces.
    >>
    >> The point is what "characters" means in this sentence. Code points?
    >> Combining character sequences? Something else?
    >
    > Neither. Unicode characters.

    http://www.w3.org/TR/2000/REC-xml-20001006#charsets
    implies that the appropriate level for parsing XML is code points.

    In particular, XML allows a combining character directly after ">".
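
    A minimal sketch of that last point, assuming Python's standard
    xml.etree.ElementTree parser (my choice for illustration, not something
    named in the thread). The helper first_char_after_start_tag is a
    hypothetical name. The document places U+0301 COMBINING ACUTE ACCENT
    immediately after the ">" of the start tag; a conforming parser should
    accept it and report it as the first code point of the element content.

```python
import unicodedata
import xml.etree.ElementTree as ET

# U+0301 is a combining mark (non-zero canonical combining class).
COMBINING_ACUTE = "\u0301"


def first_char_after_start_tag(document: str) -> str:
    """Parse `document` and return the first code point of the root's text.

    Hypothetical helper for illustration only.
    """
    root = ET.fromstring(document)
    return (root.text or "")[:1]


# The combining mark directly follows the ">" that closes the start tag.
doc = "<a>" + COMBINING_ACUTE + "</a>"
ch = first_char_after_start_tag(doc)

# Show the code point, its name, and its combining class.
print(f"U+{ord(ch):04X}", unicodedata.name(ch), unicodedata.combining(ch))
```

    The ">" still ends the tag, and the combining mark becomes character
    data on its own, which is what one would expect if the parser works on
    code points rather than on combining character sequences.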

    -- 
       __("<         Marcin Kowalczyk
       \__/       qrczak@knm.org.pl
        ^^     http://qrnik.knm.org.pl/~qrczak/
    

