>It is expected that no characters will ever be assigned in
       Unicode that require the five-byte and six-byte UTF-8 forms, so
       you can consider the maximum for UTF-8 to be four bytes.
       You can, in fact, state this more strongly: *No characters will
       ever be assigned* in Unicode that require the five-byte and
       six-byte UTF-8 forms. Based on recent WG2 decisions (I think
       they made this decision last month), the same is true for ISO
       10646. All that's left now would be to formally change the
       definition for UTF-8 to eliminate the five- and six-byte forms.
       Peter
This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:21:02 EDT