I discovered today about a curious encoding practice, and I wished to share
this info with the Unicode List.
Cantonese uses many ideographs containing a "mouth" radical which are not
available in some legacy encodings. Some users invented a creative way to
encode these characters: they use a lowercase "o" to represent the mouth
radical.
E.g., "o左" (U+006F U+5DE6) stands for "咗" (U+5497). See an example of this
on <http://www.geocities.com/WongTzeWah/interview11.htm>.
Of course, that page is in Big-5: in Unicode, such a trick is not needed.
_ Marco
This archive was generated by hypermail 2.1.2 : Thu Mar 14 2002 - 05:24:26 EST