Re: Furigana

From: James Kass (jameskass@worldnet.att.net)
Date: Wed Aug 14 2002 - 00:53:58 EDT


John Cowan wrote,

> Non-characters aren't encoded, they're reserved either for specific
> purposes or for any desired purpose.
>

If it's a specific purpose, it seems like it should either fall under
"character" or mark-up.

I can understand reserving code points for any desired purpose,
such as control characters or escape sequences. These may well
differ from application to application. Once a meaning like
"INTERLINEAR ANNOTATION ANCHOR" has been assigned to
a code point, any application which chooses to use that code
point for any other purpose would be at fault.

In other words, if these characters are to be "used internally for
Japanese Ruby (furigana), etc.", then they ought to be able to
be used externally, as well.

I understand that having common internal use code points might
be considered handy from an implementer's point of view, but
suggest that such conventions should be shared among implementers
only, and should not be enshrined in a character encoding standard.
 
> > Is there such a thing as a non-character with a specific semantic
> > meaning?
>
> Why not?
>

Because it seems to be an oxymoron. If it has a specific semantic
meaning, then it should be possible to store and exchange it
without any loss of meaning. In other words, it's a character
and should be so encoded. (Logos and such notwithstanding.)

Best regards,

James Kass.



This archive was generated by hypermail 2.1.2 : Tue Aug 13 2002 - 23:01:08 EDT