Re: CJK Ideograph Fragments

From: Uriah Eisenstein (uriaheisenstein@gmail.com)
Date: Sun May 02 2010 - 08:06:08 CDT

  • Next message: Kevin Brown: "Greek Perispomeni - what's the correct symbol?"

    Not exactly, but I suppose such Hanzi fragments could be sued for similar
    purposes - e.g. looking up characters by components, where the available
    components may include non-character fragments. Some fragments may be useful
    for IME purposes, but probably not all.

    On Sat, May 1, 2010 at 8:57 PM, Edward Cherlin <echerlin@gmail.com> wrote:

    > 2010/4/28 John H. Jenkins <jenkins@apple.com>:
    > > No. You could certainly write up a proposal and submit it to the UTC.
    > > Should the UTC feel the idea has merit, it would then move it on to WG2
    > > and/or the IRG.
    > > The main problem here is that there is a very strong desire to limit
    > > ideograph encoding to attested and documentable forms. Anything which
    > does
    > > not exist in actual texts is not likely to be well-regarded.
    >
    > I had the idea some years ago of writing up a proposal to encode the
    > hanzi fragments used in Cangjie Shurufa IMEs. These fragments are used
    > extensively in dozens of howto books on keyboarding in Cangjie. This
    > includes the pieces (mostly real characters, with some radicals) used
    > on keyboard labels, and the common forms they stand for. I didn't get
    > any interest from the Cangjie development community or the authors of
    > a book on Cangjie that I have, so i abandoned the idea.
    >
    > Uriah, is this the sort of thing you have in mind?
    >
    > > Similarly, the
    > > UTC has a strong preference not to encoding anything which isn't in
    > actual
    > > use. Proposals to encode characters because they would be useful if
    > encoded
    > > even though they aren't actually being used right now are generally
    > looked
    > > on with disfavor.
    > >
    > > 在 Apr 28, 2010 12:03 PM 時, Uriah Eisenstein 寫到:
    > >
    > > Hello,
    > > My question is about common components of CJK Ideographs which are not
    > > encoded as independent Han characters (and perhaps indeed aren't). A good
    > > example is the right-hand part of the character 漢 itself: it is a
    > distinct
    > > component appearing in multiple other characters, but is not encoded to
    > the
    > > best of my knowledge. The same goes for the top part of 鳥 and 島, the
    > > surrounding part of 與 and 興 and several others. My question is whether
    > there
    > > are any plans or discussions for encoding these fragments in Unicode.
    > >
    > > (I haven't found anything about this in mailing list archives; I did find
    > > statements that Unicode does not intend to provide any decomposition data
    > of
    > > Han characters :) And for good reasons. However, such fragments may well
    > be
    > > useful for third-party software dealing with 漢字 glyph generation, lookup
    > by
    > > components etc.)
    > >
    > > Thanks,
    > > Uriah Eisenstein
    > >
    > >
    >
    >
    >
    > --
    > Edward Mokurai (默雷/धर्ममेघशब्दगर्ज/دھرممیگھشبدگر ج) Cherlin
    > Silent Thunder is my name, and Children are my nation.
    > The Cosmos is my dwelling place, the Truth my destination.
    > http://www.earthtreasury.org/
    >



    This archive was generated by hypermail 2.1.5 : Sun May 02 2010 - 08:14:01 CDT