RE: FW: 6 questions

From: B (11@onna.com)
Date: Tue Sep 18 2001 - 15:13:19 EDT


<ruby><rb>$B$8$e$&$$$C$A$c$s(B</rb><rp>(</rp><rt>Juuitchan</rt><rp>)</rp></ruby>
Well, I guess what you say is true,
I could never be the right kind of girl for you,
I could never be your woman
                      - White Town

--- Original Message ---
$B:9=P?M(B: "Magda Danish (Unicode)" <v-magdad@microsoft.com>;
$B08@h(B: unicode@unicode.org;
Cc:
$BF|;~(B: 01/09/18 17:25
$B7oL>(B: FW: 6 questions

>
>
>-----Original Message-----
>From: Bernard Miller [mailto:forunicode@yahoo.com]
>Sent: Monday, September 17, 2001 5:19 PM
>To: info@unicode.org
>Subject: 6 questions
>
>
>Hello,
>
>These are the questions I wanted to
>ask:
>
>1. Why does Unicode say that there are 63486 code
>values available to represent characters with single
>16 bit values and 2048 available to represent an
>additional 1,048,544 characters as surrogates? 65536 -
>2048 = 63488 (difference of 2) --I guess it's due to
>the 2 code values guaranteed not to be characters. But
>what about: 1024 x 1024 = 1,048,576 (difference of
>32), what accounts for the 32?

You can't have any character code end in FFFE or FFFF?

>
>2. CNS = chinese national standard? Why is there a
>chinese standard for japanese small variant forms
>(ch14, page 334 of Unicode 3.0 book)? Do CJK
>ideographs have small variant forms? Where are they?

If it is, not surprising. I am not sure, but I think Chinese fonts sometimes include hiragana. (Why not? There aren't many of them, and they come in VERY handy for people who bootleg Japanese CDs.)

>
>3. Why don't "noBreak" formatted Unicode characters
>have a canonical decomposition (the compatibility
>decomposition surrounded by glue)?
>
>4. Greek final sigma is not considered a compatibility
>decomposition (word position variant) because it's
>usage could also be dependant on spelling convention?
>Is that right? Even if so, isn't it more consistent to
>precede sigma with a non joiner if you don't want it
>to automatically be displayed as final sigma at the
>end of a word?

It is probably kept right where it is because of millenia of tradition.

>
>5. How come east asian width type W and H are non
>starters for line breaking?
>
>6. Why does Unicode use "capital" vs "small letter"
>terminology instead of "uppercase" vs "lowercase"? It
>seems like lowercase is more descriptive than "small
>letter".

Capital letters are actually "smaller" than lowercase, for Latin at least: no descenders!!

>
>
>__________________________________________________
>Terrorist Attacks on U.S. - How can you help?
>Donate cash, emergency relief information
>http://dailynews.yahoo.com/fc/US/Emergency_Information/
>
>



This archive was generated by hypermail 2.1.2 : Tue Sep 18 2001 - 14:28:54 EDT