From: M.-A. Lemburg (mal@egenix.com)
Date: Thu Dec 02 2010 - 17:31:09 CST
John H. Jenkins wrote:
>
> On 2010/11/29, at 下午2:17, M.-A. Lemburg wrote:
>
>> Question: Why don't these code points have the "Nd" category ?
>>
>
> Because they're not numerals, they're words/parts of words. East Asian writing systems don't traditionally distinguish the numeral "1" from the number-word "one." Yes, you *can* use them as straight decimal digits, but that's an extended use, and it's more akin to "two zero one zero" than "2010."
Right, but older Chinese texts and tables don't use the Arabic numerals
used nowadays.
>> Related to this, it is also unclear what to use as official zero
>> for these number systems (U+3007 is often recommended).
>>
>
> There isn't an official zero. U+3007 and U+96F6 are the two that are generally used. Which you want to use is somewhat context-dependent; if possible, the choice should be left to the user.
Ok, so this goes beyond what the standard defines as "simplistic
parsing".
Fair enough.
Thanks,
-- Marc-Andre Lemburg eGenix.com Professional Python Services directly from the Source (#1, Dec 03 2010) >>> Python/Zope Consulting and Support ... http://www.egenix.com/ >>> mxODBC.Zope.Database.Adapter ... http://zope.egenix.com/ >>> mxODBC, mxDateTime, mxTextTools ... http://python.egenix.com/ ________________________________________________________________________ ::: Try our new mxODBC.Connect Python Database Interface for free ! :::: eGenix.com Software, Skills and Services GmbH Pastor-Loeh-Str.48 D-40764 Langenfeld, Germany. CEO Dipl.-Math. Marc-Andre Lemburg Registered at Amtsgericht Duesseldorf: HRB 46611 http://www.egenix.com/company/contact/
This archive was generated by hypermail 2.1.5 : Thu Dec 02 2010 - 17:40:41 CST