Re: Character properties

From: Marcin 'Qrczak' Kowalczyk (qrczak@knm.org.pl)
Date: Sat Oct 21 2000 - 04:53:37 EDT


Wed, 11 Oct 2000 07:15:05 -0800 (GMT-0800), Mark Davis <mark@macchiato.com> pisze:

> Here is my take on the way Unicode general categories should be
> mapped to POSIX ones.

Reiterated, here is my compilation of mapping of properties proposed
for Haskell:

isAssigned: all except Cs, Cn
isControl: Cc, Cf
isPrint: L*, M*, N*, P*, S*, Zs, Co
isSpace: Zs (except U+00A0, U+202F), TAB, LF, VT, FF, CR
isGraph: L*, M*, N*, P*, S*, Co
isPunct: P*
isSymbol: S*
isAlphaNum: L*, M*, N*
isDigit: Nd
isHexDigit: '0'..'9', 'A'..'F', 'a'..'f'
isDecDigit: '0'..'9'
isOctDigit: '0'..'7'
isAlpha: L*, M*
isUpper: Lu, Lt
isLower: Ll
isLatin1: U+0000..U+00FF
isAscii: U+0000..U+007F

-- 
 __("<  Marcin Kowalczyk * qrczak@knm.org.pl http://qrczak.ids.net.pl/
 \__/
  ^^                      SYGNATURA ZASTĘPCZA
QRCZAK



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:21:14 EDT