Is it safe to dig into the comment contents of PropList.txt?

From: Steffen <sdaoden_at_gmail.com>
Date: Tue, 05 Nov 2013 14:38:06 +0100

Hello,
...i came to this solution in order to generate test data with
awk(1) in a memory-friendly way.

(The problem i'm facing is that _PRINT and _GRAPH cannot be set
for some characters listed in PropList.txt. For example, _PRINT
can't be set for U+0009, CHARACTER TABULATION (ht), since it's a
Cc. To know that without the comments, i would first have to
parse UnicodeData.txt and store the character information in
memory (not thinking about further options), and that requires
a lot of memory, more than is available on low-end machines.)
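
To illustrate the idea, here is a minimal sketch, in Python rather than awk(1), of reading the General_Category straight from the trailing comment of each PropList.txt data line, one line at a time, so nothing from UnicodeData.txt has to be held in memory. The line layout it expects ("range ; property # category [count] names") is only an assumption taken from how the files look, which is exactly the undocumented part. The names parse_proplist_line, iter_proplist and may_set_print are hypothetical, and the rule in may_set_print covers only the Cc case mentioned above.

```python
import re

# Assumed (undocumented) data line layout, e.g.:
#   0009..000D    ; White_Space # Cc   [5] <control>..<control>
#   0020          ; White_Space # Zs       SPACE
LINE_RE = re.compile(
    r"^([0-9A-Fa-f]{4,6})"            # first code point
    r"(?:\.\.([0-9A-Fa-f]{4,6}))?"    # optional end of range
    r"\s*;\s*(\w+)"                   # property name
    r"\s*#\s*(\S{2})\s"               # two-character category from the comment
)


def parse_proplist_line(line):
    """Return (first, last, property, category), or None for non-data lines."""
    m = LINE_RE.match(line)
    if m is None:
        return None
    first = int(m.group(1), 16)
    last = int(m.group(2), 16) if m.group(2) else first
    return first, last, m.group(3), m.group(4)


def iter_proplist(lines):
    """Yield parsed entries one by one; nothing is accumulated in memory."""
    for line in lines:
        entry = parse_proplist_line(line)
        if entry is not None:
            yield entry


def may_set_print(category):
    """Illustrative rule only: the post states that a Cc cannot get _PRINT."""
    return category != "Cc"


if __name__ == "__main__":
    sample = [
        "# White_Space entries (sample data, not the real file)",
        "",
        "0009..000D    ; White_Space # Cc   [5] <control>..<control>",
        "0020          ; White_Space # Zs       SPACE",
    ]
    for first, last, prop, cat in iter_proplist(sample):
        print(f"{first:04X}..{last:04X} {prop} {cat} print={may_set_print(cat)}")
```

Passing an open file object for PropList.txt to iter_proplist would stream the real data the same way. Whether the comment field can be relied upon like this is the open question.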

The standard text doesn't mention anything, neither does tr44, and
a grep(1) over the data files doesn't give any results either.
The comments have been in this form since (at least) Unicode 3.1
([1]), but i'd feel better if the format of the comments were
stated somewhere.
Thank you.

  [1] <http://www.unicode.org/Public/3.1-Update/PropList-3.1.0.txt>

--steffen
Received on Tue Nov 05 2013 - 07:41:29 CST
