From: Neil Harris (neil@tonal.clara.co.uk)
Date: Fri Oct 13 2006 - 09:32:56 CST
A slightly more cautious version of the program above:
import os, string, re
filenames = [x for x in os.listdir(".") if ".html" in x]
for filename in filenames:
points = [int(string.join(x, ""), 16) for x in
re.findall(r"U\+([0-9A-Fa-f]+)|([0-9A-Fa-f]+)\(", open(filename).read())
if len(string.join(x, "")) >= 4]
points = {}.fromkeys(points).keys()
points.sort()
print "<h1>", string.replace(filename, ".html", ""), "</h1>"
n = 0
for p in points:
print "&#%d;" % p
n += 1
if n % 40 == 0: print "<br>"
Of course, I give absolutely no guarantee that either this program, or
the previous one, gives the correct results, or is fit for any purpose
whatsoever. Use at your own risk.
-- Neil
This archive was generated by hypermail 2.1.5 : Fri Oct 13 2006 - 09:42:47 CST