From: Neil Harris (neil@tonal.clara.co.uk)
Date: Fri Oct 13 2006 - 09:24:09 CST
JFC Morfin wrote:
> From experience, what would be the easiest way to generate PDFs of the
> different http://www.iana.org/assignments/idn/registered.htm tables
> which would display the concerned glyphs?
> Thank you for the help.
> jfc
Get the files thus:
wget -r -l1 http://www.iana.org/assignments/idn/registered.htm
Now run this Python program in the directory containing the downloaded
HTML files:
-------------------------------
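# Python 2 syntax (print statement, string module functions).
import os, string, re
filenames = [x for x in os.listdir(".") if ".html" in x]
for filename in filenames:
    # Collect every code point written as U+XXXX, or as hex digits
    # followed by "(".
    points = [int(string.join(x, ""), 16) for x in
              re.findall(r"U\+([0-9A-Fa-f]+)|([0-9A-Fa-f]+)\(",
                         open(filename).read())]
    # Drop duplicates, then sort.
    points = {}.fromkeys(points).keys()
    points.sort()
    print "<h1>", string.replace(filename, ".html", ""), "</h1>"
    n = 0
    for p in points:
        # Emit each code point as a numeric character reference.
        print "&#%d;" % p
        n += 1
        # Break the line after every 40 glyphs.
        if n % 40 == 0: print "<br>"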
-------------------------------
Redirect its output to an HTML file and open that file in your web
browser, after installing CJK fonts covering both traditional and
simplified Chinese. Then print to PDF.
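For anyone without a Python 2 interpreter, the same steps can be sketched
in Python 3 roughly as follows. This is only a sketch under assumptions,
not a tested replacement: the function names (extract_code_points,
render_section, build_page) are made up for illustration, the regular
expression is copied unchanged from the program above, and the files are
assumed to be readable as UTF-8 (undecodable bytes are replaced). Unlike
the program above, it wraps the result in a page that declares its
charset and escapes the heading text.

```python
import html
import re
from pathlib import Path

# Same pattern as the Python 2 program: "U+XXXX", or hex digits followed by "(".
CODE_POINT_RE = re.compile(r"U\+([0-9A-Fa-f]+)|([0-9A-Fa-f]+)\(")


def extract_code_points(text):
    """Return the sorted, de-duplicated code points found in text."""
    found = set()
    for with_prefix, before_paren in CODE_POINT_RE.findall(text):
        # Exactly one of the two groups is non-empty for each match.
        found.add(int(with_prefix or before_paren, 16))
    return sorted(found)


def render_section(title, points, per_line=40):
    """Build an <h1> heading followed by numeric character references."""
    parts = ["<h1>%s</h1>" % html.escape(title)]
    for count, point in enumerate(points, start=1):
        parts.append("&#%d;" % point)
        if count % per_line == 0:
            parts.append("<br>")  # line break after every per_line glyphs
    return "\n".join(parts)


def build_page(directory="."):
    """Combine every *.html file in directory into one HTML page."""
    sections = []
    for path in sorted(Path(directory).glob("*.html")):
        # Assumption: UTF-8 input; undecodable bytes are replaced, not fatal.
        text = path.read_text(encoding="utf-8", errors="replace")
        sections.append(render_section(path.stem, extract_code_points(text)))
    head = '<!DOCTYPE html>\n<html><head><meta charset="utf-8"></head><body>'
    return head + "\n" + "\n".join(sections) + "\n</body></html>"


if __name__ == "__main__":
    # Print to standard output, as the original program does.
    print(build_page("."))
```

As before, the page goes to standard output, so redirect it to a file
outside the download directory (otherwise a second run would scan its
own output as well).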
-- Neil