On Thu, Mar 10 2016 at 22:40 CET, kenwhistler_at_att.net writes:
[...]
> The *reason* that NamesList.txt exists at all is to drive the tool, unibook,
> that formats the full Unicode code charts for posting. It is only
> posted in the Unicode Character Database at all as a matter of
> convenience, to give people access to a text only version of the
> names list that appears in the fully formatted pdf versions of the
> code charts that contain all the representative glyphs.
>
> NamesList.txt should *not* be data mined.
I've just noticed that NamesList.txt is, in a sense, data mined by the
Unicode Consortium itself. I mean the "Unicode Utilities: Character
Properties" page, which, e.g. for LATIN SMALL LETTER P WITH FLOURISH
(http://unicode.org/cldr/utility/character.jsp?a=A753), displays in
particular:

    subhead: Medievalist addition

Am I right that this information is available only in NamesList.txt?
In my opinion this is important information and should be officially
available for character data mining engines.
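
To make concrete what such data mining involves, here is a minimal sketch
of how a tool might recover the subhead for a character from NamesList.txt.
It assumes that subhead lines start with "@" followed by a tab, that block
header lines start with "@@" followed by a tab, and that character entries
are a hexadecimal code point, a tab, and the name. The helper name and the
sample lines are hypothetical and written for illustration only; they are
not copied from the real file, and this is not necessarily how the Unicode
Utilities obtain the value.

```python
import re

# Assumed line shapes (an assumption; verify against the NamesList format docs):
#   "@@\t..."               block header, resets the current subhead
#   "@\t\tSubhead text"     subhead
#   "A753\tCHARACTER NAME"  character entry
CHAR_LINE = re.compile(r"^([0-9A-F]{4,6})\t(.+)$")


def subheads_by_code_point(lines):
    """Map each code point to the subhead it is listed under (hypothetical helper)."""
    result = {}
    current = None
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line.startswith("@@\t"):
            # A new block starts; the previous subhead no longer applies.
            current = None
        elif line.startswith("@\t"):
            # Subhead line: drop the leading "@" and surrounding whitespace.
            current = line[1:].strip()
        else:
            match = CHAR_LINE.match(line)
            if match:
                result[int(match.group(1), 16)] = current
    return result


# Illustrative sample lines, not taken from the real file.
sample = [
    "@@\tA720\tLatin Extended-D\tA7FF",
    "@\t\tMedievalist addition",
    "A753\tLATIN SMALL LETTER P WITH FLOURISH",
]
print(subheads_by_code_point(sample).get(0xA753))
```

The sketch has to track state across lines because the subhead is implied
only by the position of an entry in the file, not stored as a per-character
field.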
Best regards
Janusz
--
Prof. dr hab. Janusz S. Bien - Uniwersytet Warszawski (Katedra Lingwistyki Formalnej)
Prof. Janusz S. Bien - University of Warsaw (Formal Linguistics Department)
jsbien@uw.edu.pl, jsbien@mimuw.edu.pl, http://fleksem.klf.uw.edu.pl/~jsbien/

Received on Sat Mar 26 2016 - 04:11:48 CDT