I would recommend reading Microsoft Word files by converting them to RTF
format and reading that. Reading Word's binary format is quite tricky
(even when you work for Microsoft!) and it does change from year to year
in order to handle enhancements.
Murray
>----------
>From: unicode@Unicode.ORG[SMTP:unicode@Unicode.ORG]
>Sent: Tuesday, June 18, 1996 8:19 PM
>To: unicode@Unicode.ORG
>Subject: Microsoft Word type binary file
>
>To:unicode@unicode.org
>Subject:Microsoft Word type binary file.
>Reply-To:Lieyong Fu <lfu@iii.com>
>
>Would anyone please help me if you know where can I find
>information/spec
>about the binary file structure that Microsoft Word/Word Perfect
>program save
>and retrieve from when writting/reading a document.
>
>I assume every character is represented by the value of character and
>which
>font file does this character belong to.
>
This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:31 EDT