On 10/11/06, Craig White <craigwhite@azapple.com> wrote:
I have a lot of documents to convert from Microsoft 'doc' format to HTML.

I have found a number of tools to do that, but they don't really clean
the documents up very well, which means that I will have to do a lot of
hand edits. That isn't so bad, considering that I will undoubtedly have
to do this anyway to get a common CSS, common headers/footers, etc.

If anyone has suggestions on the best methods for the above, I would
appreciate them, but thus far I see little better than OpenOffice macros,
which do the conversions in bulk.

More importantly though, there is a structure to the storage...

Base (subdirectory)
  Section 1 (subdirectory)
    Section 1A (document)
    Section 1B (document)
  Section 2 (subdirectory)
    Section 2A (document)

etc., and I would love some method to build a table of contents/links
to these documents automatically, and possibly even to output the end
result (the whole enchilada), perhaps as PDF, so that I have other means
to distribute this. I have seen many different publications that get
built this way and I don't know how they accomplish this. Are there
some open source tools that can do this?
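
To make the request concrete, here is a rough sketch of the kind of
table-of-contents builder meant above. It is only an illustration, not a
known tool: the function name build_toc and the choice of nested <ul>
lists are assumptions, and it presumes the documents have already been
converted to .html files inside the Base/Section layout shown earlier.

```python
import html
import os
from urllib.parse import quote


def build_toc(base_dir, extensions=(".html", ".htm")):
    """Return nested <ul> lists linking to every document under base_dir.

    Hypothetical helper: subdirectories become list headings, documents
    become links relative to base_dir.
    """

    def walk(directory):
        items = []
        for name in sorted(os.listdir(directory)):
            path = os.path.join(directory, name)
            if os.path.isdir(path):
                inner = walk(path)
                if inner:  # skip subdirectories that contain no documents
                    items.append("<li>%s\n%s\n</li>" % (html.escape(name), inner))
            elif name.lower().endswith(extensions):
                # Build a URL-safe link relative to the base directory.
                rel = os.path.relpath(path, base_dir).replace(os.sep, "/")
                title = os.path.splitext(name)[0]
                items.append('<li><a href="%s">%s</a></li>'
                             % (quote(rel), html.escape(title)))
        if not items:
            return ""
        return "<ul>\n" + "\n".join(items) + "\n</ul>"

    return walk(base_dir)
```

Calling build_toc("Base") and writing the result into an index.html in
the Base directory would give one page of links that follows the
directory structure; the common header, footer, and CSS would still have
to be wrapped around it.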

Craig

---------------------------------------------------
PLUG-discuss mailing list - PLUG-discuss@lists.plug.phoenix.az.us
To subscribe, unsubscribe, or to change your mail settings:
http://lists.PLUG.phoenix.az.us/mailman/listinfo/plug-discuss

This link to a (probably off-topic?) message:
    http://lists.plug.phoenix.az.us/lurker/message/20060721.041455.cfc3f8ea.en.html
may not apply to your main concerns here, if you already have a converter and just
want to run a script to convert tons of files,
but it might be of interest regarding the use of MS's own conversion software to convert .doc files to .html "without annotations".
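
For the "script to convert tons of files" case, a minimal sketch follows.
It assumes a LibreOffice (or similarly recent OpenOffice-derived) install
whose soffice binary accepts --headless and --convert-to; older builds
would need a macro instead. The function names are made up for
illustration, and dry_run defaults to True so that nothing is converted
until the printed commands have been checked.

```python
import os
import shlex
import subprocess


def find_docs(base_dir):
    """Yield every .doc file under base_dir, in a stable order."""
    for root, dirs, files in os.walk(base_dir):
        dirs.sort()  # make the traversal order predictable
        for name in sorted(files):
            if name.lower().endswith(".doc"):
                yield os.path.join(root, name)


def conversion_command(doc_path, soffice="soffice"):
    """Build the command that writes the HTML next to the source file.

    Assumption: 'soffice' is on the PATH and supports --convert-to.
    """
    out_dir = os.path.dirname(doc_path) or "."
    return [soffice, "--headless", "--convert-to", "html",
            "--outdir", out_dir, doc_path]


def convert_all(base_dir, dry_run=True):
    """Convert each document, or only print the commands when dry_run is set."""
    for doc in find_docs(base_dir):
        cmd = conversion_command(doc)
        if dry_run:
            print(shlex.join(cmd))
        else:
            subprocess.run(cmd, check=True)
```

Running convert_all("Base") prints one command per document; passing
dry_run=False actually runs them. The resulting HTML would still need the
hand cleanup described in the original message.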
--
Mike Schwartz    
Glendale  AZ
schwartz@acm.org
Mike.L.Schwartz@gmail.com