DOC to HTML conversions

Joseph Sinclair plug-discuss at stcaz.net
Tue Aug 22 23:46:44 MST 2006


2 ideas:
1) Batch convert to ODF (there is some work going on in this area at [http://www.opendocumentfellowship.org/applications/conversion], and a couple of solutions already available [such as DocVert (http://holloway.co.nz/docvert/index.html)] using OO.o and automation API's).  Once the document is in ODF (which is just a ZIP-compressed set of XML files and related image files) there are a couple of XSLT-based solutions currently available, none of them are great yet, but you can grab one and do some work to make it closer to what you want (and contribute to the community at the same time).
2) Dance on the "dark side" and /gulp/ *buy proprietary software* designed for this purpose, there are several Word-->HTML converters available from ISV's, they're not even close to FLOSS, but that's what you get working with Word documents...  Search Google for [Word "HTML Conversion" Automated], the first solution isn't bad from what I hear.

Craig White wrote:
> I am looking for a way to mass convert Microsoft Word documents to html.
> 
> I have been able to do this with Oo macro and it works fine but I wasn't
> impressed with some of the things that it does such as numbered lists in
> the word documents were favored as formatted text rather than an ordered
> list.
> 
> Perhaps I am expecting too much but perhaps there's something else out
> there that will do a better job than Oo.
> 
> Suggestions anyone?
> 
> Craig



More information about the PLUG-discuss mailing list