how to sanitize MS Word HTML output?

Steven A. DuChene linux-clusters at mindspring.com
Mon May 4 10:27:41 MST 2009


Hello all:
My wife has a class syllabus file from one of her profs at MCC and the file
is "supposed" to be html but it is that awful sort-of-html crap from
MS-Office. It is filled with a lot of unneeded style and formatting tags
as well as all kinds of stupid extra characters due to some MS "standard"
character formatting stuff. Things like breaking lines in the middle of
words and then adding an equal sign at the end of the broken line, or
replacing equal signs in the html code with "=3D".

Does anyone know of a tool that will clean this crappy excuse for html
code up into something more standard? Or failing that, just some tool
or script that will fix the weird character formatting stuff with the
extra equal signs and the "=3D" problems?
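
For what it's worth, the two symptoms described above (lines broken with a
trailing "=" and "=3D" showing up in place of "=") look like quoted-printable
encoding, which is what you get when a page is saved as a single-file web
archive or mailed as an attachment. That is only a guess from the symptoms.
If it is right, something like this rough Python sketch should undo that
part. The function names are made up for illustration; quopri is in the
standard library. It does nothing about the extra style tags.

```python
import quopri


def decode_quoted_printable(raw: bytes) -> bytes:
    """Undo quoted-printable encoding.

    Joins lines that were soft-broken with a trailing "=" and turns
    escapes such as "=3D" back into the byte they stand for ("=").
    """
    return quopri.decodestring(raw)


def clean_file(src_path: str, dst_path: str) -> None:
    """Decode src_path and write the result to dst_path (hypothetical helper)."""
    with open(src_path, "rb") as src:
        decoded = decode_quoted_printable(src.read())
    with open(dst_path, "wb") as dst:
        dst.write(decoded)
```

Usage would be something like clean_file("syllabus.htm", "syllabus-clean.htm"),
with both file names being placeholders. If the file is a full web archive it
will also have MIME headers and boundary lines that this does not strip.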
--
Steve DuChene




More information about the PLUG-discuss mailing list