Re: how to sanitize MS Word HTML output?

Top Page
Attachments:
Message as email
+ (text/plain)
+ (text/html)
+ (text/plain)
Delete this message
Reply to this message
Author: Lisa Kachold
Date:  
To: Steven A. DuChene, Main PLUG discussion list
Subject: Re: how to sanitize MS Word HTML output?
Load into OpenOffice, display source, remove formatting, highlight,
cut/splice into text file, or save as text, rename to HTML?

Should work?

There are also online document conversions, I think MS HTML to text is one
of them?

On Mon, May 4, 2009 at 10:27 AM, Steven A. DuChene <
> wrote:

> Hello all:
> My wife has a class sylibus file from one of her profs at MCC and the file
> is "supposed" to be html but it is that awful sort-of-html crap from
> MS-Office. It is filled with a lot of un-needed style and formating tags
> as well as all kinds of stupid extra characters due to some MS "standard"
> character formatting stuff. Things like braking lines in the middle of
> words and then adding an equal sign at the end of the broken line or
> replacing equal signs in the html code with "=3D'
>
> Does anyone know of a tool that will clean this crappy excuse for html
> code up into something more standard? Or failing that just some tool
> or script that will fix the weird character formating stuff with the
> extra equal signs or "=3D" problems???
> --
> Steve DuChene
>
>
> ---------------------------------------------------
> PLUG-discuss mailing list -
> To subscribe, unsubscribe, or to change your mail settings:
> http://lists.PLUG.phoenix.az.us/mailman/listinfo/plug-discuss
>




--
www.obnosis.com (503)754-4452
"Contradictions do not exist." A. Rand
---------------------------------------------------
PLUG-discuss mailing list -
To subscribe, unsubscribe, or to change your mail settings:
http://lists.PLUG.phoenix.az.us/mailman/listinfo/plug-discuss