Annoying .DOC files (was Re: Excelco Needs Accounting, Technical, Programming People)

J.L.Francois jlf@magusnet.gilbert.az.us
Mon, 18 Sep 2000 09:24:10 -0700


try:
catdoc

Package: catdoc
Priority: optional
Section: text
Installed-Size: 636
Maintainer: Bdale Garbee <bdale@gag.com>
Version: 0.91.4-1
Depends: libc6 (>= 2.1.2)
Suggests: wish
Architecture: i386
Size: 66394
MD5sum: e31084fed199fbda2ae24af844542250
Filename: dists/woody/main/binary-i386/text/catdoc_0.91.4-1.deb
Description: MS-Word to TeX or plain text converter
 This program extracts text from MS-Word files, trying to preserve
 as many special printable characters as possible. catdoc supports
 everything up to Word-97.
 .
 It doesn't even try to preserve fancy Word formatting, because
 Word users usually don't care about document structure, and it is
 this very thing which is important to LaTeX users.
 .
 Also provided is xls2csv, which extracts data from Excel spreadsheets
 and outputs it in comma-separated-value format.
 .
 This package suggests 'wish' because it also includes wordview, an
 optional Tk-based GUI for catdoc.  The MIME config provided in this
 package will use wordview is X is running, or catdoc directly if it
 is not.


It seems like on Mon, Sep 18, 2000 at 07:43:04AM -0700, Rod Roark scribbled:
Orig Msg> On Sun, 17 Sep 2000, Jason wrote:
Orig Msg> > > If interested, please send text in plain text to john@excelco.com.  DO
Orig Msg> > > NOT send any attachments in the form of *.DOC to me for review.  I will
Orig Msg> > > not open them!   Principals only at this time.
Orig Msg> > 
Orig Msg> > Is there a linux solution (even a simply C program to strip the M$
Orig Msg> > coding and give some decent plaintext; the M$ format files are rarely
Orig Msg> > in a sensible order) to opening these annoying .DOC files yet?
Orig Msg> 
Orig Msg> StarOffice reads them effectively.
Orig Msg> 
Orig Msg> -- Rod
Orig Msg>    http://www.sunsetsystems.com/
Orig Msg> 

JLF Sends...
Doing my part to educate the Clubie Illiterati
one LART at a time.