How to convert a pdf file to html ?

Steve Holmes steve at holmesgrown.com
Mon Feb 7 15:52:53 MST 2011


Yeah, I see that pdftohtml is a part of the poppler package in Arch
Linux.  I believe if you use 'pdftohtml -layout <file>' you might get
better results.  Personally, I'm only interested in the text portion
so I could be clueless as to what happened visually to the outcome.

On Sun, Jan 09, 2011 at 12:38:40PM -0700, joe at actionline.com wrote:
> 
> Thanks Brian.  I did obtain 'pdf20html' from another source
> and while it does work (somewhat), it does not preserve fonts
> and formatting very well.  The google.doc does a much better
> job, but it was surprisingly and painfully slow for me.  Takes
> a looooong time to resolve the fonts page by page.
> 
> 
> > In ubuntu, install the package poppler-utils to get pdftohtml.  I just
> > tried it out on a few pdfs I had lying around and it doesn't appear to
> > do a very good job of preserving the formatting, but maybe it will work
> > for your PDF's
> >
> > Brian Cluff
> >
> > On 01/08/2011 04:43 AM, joe at actionline.com wrote:
> >>
> >> Is there a viable linux utility to convert a pdf file to html?
> >>
> >> Or, is there a win app that is known to work with wine?
> >>
> >> Or, is there an app to convert a pdf file to an open office document
> >> with rtf formatting retained?
> >>
> >> (I know pdftotext works for extracting ascii text only.)
> >>
> >>
> >>
> >> ---------------------------------------------------
> >> PLUG-discuss mailing list - PLUG-discuss at lists.plug.phoenix.az.us
> >> To subscribe, unsubscribe, or to change your mail settings:
> >> http://lists.PLUG.phoenix.az.us/mailman/listinfo/plug-discuss
> >>
> >
> > ---------------------------------------------------
> > PLUG-discuss mailing list - PLUG-discuss at lists.plug.phoenix.az.us
> > To subscribe, unsubscribe, or to change your mail settings:
> > http://lists.PLUG.phoenix.az.us/mailman/listinfo/plug-discuss
> >
> 
> 
> ---------------------------------------------------
> PLUG-discuss mailing list - PLUG-discuss at lists.plug.phoenix.az.us
> To subscribe, unsubscribe, or to change your mail settings:
> http://lists.PLUG.phoenix.az.us/mailman/listinfo/plug-discuss


More information about the PLUG-discuss mailing list