How to convert a PDF file to HTML?

joe at actionline.com
Sun Jan 9 12:38:40 MST 2011


Thanks, Brian.  I did obtain 'pdftohtml' from another source,
and while it does work (somewhat), it does not preserve fonts
and formatting very well.  Google Docs does a much better
job, but it was surprisingly and painfully slow for me.  It takes
a looooong time to resolve the fonts page by page.
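
For anyone who wants to script the pdftohtml route from the quoted
message below, here is a minimal sketch in Python. It assumes the
pdftohtml binary from poppler-utils is on the PATH. The helper names
(build_pdftohtml_command, convert) are made up for illustration, and
the flags (-c, -s, -noframes) may not be available in every poppler
version, so check `pdftohtml -h` on your system first.

```python
import shutil
import subprocess


def build_pdftohtml_command(pdf_path, out_base, complex_layout=True):
    """Build an argument list for poppler's pdftohtml (hypothetical helper)."""
    cmd = ["pdftohtml"]
    if complex_layout:
        # -c requests "complex" output, which tries harder to keep
        # the original page layout.
        cmd.append("-c")
    # -s writes one HTML document covering all pages;
    # -noframes skips the frameset wrapper.
    cmd += ["-s", "-noframes", str(pdf_path), str(out_base)]
    return cmd


def convert(pdf_path, out_base):
    """Run pdftohtml on pdf_path, failing early if the tool is missing."""
    if shutil.which("pdftohtml") is None:
        raise RuntimeError("pdftohtml not found; install poppler-utils")
    # check=True raises CalledProcessError if pdftohtml exits non-zero.
    subprocess.run(build_pdftohtml_command(pdf_path, out_base), check=True)
```

Calling convert("report.pdf", "report") would run the tool with the
complex-layout option, which is probably the first thing to try when
fonts and formatting come out badly. How close the result gets to the
original still depends on the PDF.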


> In Ubuntu, install the package poppler-utils to get pdftohtml.  I just
> tried it out on a few PDFs I had lying around and it doesn't appear to
> do a very good job of preserving the formatting, but maybe it will work
> for your PDFs.
>
> Brian Cluff
>
> On 01/08/2011 04:43 AM, joe at actionline.com wrote:
>>
>> Is there a viable linux utility to convert a pdf file to html?
>>
>> Or, is there a win app that is known to work with wine?
>>
>> Or, is there an app to convert a pdf file to an open office document
>> with rtf formatting retained?
>>
>> (I know pdftotext works for extracting ascii text only.)
>>
>>
>>
>> ---------------------------------------------------
>> PLUG-discuss mailing list - PLUG-discuss at lists.plug.phoenix.az.us
>> To subscribe, unsubscribe, or to change your mail settings:
>> http://lists.PLUG.phoenix.az.us/mailman/listinfo/plug-discuss
>>



