making PDFs workable

James Crawford jrefl5 at gmail.com
Wed Sep 12 17:28:55 MST 2012


>As noted earlier, none of this helps if the PDF is just a big image.
>
>The PDF referenced below is an image exported from Xara Xtreme Pro (graphics software for Windows), and every test I can run on it indicates is a big image file; >no native text to copy.

>I don't run Adobe reader, it may have some added specialization (e.g. OCR) to allow text to be copied.

I have some instructions on a work system that will take a pdf image, convert it to tiff then ocr the result to a txt file.
We had a large number of documents that they wanted as text.

I'll try to remember to post the steps and script I used on Thursday.

James C.



More information about the PLUG-discuss mailing list