making PDFs workable
James Crawford
jrefl5 at gmail.com
Wed Sep 12 17:28:55 MST 2012
>As noted earlier, none of this helps if the PDF is just a big image.
>
>The PDF referenced below is an image exported from Xara Xtreme Pro (graphics software for Windows), and every test I can run on it indicates is a big image file; >no native text to copy.
>I don't run Adobe reader, it may have some added specialization (e.g. OCR) to allow text to be copied.
I have some instructions on a work system that will take a pdf image, convert it to tiff then ocr the result to a txt file.
We had a large number of documents that they wanted as text.
I'll try to remember to post the steps and script I used on Thursday.
James C.
More information about the PLUG-discuss
mailing list