Working on an old book

asked 2013-01-05 20:46:50 -0600

Weatherlawyer

I have a scanned book, initially saved on a Microsoft machine as a .jpg image and then opened at the library with Adobe as a PDF.

Now I want to work on it but don't know how to convert it to text. The book is out of copyright, but the only way I could scan it was with the highly restrictive resources at the local public library.

I have looked for OCR software but can't work out how to download any, having exhausted my search of the add and remove programmes tool.

Is there a simple system to recognise the text and render it into a word processor or text format so I can rewrite the subject?

Thanks in advance.


2 Answers

answered 2013-01-06 00:43:46 -0600

sideburns

My suggestion would be to install and use yagf, because it not only acts as a GUI front end but also brings in both cuneiform and tesseract, two different OCR back ends.
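
If you would rather skip the GUI, the tesseract back end can also be driven directly. The following is only a rough sketch, assuming tesseract is installed, the pages are available as .jpg files in one folder, and the book is in English; the function names and the folder layout are made up for illustration and are not part of yagf or tesseract.

```python
import shutil
import subprocess
from pathlib import Path


def tesseract_command(image: Path, lang: str = "eng") -> list[str]:
    """Build the tesseract call for one page image (hypothetical helper).

    tesseract takes an output *base* name and appends ".txt" itself.
    """
    out_base = image.with_suffix("")  # e.g. page-01.jpg -> page-01
    return ["tesseract", str(image), str(out_base), "-l", lang]


def ocr_pages(folder: Path, lang: str = "eng") -> list[Path]:
    """Run tesseract on every .jpg in folder; return the text files it should write."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract is not installed or not on PATH")
    written = []
    for image in sorted(folder.glob("*.jpg")):
        # check=True stops at the first page tesseract fails on
        subprocess.run(tesseract_command(image, lang), check=True)
        written.append(image.with_suffix(".txt"))
    return written
```

Calling ocr_pages(Path("scans")), where "scans" is a placeholder folder name, should leave one .txt file next to each page image, which you can then open in a word processor. If you only have the PDF, the pages would need to be exported back to images first.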

answered 2013-01-05 21:51:26 -0600

ThomasMcA

updated 2013-01-07 15:23:12 -0600

Another option is to create a Google account and upload your PDFs to Google Docs, which can automatically run OCR on them. See more here:

