Working on an old book

asked 2013-01-05 20:46:50 -0600

Weatherlawyer

I have a scanned book that was first saved on a Microsoft machine as a .jpg image and then opened at the library with Adobe as a PDF.

Now I want to work on it but don't know how to convert it to text. The book is out of copyright, but the only way I could scan it was with the highly restrictive resources at the local public library.

I have looked for OCR programs but can't work out how to download them, having exhausted my search of Add/Remove Programs.

Is there a simple system to recognise the text and render it into a word processor or text format so I can rewrite the subject?

Thanks in advance.

Mike.


2 Answers

answered 2013-01-06 00:43:46 -0600

sideburns

My suggestion would be to install and use yagf (http://symmetrica.net/cuneiform-linux/yagf-en.html). It not only acts as a GUI front end but also brings in both cuneiform and tesseract, two different OCR back ends.
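
If the GUI is not available, the tesseract back end can also be driven directly. The following is only a rough sketch, not part of yagf: it assumes the `tesseract` and `pdftoppm` command-line tools are installed (on Ubuntu these normally come from the `tesseract-ocr` and `poppler-utils` packages), and the function names and file paths are hypothetical. It renders each PDF page to a PNG, runs OCR on each page, and joins the results into one text file.

```python
import shutil
import subprocess
from pathlib import Path


def pdf_to_images_cmd(pdf_path, out_prefix, dpi=300):
    """Build the pdftoppm command that renders each PDF page to a PNG."""
    return ["pdftoppm", "-r", str(dpi), "-png", str(pdf_path), str(out_prefix)]


def ocr_cmd(image_path, out_base, lang="eng"):
    """Build the tesseract command; tesseract appends .txt to out_base itself."""
    return ["tesseract", str(image_path), str(out_base), "-l", lang]


def ocr_book(pdf_path, work_dir, lang="eng"):
    """OCR every page of a scanned PDF and return the path of the combined text.

    Assumption: both external tools are on PATH; otherwise an error is raised.
    """
    for tool in ("pdftoppm", "tesseract"):
        if shutil.which(tool) is None:
            raise RuntimeError(f"{tool} is not installed or not on PATH")

    work = Path(work_dir)
    work.mkdir(parents=True, exist_ok=True)

    # Step 1: one PNG per page, named page-<number>.png (zero-padded by pdftoppm).
    subprocess.run(pdf_to_images_cmd(pdf_path, work / "page"), check=True)

    # Step 2: OCR each page image in page order and collect the text.
    texts = []
    for image in sorted(work.glob("page-*.png")):
        out_base = image.with_suffix("")
        subprocess.run(ocr_cmd(image, out_base, lang), check=True)
        texts.append(out_base.with_suffix(".txt").read_text(encoding="utf-8"))

    # Step 3: write a single text file that a word processor can open.
    combined = work / "book.txt"
    combined.write_text("\n\n".join(texts), encoding="utf-8")
    return combined
```

Calling `ocr_book("book.pdf", "ocr_out")` (both names are placeholders) would leave the recognised text in `ocr_out/book.txt`. If the original .jpg scans are still to hand, tesseract can read them directly and the PDF step can be skipped. Expect to proofread the result, since OCR of an old book is rarely error-free.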

answered 2013-01-05 21:51:26 -0600

ThomasMcA

updated 2013-01-07 15:23:12 -0600

http://www.onlineocr.net/

Another option is to create a Google account and upload your PDFs to Google Docs, which can automatically run OCR on them. See more here: http://blog.rubypdf.com/2010/07/16/google-docs-add-ocr-support-to-pdf-and-images/

Comments

A URL theoretically answers the question, but it would be preferable to include the essential parts of the answer here and provide the link for reference. Thanks.

hhlp ( 2013-01-06 03:59:40 -0600 )
