Hacker News new | past | comments | ask | show | jobs | submit login

I did experiment with using this to bring out the text in photos of books. Here's an example:

Input image - http://i.imgur.com/6o5FwxG.jpg

Output image - http://i.imgur.com/7OIOxfO.png

I tried to use vanilla Tesseract on it, but I had no luck getting anything usable out of it.




You have to compensate for the 3D deformation present in the captured image. There has been some impressive work in this area recently.

Some commercial OCR engines such as Nuance Capture SDK have built in functionality for this.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: