In 2019 I was working on a project that involved OCRing millions of scanned historical documents. I evaluated Google, Azure, Amazon, Adobe, ABBYY, and Tesseract somewhat rigorously.
Google's OCR was by far the best, especially for obscured or malformed characters. Azure was second, and I ended up merging the results from both.
For my use case (in spring 2019), Tesseract was not very accurate and struggled especially with slanted text. Hopefully that has changed.