Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Does Google Cloud Vision OCR API have better accuracy and performance than Tesseract OCR API

I have integrated Google Cloud Vision API in my java application for text recognition from complex formatted documents. One of my colleague suggested to use "Tesseract API".Can anyone please give difference between these two API's.And which is better in terms of accuracy or have any advantage over other.TIA

like image 1000
nik Avatar asked Aug 08 '17 04:08

nik


People also ask

Is Google OCR better than Tesseract?

However, the quality of the Google Vision OCR is still better, especially on difficult cases such as very small text. Since the quality is most important to us, the Google Vision OCR wins the comparison in our use case.

What is better than Tesseract OCR?

ABBYY FineReader Unlike Tesseract, ABBYY Cloud OCR is not free ( pricing). If you want to learn how to use the API, you'll find everything you need to know in these quick start guides. Again, we have different options with respect to the OCR output format.

Is Google Vision API accurate?

Overall Results. Overall Results of OCR Text Accuracy with 90% confidence intervals Google Cloud Platform's Vision OCR tool has the greatest text accuracy by 98.0% when the whole data set is tested.

How accurate is Google OCR?

Leveraging Your Document Data Obviously, the accuracy of the conversion is important, and most OCR software provides 98 to 99 percent accuracy, measured at the page level. This means that in a page of 1,000 characters, 980 to 990 characters will be accurate.


1 Answers

I have used both of them. Google Vision is much faster than Tesseract and If it was a year back then the accuracy was also better.

Tesseract lately adapted LSTM with preferred language choice and trained data which when optimized could get faster about 2X or more. But surely it can not get faster than Google vision unless they start supporting CUDA.

Regarding accuracy I have tested both of them with best and worst document images and were Google Vision landed on 66.6% accuracy, Tesseract went to 82% which is high enough for me.

If you prefer accuracy Tesseract is a winner and if you prefer time Google Vision is the best option.

Also there are couple of other CUDA supported projects which may be better than them.

like image 171
Pratap Singh Mehra Avatar answered Sep 16 '22 22:09

Pratap Singh Mehra