Difference between Tesseract 3 and Tesseract 4?

Tags: ocr, tesseract

What are the major differences between Tesseract 3 and Tesseract 4? And why should I choose one over the other?

F.Lin asked Jan 29 '18 09:01




1 Answer

Tesseract 4.0 is generally more accurate than Tesseract 3. Tesseract 4 adds a deep learning OCR engine based on a Long Short-Term Memory (LSTM) neural network, which is a kind of Recurrent Neural Network (RNN).
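If you install Tesseract 4, you can usually still compare the two engines on your own images, because its command line accepts an `--oem` (OCR engine mode) flag. Below is a minimal sketch, assuming the `tesseract` binary is on your PATH and that your language data includes the legacy model (the "fast" and "best" data sets are LSTM-only). The helper names `build_ocr_command` and `run_ocr` are made up for this example.

```python
import subprocess

# Engine modes accepted by --oem in Tesseract 4 (see `tesseract --help-oem`).
OEM_LEGACY = 0   # the older, Tesseract 3 style engine
OEM_LSTM = 1     # the LSTM neural network engine
OEM_DEFAULT = 3  # use whatever the installed language data supports


def build_ocr_command(image_path, oem=OEM_DEFAULT, lang="eng"):
    """Return the argument list for a run that prints recognized text to stdout."""
    if oem not in (0, 1, 2, 3):
        raise ValueError("oem must be 0, 1, 2 or 3")
    return ["tesseract", image_path, "stdout", "-l", lang, "--oem", str(oem)]


def run_ocr(image_path, oem=OEM_DEFAULT, lang="eng"):
    """Run Tesseract and return the recognized text (hypothetical helper)."""
    result = subprocess.run(
        build_ocr_command(image_path, oem, lang),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if Tesseract reports a failure
    )
    return result.stdout
```

Calling `run_ocr("page.png", OEM_LEGACY)` and `run_ocr("page.png", OEM_LSTM)` on the same scan (the file name is a placeholder) is a quick way to see which engine suits your documents.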

Also check which version your platform provides. For example, on Ubuntu 16.04 LTS the default packages install Tesseract 3, not 4, while on Ubuntu 18.04 you can install Tesseract 4.
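To see which major version you actually ended up with, you can parse the output of `tesseract --version`. This is a rough sketch, assuming the first line looks like `tesseract 4.1.1` (some builds add a `v` prefix, and some older builds write the version to stderr, so both streams are merged here). The function names are my own, not part of any library.

```python
import re
import subprocess


def parse_major_version(version_output):
    """Extract the major version number from `tesseract --version` output."""
    match = re.search(r"tesseract\s+v?(\d+)\.", version_output)
    return int(match.group(1)) if match else None


def installed_major_version(binary="tesseract"):
    """Return the installed major version, or None if it cannot be determined."""
    try:
        result = subprocess.run(
            [binary, "--version"],
            stdout=subprocess.PIPE,
            stderr=subprocess.STDOUT,  # merge stderr into stdout
            text=True,
        )
    except FileNotFoundError:
        return None  # the binary is not installed or not on PATH
    return parse_major_version(result.stdout)
```

If `installed_major_version()` returns 3, the LSTM engine is not available and you would need a newer package or a build from source to get it.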

For more details, please refer to the following articles:

  1. A short introduction to the theory behind Tesseract 4, with installation steps: https://limitlessdatascience.wordpress.com/2019/07/01/tesseract-4-0-intro-installation/

  2. Tesseract 3 vs. 4 output comparison: https://limitlessdatascience.wordpress.com/2019/07/31/tesseract-3-0-and-4-0-implementation-and-output-comparison/

Yogesh Awdhut Gadade answered Sep 24 '22 17:09