Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Difference between two pip3 packages: pytesseract vs tesseract [closed]

What is the difference between these two packages?

pip3 install pytesseract
pip3 install tesseract
like image 604
Hatshepsut Avatar asked Feb 19 '17 19:02

Hatshepsut


People also ask

What is the difference between Pytesseract and Tesseract?

Tesserocr is a Python wrapper around the Tesseract C++ API. Whereas Pytesseract is a wrapper for the tesseract-ocr CLI. Therefore with Tesserocr you can load the model at the beginning or your program, and run the model separately (for example in loops to process videos).

Do I need to install Tesseract to use Pytesseract?

OK - one more step before we can use pytesseract, we need to figure out where our tesseract executable is installed. The tesseract executable location is needed for pytesseract to work properly.

Is Tesserocr faster than Pytesseract?

From my experience Tesserocr is much faster than Pytesseract. Tesserocr is a python wrapper aroung the Tesseract C++ API. Whereas pytesseract is a wrapper the tesseract-ocr CLI.

Is Pytesseract slow?

Also the code is able to detect text, its just extremely slow. Recognising text from images is very cpu intensive - as a first step I would look at binarizing the input that is passed into image_to_string - this can speed up text recognition significantly.


1 Answers

pip3 install pytesseract gets me this package

Python-tesseract is a python wrapper for google's Tesseract-OCR

pip install tesseract gets me this package

Tesselation based Recovery of Amorphous halo Concentrations. The TesseRACt package is designed to compute concentrations of simulated dark matter halos from volume info for particles generated using Voronoi tesselation.

like image 92
Anomitra Avatar answered Sep 28 '22 06:09

Anomitra