Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Installing Tesseract-OCR on CentOS 6

I'm trying to install Tesseract-OCR on my server however when I install all what I believe to be the correct repos. When I try to install it the package is not found

I tried adding rpmforge but to no avail. Any ideas from somebody that has done before or is familiar with adding and searching through repos?

like image 387
William Avatar asked May 21 '14 19:05

William


People also ask

How do I install Tesseract modules?

Install tesseract Again, make sure the (tesseract) virtual environment is active before you run the conda install command. Type the command below to install tesseract. You can confirm that tesseract is installed in your virtual environment by running the command below. The command calls up the tesseract help screen.

Where is Tesseract installed on Linux?

Installing Tesseract on Debian and Ubuntu: This will install Tesseract under /usr/share/tesseract-ocr/4.00/tessdata. Note: For other Linux distributions, jump to Install Tesseract from Sources. By default, Tesseract will install the English language pack.


3 Answers

I have written a bash script to install Tesseract 3.05 on Centos 7. This fetches and installs all dependencies, and also installs language files for English, Hindi, Bengali and Thai.

Code available on GitHub

https://github.com/EisenVault/install-tesseract-redhat-centos

Hope this helps.

like image 106
Vipul Swarup Avatar answered Sep 26 '22 12:09

Vipul Swarup


I used these instructions which worked correctly in Centos

Install Tesseract OCR libs from sources in Centos

Download Leptonica and Teseract sources:

$ wget http://www.leptonica.org/source/leptonica-1.69.tar.gz
$ wget https://tesseract-ocr.googlecode.com/files/tesseract-ocr-3.02.02.tar.gz

Configure, compile, install libs:

 $ tar xzvf leptonica-1.69.tar.gz      
 $ cd leptonica-1.69      
 $ ./configure
 $ make
 $ sudo make install

 $ tar xzf tesseract-ocr-3.02.02.tar.gz
 $ cd tesseract-3.01
 $ ./autogen.sh
 $ ./configure
 $ make
 $ sudo make install
 $ sudo ldconfig

Download languages (english) and copy to tessdata folder:

$ wget http://tesseract-ocr.googlecode.com/files/tesseract-ocr-3.02.eng.tar.gz       
$ tar xzf tesseract-ocr-3.02.eng.tar.gz       
$ sudo cp tesseract-ocr/tessdata/* /usr/local/share/tessdata

and enjoy it ;)

like image 32
Yuseferi Avatar answered Sep 25 '22 12:09

Yuseferi


I recommend to try installing from rpm here: http://pkgs.org/download/tesseract There are also several dependencies: libpng-devel, libjpeg-devel, libtiff-devel, zlib and leptonica. Last 2 can also be found on RPM site

like image 41
aboev Avatar answered Sep 23 '22 12:09

aboev