Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How to install language in tesseract OCR

Tags:

ocr

tesseract

I have installed tesseract OCR and it has only 'eng' and 'osd' in the language list. I need german language. I tired following command

brew install tesseract-ocr-deu

but i am getting error.

Error: No available formula with the name "tesseract-ocr-deu" 
==> Searching for a previously deleted formula (in the last month)...
Warning: homebrew/core is shallow clone. To get complete history run:
   git -C "$(brew --repo homebrew/core)" fetch --unshallow

Error: No previously deleted formula found.
==> Searching for similarly named formulae...
Error: No similarly named formulae found.
==> Searching taps...
==> Searching taps on GitHub...
Error: No formulae found in taps.
like image 920
Lama Madan Avatar asked Oct 19 '18 11:10

Lama Madan


People also ask

Can Tesseract detect language?

Unfortunately tesseract does not have a feature to detect language of the text in an image automatically. An alternative solution is provided by another python module called langdetect which can be installed via pip.

Can Tesseract read other languages?

The Tesseract OCR engine supports multiple languages.

How many languages does Tesseract have?

Figure 2: You can see that Tesseract OCR supports a wide array of languages. In fact, Tesseract supports over 100 languages, including those that comprise characters and symbols, as well as right-to-left languages.


2 Answers

On mac OS type

brew install tesseract-lang 

Installs all languages, you can check them by,

tesseract --list-langs
like image 162
Sahana M Avatar answered Oct 20 '22 11:10

Sahana M


On MacOS Mojave (10.14.3) works:

   brew install tesseract-lang
like image 23
weivall Avatar answered Oct 20 '22 12:10

weivall