Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

What kind of OCR Java library should I use in Android? [closed]

Tags:

java

android

ocr

I would like to build an Android application that, via an OCR library, should scan a picture extracting text from it .

What Java library should I use?

like image 643
systempuntoout Avatar asked Jun 30 '09 09:06

systempuntoout


3 Answers

Don't know how good it is (it definitely needs to be trained first), but there is Ron Cemer's Java OCR library.

like image 141
Thilo Avatar answered Nov 16 '22 06:11

Thilo


If you are looking for a very extensible option or have a specific problem domain you could consider rolling your own using the Java Object Oriented Neural Engine.

I used it successfully in a personal project to identify the letter from an image such as this, you can find all the source for the OCR component of my application on github, here.

like image 7
davetapley Avatar answered Nov 16 '22 04:11

davetapley


try tesseract, checkout this article http://www.itwizard.ro/interfacing-cc-libraries-via-jni-example-tesseract-163.html and this example http://code.google.com/p/mezzofanti/

Edit: some more facts - tesseract is one of the best open source OCR used by google - there is training data available for many languages - mezzofanti is an android app that uses tesseract - beware: OCR does use a lot of CPU power. trying to OCR a A4 page with your T-Mob G1 will take a lot of time and the result may not impress you ;-)

like image 6
raudi Avatar answered Nov 16 '22 06:11

raudi