Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How to use Opencv for Document Recognition with OCR?

I´m a beginner on computer vision, but I know how to use some functions on opencv. I´m tryng to use Opencv for Document Recognition, I want a help to find the steps for it.

I´m thinking to use opencv example find_obj.cpp , but the documents, for example passport, has some variables, name, birthdate, pictures. So, I need a help to define the steps for it, and if is possible how function I have to use on the steps.

I'm not asking a whole code, but if anyone has any example link or you can just type a walkthrough, it is of great help.

like image 380
Ricardo Avatar asked Sep 24 '11 21:09

Ricardo


1 Answers

There are two very different steps involved here. One is detecting your object, and the other is analyzing it.

For object detection, you're just trying to figure out whether the object is in the frame, and approximately where it's located. The OpenCv features framework is great for this. For some tutorials and comprehensive sample code, see the OpenCv features2d tutorials and especially the feature matching tutorial.

For analysis, you need to dig into optical character recognition (OCR). OpenCv does not include OCR libraries, but I recommend checking out tesseract-ocr, which is a great OCR library. If your documents have a fixed structured (consistent layout of text fields) then tesseract-ocr is all you need. For more advanced analysis checking out ocropus, which uses tesseract-ocr but adds layout analysis.

like image 109
Kyle McDonald Avatar answered Oct 10 '22 01:10

Kyle McDonald