Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How to OCR engraved text?

I have this image

Image with Engraved Text

How to OCR it? I know this is very challenging, but I would really appreciate any help.

like image 895
Zaher Joukhadar Avatar asked Nov 29 '12 16:11

Zaher Joukhadar


People also ask

Can Yolo be used for text recognition?

Using YOLO(You only look once) for Text Detection There are many versions of it. YOLOv3 is the most recent and the fastest version. YOLOv3 uses Darknet-53 as it's feature extractor.

How do OCR models work?

The OCR model evolves through deep learning character recognition to provide accurate results. The OCR recognizes typewritten text but is known for capturing one character at a time. OWR scans typewritten text word by word. It is often called OCR, but the algorithm is slightly different.

Does OCR use neural network?

An optical character recognition (OCR) system, which uses a multilayer perceptron (MLP) neural network classifier, is described. The neural network classifier has the advantage of being fast (highly parallel), easily trainable, and capable of creating arbitrary partitions of the input feature space.


1 Answers

If you have the time to develop the detection yourself, I would do it roughly like this:

  1. Get 1000 images or so and either OCR them yourself or let the people on Amazon Mechanical Turk do it for you, it will cost virtually nothing. Now you have something to tune your algorithm on and measure how well you are doing.
  2. Like Ryan wrote, play with standard image filters, contrast, color, gauss, etc, manually or with something like http://www.roborealm.com/ . See if you can't find a combination that makes the text really stand out.
  3. Try the libraries again
  4. If the libs still don't work, try to use your knowledge of the picture to split it into separate digits. You know how many digits there should be and roughly how many pixels each should take. Use edge detection or something (perhaps standard OCR feature extraction, together with clustering will give you each digit as a cluster?) to find the digits and split them out separately.
  5. Do standard OCR feature extraction (don't be too creative - use existing libraries or at least read up on what the most common and simple are) on each digit and feed those features, together with the answer you got under 1) into a neural network or a SVM.
  6. Improve your feature set until the machine learning works.

Since you have only ten digits, which are fairly consistent between images, this should work.

like image 162
Gurgeh Avatar answered Oct 04 '22 07:10

Gurgeh