I'm trying to train tesseract to recognize numbers from real images of gas meters. The images that I use for training are made with a camera, for this reason there are many problems: poor images resolution, blurred images, poor lighting or low contrast as a result of the overexposure, reflections, shadows, etc... For training, I have created a large image with a series of digits captured by the images of the gas meter and I manually edited the file box to create the .tr files. The result is that only the digits of the clearer and sharper images are recognized while the digits of blurred images are not captured by tesseract.

As far as I can tell you need to OpenCV to recognize box in which numbers are located, but OpenCV is not god for OCR. After you locate box, just crop that part, do image processing and then hand it over to tesseract for OCR. I need help with OpenCV because I don't know how to program in OpenCV. Here are few real world examples. <ul> <li>First image is original image (croped power meter numbers)</li> <li>Second image is slightly cleaned up image in GIMP, around 50% OCR accuracy in tesseract</li> <li>Third image is completely cleaned image - 100% OCR recognized without any training!</li> </ul> <img src="https://i.stack.imgur.com/JUNNJ.jpg" alt="first image"><img src="https://i.stack.imgur.com/i48J7.png" alt="second image"><img src="https://i.stack.imgur.com/KjD0v.png" alt="third image">

I suggest you to: <ul> <li>use a tool to edit the boxes, such jTessBoxEditor, it's so helpful and let you winning a time. You can install it easily from here </li> <li>it's good idea to train the letters of actual situation (noisy, blurred). Your training set is still limited, you can add more training samples.</li> <li> I recommend you to use Tesseract's API themselves to enhance the image (denoise, normalize, sharpen...) for example : <code>Boxa * tesseract::TessBaseAPI::GetConnectedComponents(Pixa** pixa)</code> (it allows you to get to the bounding boxes of each character) Pix* pimg = tess_api->GetThresholdedImage(); </li> </ul> Here you find few examples

Tesseract is a pretty decent OCR package, but doesn't pre-process images properly. My experience is that you can get a good OCR result if you just do some pre-processing before passing it on to tesseract. There are a couple of key pointers that improves recognition significantly: <ol> <li>Remove background noise. Basically this means using mean adaptive thresholding. I'd also ensure that the characters are black and the background is white.</li> <li>Use the correct resolution. If you get bad results, scale the image up or down until you get good results. You want to aim at approx. font size 14 at 300 dpi; in my software that processes invoices that works best.</li> <li>Don't store images as JPEG; use BMP or PNG or something else that doesn't make the image noisy.</li> <li>If you're only using one or two fonts, try training tesseract on these fonts.</li> </ol> As for point 4, if you know the font that's going to be used, there are some better solutions than using Tesseract like matching these fonts directly on the images... The basic algoritm is to find the digits and match them to all possible characters (which are only 10)... still, the implementation is tricky.

Training Tesseract 3 to recognize numbers from real images of gas meters

Tags:

opencv

ocr

tesseract

I'm trying to train tesseract to recognize numbers from real images of gas meters.

The images that I use for training are made with a camera, for this reason there are many problems: poor images resolution, blurred images, poor lighting or low contrast as a result of the overexposure, reflections, shadows, etc...

For training, I have created a large image with a series of digits captured by the images of the gas meter and I manually edited the file box to create the .tr files. The result is that only the digits of the clearer and sharper images are recognized while the digits of blurred images are not captured by tesseract.

373

asked Jul 18 '11 13:07

Alessandro

4 Answers

As far as I can tell you need to OpenCV to recognize box in which numbers are located, but OpenCV is not god for OCR. After you locate box, just crop that part, do image processing and then hand it over to tesseract for OCR.

I need help with OpenCV because I don't know how to program in OpenCV.

Here are few real world examples.

First image is original image (croped power meter numbers)
Second image is slightly cleaned up image in GIMP, around 50% OCR accuracy in tesseract
Third image is completely cleaned image - 100% OCR recognized without any training!

first image second image third image

answered Oct 02 '22 11:10

valentt

I would try this simple ImageMagick command first:

 convert          \
    original.jpg  \
   -threshold 50% \
    result.jpg

(Play a bit with the 50% parameter -- try with smaller and higher values...)

Thresholding basically leaves over only 2 values, zero or maximum, for each color channel. Values below the threshold get set to 0, values above it get set to 255 (or 65535 if working at 16-bit depth).

Depending on your original.jpg, you may have a OCR-able, working, very high contrast image as a result.

answered Oct 02 '22 11:10

Kurt Pfeifle

I suggest you to:

use a tool to edit the boxes, such jTessBoxEditor, it's so helpful and let you winning a time. You can install it easily from here
it's good idea to train the letters of actual situation (noisy, blurred). Your training set is still limited, you can add more training samples.
I recommend you to use Tesseract's API themselves to enhance the image (denoise, normalize, sharpen...) for example : Boxa * tesseract::TessBaseAPI::GetConnectedComponents(Pixa** pixa) (it allows you to get to the bounding boxes of each character)

Pix* pimg = tess_api->GetThresholdedImage();

Here you find few examples

answered Oct 02 '22 11:10

Y.AL

Tesseract is a pretty decent OCR package, but doesn't pre-process images properly. My experience is that you can get a good OCR result if you just do some pre-processing before passing it on to tesseract.

There are a couple of key pointers that improves recognition significantly:

Remove background noise. Basically this means using mean adaptive thresholding. I'd also ensure that the characters are black and the background is white.
Use the correct resolution. If you get bad results, scale the image up or down until you get good results. You want to aim at approx. font size 14 at 300 dpi; in my software that processes invoices that works best.
Don't store images as JPEG; use BMP or PNG or something else that doesn't make the image noisy.
If you're only using one or two fonts, try training tesseract on these fonts.

As for point 4, if you know the font that's going to be used, there are some better solutions than using Tesseract like matching these fonts directly on the images... The basic algoritm is to find the digits and match them to all possible characters (which are only 10)... still, the implementation is tricky.

answered Oct 02 '22 13:10

atlaste

Related questions
                            
                                np.rot90() corrupts an opencv image
                            
                                in Python use of hierarchy for findContours
                            
                                Finding Minimum Distance between Contours
                            
                                HoughlinesP parameters "threshold" and "minLineLength"
                            
                                Size of BoundingBox/ROI to track object keeps on increasing despite fixed initial size
                            
                                How to convert image file object to numpy array in with openCv python?
                            
                                How to play any video with a fixed frame rate (fps) using OpenCV?
                            
                                How To Measure Contrast in OpenCV + Visual C++
                            
                                Python freezes after cv2.destroyWindow()
                            
                                Correct way to extract Translation from Essential Matrix through SVD
                            
                                Computer Vision: How to split horizontally an image by the line with least entropy?
                            
                                ImportError: numpy.core.multiarray failed to import while using mod_wsgi
                            
                                Android document scanner using opencv
                            
                                OpenCV with AWS Lambda
                            
                                Using OpenCV in Hololens
                            
                                How to apply RANSAC in Python OpenCV
                            
                                Python - Perspective transform for OpenCV from a rotation angle
                            
                                How to use OpenCV functions in Keras Lambda Layer?
                            
                                How do I get the ROI coordinates based on my prediction?
                            
                                Pattern Recognition using OpenCV

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Training Tesseract 3 to recognize numbers from real images of gas meters

Tags:

opencv

ocr

tesseract

Alessandro

People also ask

4 Answers

valentt

Kurt Pfeifle

Y.AL

atlaste

Recent Activity

Donate For Us