The problem that I am running with is to extract the text out of an image and for this I have used Tesseract v3.02. The sample images from which I have to extract text are related to meter readings. Some of them are with solid sheet background and some of them have LED display. I have trained the dataset for solid sheet background and the results are some how effective. The major problem I have now is the text images with LED/LCD background which are not recognized by Tesseract and due to this the training set isn't generated. Can anyone guide me to the right direction on how to use Tesseract with the Seven Segment Display(LCD/LED background) or is there any other alternative that I can use instead of Tesseract. <img src="https://i.stack.imgur.com/peB3w.png" alt="LED background image 1"><img src="https://i.stack.imgur.com/MIe6s.png" alt="LED background image 2"><img src="https://i.stack.imgur.com/2rbal.png" alt="Meter 1 with solid sheet background"><img src="https://i.stack.imgur.com/KUfwD.png" alt="enter image description here"><img src="https://i.stack.imgur.com/oGsK8.png" alt="enter image description here">

https://github.com/upupnaway/digital-display-character-rec/blob/master/digital_display_ocr.py Did this using openCV and tesseract and the "letsgodigital" trained data -steps include edge detection and extracting the display using the largest contour. Then threshold image using otsu or binarization and pass it through pytesseracts image_to_string function.

Text detection on Seven Segment Display via Tesseract OCR

Tags:

ocr

tesseract

seven-segment-display

The problem that I am running with is to extract the text out of an image and for this I have used Tesseract v3.02. The sample images from which I have to extract text are related to meter readings. Some of them are with solid sheet background and some of them have LED display. I have trained the dataset for solid sheet background and the results are some how effective.

The major problem I have now is the text images with LED/LCD background which are not recognized by Tesseract and due to this the training set isn't generated.

Can anyone guide me to the right direction on how to use Tesseract with the Seven Segment Display(LCD/LED background) or is there any other alternative that I can use instead of Tesseract.

LED background image 1 LED background image 2 Meter 1 with solid sheet background enter image description here

657

asked Jul 16 '13 09:07

yunas

1 Answers

https://github.com/upupnaway/digital-display-character-rec/blob/master/digital_display_ocr.py

Did this using openCV and tesseract and the "letsgodigital" trained data

-steps include edge detection and extracting the display using the largest contour. Then threshold image using otsu or binarization and pass it through pytesseracts image_to_string function.

answered Sep 17 '22 14:09

Raymond Ma

Related questions
                            
                                Suggestions for digit recognition
                            
                                Can I use OCR to detect font style (bold, italic)? [closed]
                            
                                what's the best image input type for tesseract?
                            
                                Suggest an OCR Library for iOS [closed]
                            
                                chinese character recognition using Tesseract OCR
                            
                                Scoreboard digit recognition using OpenCV
                            
                                (-215:Assertion failed) !_src.empty() in function 'cv::cvtColor' with cv::imread
                            
                                Stroke Width Transform (SWT) implementation (Java, C#...) [closed]
                            
                                How to convert an image into character segments?
                            
                                Tesseract OCR Library - Learning Font
                            
                                Convert Non-Searchable Pdf to Searchable Pdf in Windows Python
                            
                                What's the best way to ocr as much text as possible from video game screenshots?
                            
                                Open source OCR [closed]
                            
                                Google Cloud Vision - Numbers and Numerals OCR
                            
                                Batch OCR Program for PDFs [closed]
                            
                                Get correct image orientation by Google Cloud Vision api (TEXT_DETECTION)
                            
                                WinError 5:Access denied PyTesseract
                            
                                Select only specific parts of the image
                            
                                Preprocessing poorly scanned handwritten digits

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With