Tesseract does not recognize an image containing a single number

Tags:

tesseract

I am using Tesseract with Python. It recognizes almost all of my images that contain two or more numbers or characters, but it cannot recognize an image with only one number. I also tried the command line, and it gives me "empty page" as the response.

I don't want to train Tesseract with "only digits" because I am recognizing characters too.

What is the problem?

Below is the image that is not recognized by Tesseract.

[image]

Code:

 from PIL import Image
 import pytesseract

 # getPng(pathImg, '3') builds the path to the image file.
 pytesseract.image_to_string(Image.open(getPng(pathImg, '3')))

asked Mar 26 '18 20:03

Luiza Rodrigues

2 Answers

If you add the parameter --psm 13, it should work, because Tesseract will then treat the image as a raw text line instead of searching for pages and paragraphs.

So try:

pytesseract.image_to_string(PATH, config="--psm 13")

answered Sep 24 '22 20:09

sinecode
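
As a rough illustration of the --psm suggestion in this answer, here is a minimal sketch that tries several page segmentation modes in turn until one returns text. The helper names psm_config and read_with_fallback, and the injectable ocr argument, are made up for this example. Trying mode 10 (which Tesseract documents as "treat the image as a single character") after mode 13 is an addition not mentioned in the answer.

```python
def psm_config(psm):
    """Build the config string for a given page segmentation mode."""
    return f"--psm {psm}"


def read_with_fallback(image, psm_modes=(13, 10), ocr=None):
    """Try each page segmentation mode until one yields non-empty text.

    `ocr` is a hypothetical hook for swapping in another OCR callable;
    by default pytesseract.image_to_string is used.
    """
    if ocr is None:
        # Imported lazily so psm_config can be used without pytesseract.
        import pytesseract
        ocr = pytesseract.image_to_string

    for psm in psm_modes:
        text = ocr(image, config=psm_config(psm)).strip()
        if text:
            return text
    return ""  # no mode produced any text
```

With an image opened as in the question, the call would be read_with_fallback(Image.open(getPng(pathImg, '3'))). Whether mode 13 or mode 10 works better probably depends on the image, so the order in psm_modes is only a starting point.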

Try converting the image to grayscale and then to a binary image; most probably Tesseract will then read it. If not, duplicate the image so that there are two characters to read, and then simply extract a single character from the result.

answered Sep 24 '22 20:09

Ashane.E
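
The preprocessing described in this answer could be sketched with Pillow roughly as follows. The function names, the threshold of 128, and the 10-pixel gap between the two copies are assumptions chosen for illustration and are not values from the answer; they would likely need tuning for real images.

```python
from PIL import Image


def binarize(image, threshold=128):
    """Convert to grayscale, then map every pixel to pure black or white."""
    gray = image.convert("L")
    return gray.point(lambda p: 255 if p >= threshold else 0)


def duplicate_side_by_side(image, gap=10, background=255):
    """Paste two copies of a grayscale image next to each other."""
    width, height = image.size
    canvas = Image.new("L", (2 * width + gap, height), background)
    canvas.paste(image, (0, 0))
    canvas.paste(image, (width + gap, 0))
    return canvas


def first_char_of_doubled(text):
    """Keep one copy of the character from OCR output of the doubled image."""
    stripped = "".join(text.split())  # drop spaces and newlines
    return stripped[:1]
```

A possible flow is to pass binarize(img) to pytesseract.image_to_string first and, only if that returns nothing, pass duplicate_side_by_side(binarize(img)) and run the result through first_char_of_doubled.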
