Why does pytesseract fail to recognise digits from image with darker background?

Tags:

I've this python code which I use to convert a text written in a picture to a string, it does work for certain images which have large characters, but not for the one I'm trying right now which contains only digits.

This is the picture:

Digits

This is my code:

import pytesseract
from PIL import Image

img = Image.open('img.png')
pytesseract.pytesseract.tesseract_cmd = 'C:/Program Files (x86)/Tesseract-OCR/tesseract'
result = pytesseract.image_to_string(img)
print (result)

Why is it failing at recognising this specific image and how can I solve this problem?

996

asked May 05 '19 17:05

alioua walid

1 Answers

I have two suggestions.

First, and this is by far the most important, in OCR preprocessing images is key to obtaining good results. In your case I suggest binarization. Your images look extremely good so you shouldn't have any problem but if you do, then maybe you should try to binarize your images:

import cv2
from PIL import Image

img = cv2.imread('gradient.png')
# If your image is not already grayscale :
# img = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
threshold = 180 # to be determined
_, img_binarized = cv2.threshold(img, threshold, 255, cv2.THRESH_BINARY)
pil_img = Image.fromarray(img_binarized)

And then try the ocr again with the binarized image.

Check if your image is in grayscale and uncomment if needed.

This is simple thresholding. Adaptive thresholding also exists but it is noisy and does not bring anything in your case.

Binarized images will be much easier for Tesseract to handle. This is already done internally (https://github.com/tesseract-ocr/tesseract/wiki/ImproveQuality) but sometimes things can be messed up and very often it's useful to do your own preprocessing.

You can check if the threshold value is right by looking at the images :

import matplotlib.pyplot as plt
plt.imshow(img, cmap='gray')
plt.imshow(img_binarized, cmap='gray')

Second, if what I said above still doesn't work, I know this doesn't answer "why doesn't pytesseract work here" but I suggest you try out tesserocr. It is a maintained python wrapper for Tesseract.

You could try:

import tesserocr
text_from_ocr = tesserocr.image_to_text(pil_img)

Here is the doc for tesserocr from pypi : https://pypi.org/project/tesserocr/

And for opencv : https://pypi.org/project/opencv-python/

As a side-note, black and white is treated symetrically in Tesseract so having white digits on a black background is not a problem.

answered Sep 21 '22 08:09

Ashargin

Related questions
                            
                                How to determine the uncertainty of fit parameters with Python?
                            
                                Possible values for platform.machine()
                            
                                Flask - store object directly in a session [duplicate]
                            
                                Connecting to Amazon Aurora using SQLAlchemy
                            
                                metaclass and __prepare__ ()
                            
                                Flask's built-in server always 404 with SERVER_NAME set
                            
                                Python Abstract class with concrete methods
                            
                                python - how to docstring kwargs and their expected types
                            
                                How to bulk write TFRecords?
                            
                                Render current status only on template in StreamingHttpResponse in Django
                            
                                Django OneToOneField default value
                            
                                Call an async function in an normal function
                            
                                How to generate random numbers to satisfy a specific mean and median in python?
                            
                                How to use SMOP to convert Matlab into Python code
                            
                                How do I use an AWS SessionToken to read from S3 in pyspark?
                            
                                How to run Keras.model() for prediction inside a tensorflow session?
                            
                                Python freezes on smtplib.SMTP("smtp.gmail.com", 587)
                            
                                How to use boto3 client with Python multiprocessing?
                            
                                Pytest: How to parametrize a test with a list that is returned from a fixture?
                            
                                Full gradient descent in keras

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why does pytesseract fail to recognise digits from image with darker background?

Tags:

python

python-3.x

ocr

python-tesseract

alioua walid

People also ask

1 Answers

Ashargin

Recent Activity

Donate For Us