How to process and extract text from image

I'm trying to extract text from an image using Python with cv2 and pytesseract. The results are poor, and I can't figure out how to improve my code. I believe the image needs to be preprocessed before the text is extracted, but I'm not sure how.

Sample image

I've tried converting it to black and white, but no luck.

import cv2
import os
import pytesseract
from PIL import Image
import time

pytesseract.pytesseract.tesseract_cmd = r'C:\Program Files\Tesseract-OCR\tesseract.exe'

# Open the second camera through DirectShow and request a high resolution
cam = cv2.VideoCapture(1, cv2.CAP_DSHOW)
cam.set(cv2.CAP_PROP_FRAME_WIDTH, 8000)
cam.set(cv2.CAP_PROP_FRAME_HEIGHT, 6000)

try:
    while True:
        return_value, image = cam.read()
        if not return_value:
            # No frame was captured; stop instead of passing None to cvtColor
            break
        image = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
        # Crop to the region of interest
        image = image[127:219, 508:722]
        #(thresh, image) = cv2.threshold(image, 128, 255, cv2.THRESH_BINARY | cv2.THRESH_OTSU)
        cv2.imwrite('test.jpg', image)
        print('Text detected: {}'.format(pytesseract.image_to_string(Image.open('test.jpg'))))
        time.sleep(2)
finally:
    # Release the camera even when the loop is interrupted (e.g. with Ctrl+C)
    cam.release()
#os.system('del test.jpg')

503

asked Aug 28 '19 15:08

idar

1 Answer

Preprocessing to clean the image before performing text extraction can help. Here's a simple approach:

Convert the image to grayscale and sharpen it
Apply Otsu's threshold to obtain a binary image
Perform morphological operations to clean the image
Invert the image

First we convert to grayscale, then sharpen the image using a sharpening kernel.

Next we apply Otsu's threshold to obtain a binary image.

Now we perform a morphological close to fill small gaps and smooth the image.

Finally we invert the image.

import cv2
import numpy as np

# Load the image, convert it to grayscale, and sharpen it
image = cv2.imread('1.jpg')
gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
sharpen_kernel = np.array([[-1,-1,-1], [-1,9,-1], [-1,-1,-1]])
sharpen = cv2.filter2D(gray, -1, sharpen_kernel)

# Otsu's threshold, inverted: pixels at or below the threshold become white
thresh = cv2.threshold(sharpen, 0, 255, cv2.THRESH_BINARY_INV + cv2.THRESH_OTSU)[1]

# Morphological close to fill small gaps, then invert the result
kernel = cv2.getStructuringElement(cv2.MORPH_RECT, (3,3))
close = cv2.morphologyEx(thresh, cv2.MORPH_CLOSE, kernel, iterations=1)
result = 255 - close

# Show each intermediate stage; press any key to exit
cv2.imshow('sharpen', sharpen)
cv2.imshow('thresh', thresh)
cv2.imshow('close', close)
cv2.imshow('result', result)
cv2.waitKey()

102

answered Sep 28 '22 20:09

nathancy

Tags: python, image, image-processing, opencv, python-tesseract
