Extracting text out of images

1 Answer

from PIL import Image
import pytesseract
import argparse
import cv2

# construct the argument parser and parse the arguments
ap = argparse.ArgumentParser()
ap.add_argument("-i", "--image", required=True, help="Path to the image")
args = vars(ap.parse_args())

# load the image and display it
image = cv2.imread(args["image"])
cv2.imshow("Original", image)

# apply an "average" blur with a 3x3 kernel to the image
blurred = cv2.blur(image, (3, 3))
cv2.imshow("Blurred_image", blurred)

# OpenCV stores pixels as BGR, PIL expects RGB, so convert before OCR
img = Image.fromarray(cv2.cvtColor(blurred, cv2.COLOR_BGR2RGB))
text = pytesseract.image_to_string(img, lang='eng')
print(text)
cv2.waitKey(0)

As a result I get: "Stay: in an Overwoter Bungalow $3»"

What about using contours to find the unnecessary blobs and removing them before running OCR? That might work.

answered Oct 17 '22 12:10

Deepan Raj


Tags: python, image-processing, ocr, tesseract, python-tesseract

Yash Arora
