I am trying to do OCR on this toy example of receipts, using Python 2.7 and OpenCV 3.1.
The pipeline is: grayscale + blur + external edge detection + segmentation of each area of the receipt (for example "Category", to see later which option is marked; in this case, cash).
I find it complicated, when the image is "skewed", to properly transform it and then "automatically" segment each section of the receipt.
Example:
Any suggestions?
The code below is an example that gets as far as the edge detection, but only when the receipt looks like the first image. My issue is not the image-to-text step; it is the pre-processing of the image.
Any help is more than appreciated! :)
import os
os.chdir()  # put your own directory here

import cv2
import numpy as np

# Read the receipt directly as a grayscale image
image = cv2.imread("Rent-Receipt.jpg", cv2.IMREAD_GRAYSCALE)

# Smooth the image to reduce noise before edge detection
blurred = cv2.GaussianBlur(image, (5, 5), 0)
#blurred = cv2.bilateralFilter(image, 9, 75, 75)

# Apply Canny edge detection
edged = cv2.Canny(blurred, 0, 20)

# Find external contours (OpenCV 3.x returns image, contours, hierarchy)
(_, contours, _) = cv2.findContours(edged, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_NONE)
An OCR (optical character recognition) pipeline built with OpenCV is designed to read an image file provided by the user and then recognize the text within the image so it can be displayed to the user.
Character segmentation is an operation that seeks to decompose an image of a sequence of characters into subimages of individual symbols. It is one of the decision processes in a system for optical character recognition (OCR).
Tesseract 4 added a deep-learning-based capability: an OCR engine based on an LSTM network (a kind of recurrent neural network) that focuses on line recognition. It also still supports the legacy Tesseract 3 engine, which works by recognizing character patterns. The latest stable version is 4.1.
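For reference, here is a minimal sketch of driving Tesseract from Python via the pytesseract wrapper; it assumes Tesseract 4.x and pytesseract are installed, and "receipt.png" is a placeholder file name:

import cv2
import pytesseract

# Read the (already pre-processed) receipt as grayscale
img = cv2.imread("receipt.png", cv2.IMREAD_GRAYSCALE)

# --oem 1 selects the LSTM engine; --oem 0 would select the legacy engine
# (if its traineddata is available). --psm 6 assumes a uniform block of text.
text = pytesseract.image_to_string(img, config="--oem 1 --psm 6")
print(text)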
A great tutorial on the first step you described is available at pyimagesearch (and they have great tutorials in general).
In short, as described by Ella, you would have to use cv2.CHAIN_APPROX_SIMPLE. A slightly more robust method would be to use cv2.RETR_LIST instead of cv2.RETR_EXTERNAL and then sort the areas, as this should work decently even on white backgrounds, or when the page inscribes a bigger shape in the background, etc.
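A rough sketch of that idea, reusing the edged image from the question (treating the largest contour as the receipt is an assumption that holds when the receipt dominates the frame):

# Find all contours, then keep the largest one as the receipt outline
(_, contours, _) = cv2.findContours(edged, cv2.RETR_LIST, cv2.CHAIN_APPROX_SIMPLE)
contours = sorted(contours, key=cv2.contourArea, reverse=True)
page = contours[0]  # assumption: the biggest area is the page/receipt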
Coming to the second part of your question, a good way to segment the characters would be to use the maximally stable extremal regions (MSER) extractor available in OpenCV. A complete implementation in C++ is available here, in a project I was helping out with recently. The Python implementation would go along the lines of the following (the code below works for OpenCV 3.0+; for the OpenCV 2.x syntax, look it up online):
import cv2

img = cv2.imread('test.jpg')
mser = cv2.MSER_create()

# Resize the image so that MSER can work better
img = cv2.resize(img, (img.shape[1] * 2, img.shape[0] * 2))

gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
vis = img.copy()

# Detect MSER regions and draw the convex hull of each one
regions = mser.detectRegions(gray)
hulls = [cv2.convexHull(p.reshape(-1, 1, 2)) for p in regions[0]]
cv2.polylines(vis, hulls, True, (0, 255, 0))

cv2.namedWindow('img', cv2.WINDOW_NORMAL)
cv2.imshow('img', vis)
while cv2.waitKey() != ord('q'):
    continue
cv2.destroyAllWindows()
This gives the following output:
Now, to eliminate the false positives, you can simply cycle through the points in hulls and calculate the perimeter (the sum of the distances between all adjacent points in hulls[i], where hulls[i] is the list of all points in one convex hull). If the perimeter is too large, classify the region as not a character.
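A sketch of that filter, using cv2.arcLength to compute each hull's perimeter (the 100-pixel threshold is an arbitrary assumption that needs tuning to your image scale):

# Keep only hulls whose perimeter is small enough to plausibly be a character
MAX_PERIMETER = 100  # assumption: tune this threshold for your images
char_hulls = [h for h in hulls if cv2.arcLength(h, True) < MAX_PERIMETER]
cv2.polylines(vis, char_hulls, True, (0, 255, 0))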
The diagonal lines across the image appear because the border of the image is black. They can simply be removed by adding the following line as soon as the image is read:
img = img[5:-5, 5:-5, :]  # crop 5 pixels off each side to drop the black border
which gives the output:
The option on the top of my head requires extracting the 4 corners of the skewed image. This is done by using cv2.CHAIN_APPROX_SIMPLE instead of cv2.CHAIN_APPROX_NONE when finding contours. Afterwards, you could use cv2.approxPolyDP and hopefully be left with the 4 corners of the receipt (if all your images are like this one, there is no reason why it shouldn't work).
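A sketch of that corner extraction, reusing the edged image from the question (the 2% epsilon passed to cv2.approxPolyDP is a common starting point, not a tuned value):

# Approximate the largest contour with a polygon; a receipt should give 4 points
(_, contours, _) = cv2.findContours(edged, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
receipt = max(contours, key=cv2.contourArea)
peri = cv2.arcLength(receipt, True)
approx = cv2.approxPolyDP(receipt, 0.02 * peri, True)
if len(approx) == 4:
    corners = approx.reshape(4, 2)  # the 4 corners of the skewed receipt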
Now use cv2.findHomography and cv2.warpPerspective to rectify the image, where the source points are the 4 points extracted from the skewed image and the destination points should form a rectangle, for example the full image dimensions.
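A minimal sketch of that rectification, assuming corners holds the 4 points from the previous step ordered top-left, top-right, bottom-right, bottom-left (ordering them robustly is omitted here, and the 800x1200 output size is an arbitrary choice):

import numpy as np

w, h = 800, 1200  # assumption: desired output size of the rectified receipt
src = np.float32(corners)
dst = np.float32([[0, 0], [w - 1, 0], [w - 1, h - 1], [0, h - 1]])

# Map the skewed corners onto the rectangle and warp the whole image
H, _ = cv2.findHomography(src, dst)
rectified = cv2.warpPerspective(image, H, (w, h))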
Here you can find code samples and more information: OpenCV - Geometric Transformations of Images.
Also, this answer may be useful: SO - Detect and fix text skew.
EDIT: Corrected the second chain approx to cv2.CHAIN_APPROX_NONE.