What is the best way to detect the corners of an invoice/receipt/sheet-of-paper in a photo? This is to be used for subsequent perspective correction, before OCR. <h3>My current approach has been:</h3> RGB > Gray > Canny Edge Detection with thresholding > Dilate(1) > Remove small objects(6) > clear boarder objects > pick larges blog based on Convex Area. > [corner detection - Not implemented] I can't help but think there must be a more robust 'intelligent'/statistical approach to handle this type of segmentation. I don't have a lot of training examples, but I could probably get 100 images together. <h3>Broader context:</h3> I'm using matlab to prototype, and planning to implement the system in OpenCV and Tesserect-OCR. This is the first of a number of image processing problems I need to solve for this specific application. So I'm looking to roll my own solution and re-familiarize myself with image processing algorithms. Here are some sample image that I'd like the algorithm to handle: If you'd like to take up the challenge the large images are at http://madteckhead.com/tmp <img src="https://i.stack.imgur.com/z13MW.jpg" alt="case 1"> (source: madteckhead.com) <img src="https://i.stack.imgur.com/6GOPm.jpg" alt="case 2"> (source: madteckhead.com) <img src="https://i.stack.imgur.com/y6Nq8.jpg" alt="case 3"> (source: madteckhead.com) <img src="https://i.stack.imgur.com/Mzaud.jpg" alt="case 4"> (source: madteckhead.com) <h3>In the best case this gives:</h3> <img src="https://i.stack.imgur.com/817rw.jpg" alt="case 1 - canny"> (source: madteckhead.com) <img src="https://i.stack.imgur.com/NFior.jpg" alt="case 1 - post canny"> (source: madteckhead.com) <img src="https://i.stack.imgur.com/suaqb.jpg" alt="case 1 - largest blog"> (source: madteckhead.com) <h3>However it fails easily on other cases:</h3> <img src="https://i.stack.imgur.com/nKNwp.jpg" alt="case 2 - canny"> (source: madteckhead.com) <img src="https://i.stack.imgur.com/ZqznM.jpg" alt="case 2 - post canny"> (source: madteckhead.com) <img src="https://i.stack.imgur.com/X8hba.jpg" alt="case 2 - largest blog"> (source: madteckhead.com) Thanks in advance for all the great ideas! I love SO! <h3>EDIT: Hough Transform Progress</h3> Q: What algorithm would cluster the hough lines to find corners? Following advice from answers I was able to use the Hough Transform, pick lines, and filter them. My current approach is rather crude. I've made the assumption the invoice will always be less than 15deg out of alignment with the image. I end up with reasonable results for lines if this is the case (see below). But am not entirely sure of a suitable algorithm to cluster the lines (or vote) to extrapolate for the corners. The Hough lines are not continuous. And in the noisy images, there can be parallel lines so some form or distance from line origin metrics are required. Any ideas? <img src="https://web.archive.org/web/20130913020727/http://madteckhead.com/tmp/IMG_0773_hough.jpg" alt="case 1"><img src="https://i.stack.imgur.com/NYGGm.jpg" alt="case 2"> <img src="https://i.stack.imgur.com/toesV.jpg" alt="case 3"> <img src="https://i.stack.imgur.com/Cyq6f.jpg" alt="case 4"> (source: madteckhead.com)

I'm Martin's friend who was working on this earlier this year. This was my first ever coding project, and kinda ended in a bit of a rush, so the code needs some errr...decoding... I'll give a few tips from what I've seen you doing already, and then sort my code on my day off tomorrow. First tip, <code>OpenCV</code> and <code>python</code> are awesome, move to them as soon as possible. :D Instead of removing small objects and or noise, lower the canny restraints, so it accepts more edges, and then find the largest closed contour (in OpenCV use <code>findcontour()</code> with some simple parameters, I think I used <code>CV_RETR_LIST</code>). might still struggle when it's on a white piece of paper, but was definitely providing best results. For the <code>Houghline2()</code> Transform, try with the <code>CV_HOUGH_STANDARD</code> as opposed to the <code>CV_HOUGH_PROBABILISTIC</code>, it'll give rho and theta values, defining the line in polar coordinates, and then you can group the lines within a certain tolerance to those. My grouping worked as a look up table, for each line outputted from the hough transform it would give a rho and theta pair. If these values were within, say 5% of a pair of values in the table, they were discarded, if they were outside that 5%, a new entry was added to the table. You can then do analysis of parallel lines or distance between lines much more easily. Hope this helps.

Algorithm to detect corners of paper sheet in photo

My current approach has been:

RGB > Gray > Canny Edge Detection with thresholding > Dilate(1) > Remove small objects(6) > clear boarder objects > pick larges blog based on Convex Area. > [corner detection - Not implemented]

I can't help but think there must be a more robust 'intelligent'/statistical approach to handle this type of segmentation. I don't have a lot of training examples, but I could probably get 100 images together.

Broader context:

I'm using matlab to prototype, and planning to implement the system in OpenCV and Tesserect-OCR. This is the first of a number of image processing problems I need to solve for this specific application. So I'm looking to roll my own solution and re-familiarize myself with image processing algorithms.

Here are some sample image that I'd like the algorithm to handle: If you'd like to take up the challenge the large images are at http://madteckhead.com/tmp

case 1
_{(source: madteckhead.com)}

case 2
_{(source: madteckhead.com)}

case 3
_{(source: madteckhead.com)}

case 4
_{(source: madteckhead.com)}

In the best case this gives:

case 1 - canny
_{(source: madteckhead.com)}

case 1 - post canny
_{(source: madteckhead.com)}

case 1 - largest blog
_{(source: madteckhead.com)}

However it fails easily on other cases:

case 2 - canny
_{(source: madteckhead.com)}

case 2 - post canny
_{(source: madteckhead.com)}

case 2 - largest blog
_{(source: madteckhead.com)}

Thanks in advance for all the great ideas! I love SO!

EDIT: Hough Transform Progress

Q: What algorithm would cluster the hough lines to find corners? Following advice from answers I was able to use the Hough Transform, pick lines, and filter them. My current approach is rather crude. I've made the assumption the invoice will always be less than 15deg out of alignment with the image. I end up with reasonable results for lines if this is the case (see below). But am not entirely sure of a suitable algorithm to cluster the lines (or vote) to extrapolate for the corners. The Hough lines are not continuous. And in the noisy images, there can be parallel lines so some form or distance from line origin metrics are required. Any ideas?

case 1 case 2 case 3 case 4
_{(source: madteckhead.com)}

422

asked Jul 02 '11 07:07

Nathan Keller

1 Answers

I'm Martin's friend who was working on this earlier this year. This was my first ever coding project, and kinda ended in a bit of a rush, so the code needs some errr...decoding... I'll give a few tips from what I've seen you doing already, and then sort my code on my day off tomorrow.

First tip, OpenCV and python are awesome, move to them as soon as possible. :D

Instead of removing small objects and or noise, lower the canny restraints, so it accepts more edges, and then find the largest closed contour (in OpenCV use findcontour() with some simple parameters, I think I used CV_RETR_LIST). might still struggle when it's on a white piece of paper, but was definitely providing best results.

For the Houghline2() Transform, try with the CV_HOUGH_STANDARD as opposed to the CV_HOUGH_PROBABILISTIC, it'll give rho and theta values, defining the line in polar coordinates, and then you can group the lines within a certain tolerance to those.

My grouping worked as a look up table, for each line outputted from the hough transform it would give a rho and theta pair. If these values were within, say 5% of a pair of values in the table, they were discarded, if they were outside that 5%, a new entry was added to the table.

You can then do analysis of parallel lines or distance between lines much more easily.

Hope this helps.

111

answered Sep 18 '22 12:09

Daniel Crowley

Related questions
                            
                                Remove White Background from an Image and Make It Transparent
                            
                                Is it possible to tell the quality level of a JPEG?
                            
                                How to fill OpenCV image with one solid color?
                            
                                How do I find Wally with Python?
                            
                                Viola-Jones' face detection claims 180k features
                            
                                What are keypoints in image processing?
                            
                                Python - Find dominant/most common color in an image
                            
                                How does photoshop blend two images together? [closed]
                            
                                inverting image in Python with OpenCV
                            
                                How would I tint an image programmatically on iOS?
                            
                                Merging two images
                            
                                c# Image resizing to different size while preserving aspect ratio
                            
                                GD vs ImageMagick vs Gmagick for jpg? [closed]
                            
                                OpenCV & Python - Image too big to display
                            
                                How does one convert a grayscale image to RGB in OpenCV (Python)?
                            
                                Near-Duplicate Image Detection [closed]
                            
                                How do you composite an image onto another image with PIL in Python?
                            
                                Image fingerprint to compare similarity of many images
                            
                                How can I measure the similarity between two images? [closed]
                            
                                What is "semantic segmentation" compared to "segmentation" and "scene labeling"?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Algorithm to detect corners of paper sheet in photo

Tags:

image-processing

opencv

hough-transform

image-segmentation

edge-detection