Shape/Pattern Matching Approach in Computer Vision

Tags:

I am currently facing a, in my opinion, rather common problem which should be quite easy to solve but so far all my approached have failed so I am turning to you for help.

I think the problem is explained best with some illustrations. I have some Patterns like these two:

Pattern 1 Pattern 3

I also have an Image like (probably better, because the photo this one originated from was quite poorly lit) this:

(Note how the Template was scaled to kinda fit the size of the image)

The ultimate goal is a tool which determines whether the user shows a thumb up/thumbs down gesture and also some angles in between. So I want to match the patterns against the image and see which one resembles the picture the most (or to be more precise, the angle the hand is showing). I know the direction in which the thumb is showing in the pattern, so if i find the pattern which looks identical I also have the angle.

I am working with OpenCV (with Python Bindings) and already tried cvMatchTemplate and MatchShapes but so far its not really working reliably.

I can only guess why MatchTemplate failed but I think that a smaller pattern with a smaller white are fits fully into the white area of a picture thus creating the best matching factor although its obvious that they dont really look the same.

Are there some Methods hidden in OpenCV I havent found yet or is there a known algorithm for those kinds of problem I should reimplement?

Happy New Year.

555

asked Dec 27 '11 12:12

Nicolas

1 Answers

A few simple techniques could work:

After binarization and segmentation, find Feret's diameter of the blob (a.k.a. the farthest distance between points, or the major axis).
Find the convex hull of the point set, flood fill it, and treat it as a connected region. Subtract the original image with the thumb. The difference will be the area between the thumb and fist, and the position of that area relative to the center of mass should give you an indication of rotation.
Use a watershed algorithm on the distances of each point to the blob edge. This can help identify the connected thin region (the thumb).
Fit the largest circle (or largest inscribed polygon) within the blob. Dilate this circle or polygon until some fraction of its edge overlaps the background. Subtract this dilated figure from the original image; only the thumb will remain.
If the size of the hand is consistent (or relatively consistent), then you could also perform N morphological erode operations until the thumb disappears, then N dilate operations to grow the fist back to its original approximate size. Subtract this fist-only blob from the original blob to get the thumb blob. Then uses the thumb blob direction (Feret's diameter) and/or center of mass relative to the fist blob center of mass to determine direction.

Techniques to find critical points (regions of strong direction change) are trickier. At the simplest, you might also use corner detectors and then check the distance from one corner to another to identify the place when the inner edge of the thumb meets the fist.

For more complex methods, look into papers about shape decomposition by authors such as Kimia, Siddiqi, and Xiaofing Mi.

110

answered Oct 06 '22 04:10

Rethunk

Related questions
                            
                                Python conversion of PIL image to numpy array very slow
                            
                                OpenCV VideoCapture and error: (-215:Assertion failed) !_src.empty() in function 'cv::cvtColor'
                            
                                How can I integrate OpenCV 4.0 into a pure C++ Android NDK project?
                            
                                How to compute the Delta E between two images using OpenCV
                            
                                How to open an image from an url with opencv using requests from python
                            
                                How to mask image with binary mask
                            
                                How to convert a rgb image into a cmyk?
                            
                                How to send a cv::Mat to python over shared memory?
                            
                                Generate video from numpy arrays with openCV
                            
                                Detecting angle difference between two circular objects
                            
                                OpenCV cvLoadImage() does not load images in visual studio debugger?
                            
                                Find rectangles without corners using opencv
                            
                                Rotation matrix openCV
                            
                                "Segmentation fault" during "import cv" on Mac OS
                            
                                Calculate Intrinsics for a Thermal Camera?
                            
                                openCV vs GIMP, edge detection fails in openCV
                            
                                OpenCV/Android compilation errors
                            
                                Correct YUV422 to RGB conversion
                            
                                Recommendations for real-time pixel-level analysis of television (TV) video
                            
                                How to know if matchTemplate found an object or not?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Shape/Pattern Matching Approach in Computer Vision

Tags:

pattern-matching

design-patterns

opencv

computer-vision

vision

Nicolas

People also ask

1 Answers

Rethunk

Recent Activity

Donate For Us