I wonder how we evaluate feature detection/extraction methods (SIFT, SURF, MSER, ...) for object detection and tracking tasks such as pedestrians or lane vehicles. Are there standard metrics for comparison? I have read blog posts like http://computer-vision-talks.com/2011/07/comparison-of-the-opencvs-feature-detection-algorithms-ii/ and some research papers like this. The problem is that the more I learn, the more confused I get.
3.1 Feature detection evaluation. The selected algorithms are SIFT, SURF, FAST, BRISK, and ORB. The selected detectors are applied to three images to locate keypoints. Each image contains a single object.
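For concreteness, here is a minimal OpenCV (Python) sketch of this step: run several detectors on one image and compare keypoint counts. The image path is hypothetical, and SURF is omitted because it requires a non-free opencv-contrib build.

```python
import cv2

# Hypothetical path to one of the single-object test images
img = cv2.imread("object.jpg", cv2.IMREAD_GRAYSCALE)

# SURF (cv2.xfeatures2d.SURF_create) needs an opencv-contrib build
# with non-free algorithms enabled, so it is left out here.
detectors = {
    "SIFT":  cv2.SIFT_create(),
    "FAST":  cv2.FastFeatureDetector_create(),
    "BRISK": cv2.BRISK_create(),
    "ORB":   cv2.ORB_create(),
}

for name, det in detectors.items():
    keypoints = det.detect(img, None)
    print(f"{name}: {len(keypoints)} keypoints")
```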
Feature extraction. In object detection frameworks, people typically use pretrained image classification models to extract visual features, as these tend to generalise fairly well (e.g. a model trained on the MS COCO dataset can extract fairly generic features).
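As an illustration (not from the original answer), a common way to do this with PyTorch/torchvision is to take an ImageNet-pretrained classifier such as ResNet-50 and drop its classification head; the file name below is hypothetical.

```python
import torch
import torchvision.models as models
import torchvision.transforms as T
from PIL import Image

# Pretrained classifier reused as a generic feature extractor
backbone = models.resnet50(weights=models.ResNet50_Weights.DEFAULT)
backbone.fc = torch.nn.Identity()  # drop the classification head
backbone.eval()

preprocess = T.Compose([
    T.Resize(256), T.CenterCrop(224), T.ToTensor(),
    T.Normalize(mean=[0.485, 0.456, 0.406],
                std=[0.229, 0.224, 0.225]),
])

image = Image.open("frame.jpg").convert("RGB")  # hypothetical path
with torch.no_grad():
    features = backbone(preprocess(image).unsqueeze(0))  # shape (1, 2048)
```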
Feature extraction refers to the process of transforming raw data into numerical features that can be processed while preserving the information in the original data set. It typically yields better results than applying machine learning directly to the raw data.
It is very hard to evaluate feature detectors per se, because features are only computational artifacts, not things you are actually searching for in images. Feature detectors do not make sense outside their intended context, which, for the descriptors you mention, is affine-invariant image patch matching.
The very first use of SIFT, SURF, and MSER was in multi-view reconstruction and automatic 3D reconstruction pipelines. Thus, these features are usually assessed by the quality of the 3D reconstruction or image patch matching they provide. Roughly speaking, you have a pair of images related by a known transform (an affinity or a homography) and you measure the difference between the estimated homography (from the feature detector) and the real one. This is also the method used in the blog post you quote, by the way.
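Here is a minimal sketch of that evaluation, assuming SIFT in OpenCV (Python) and a ground-truth homography file in the style of the Oxford graffiti sequences; the file names are hypothetical.

```python
import cv2
import numpy as np

img1 = cv2.imread("graf1.png", cv2.IMREAD_GRAYSCALE)  # hypothetical paths
img2 = cv2.imread("graf3.png", cv2.IMREAD_GRAYSCALE)
H_gt = np.loadtxt("H1to3p")  # 3x3 ground-truth homography (plain text)

sift = cv2.SIFT_create()
kp1, des1 = sift.detectAndCompute(img1, None)
kp2, des2 = sift.detectAndCompute(img2, None)

# Lowe's ratio test on 2-nearest-neighbour matches
matcher = cv2.BFMatcher(cv2.NORM_L2)
pairs = matcher.knnMatch(des1, des2, k=2)
good = [m for m, n in pairs if m.distance < 0.75 * n.distance]

pts1 = np.float32([kp1[m.queryIdx].pt for m in good]).reshape(-1, 1, 2)
pts2 = np.float32([kp2[m.trainIdx].pt for m in good]).reshape(-1, 1, 2)
H_est, _ = cv2.findHomography(pts1, pts2, cv2.RANSAC, 3.0)

# Transfer error: project the corners of img1 with both homographies
h, w = img1.shape
corners = np.float32([[0, 0], [w, 0], [w, h], [0, h]]).reshape(-1, 1, 2)
err = np.linalg.norm(cv2.perspectiveTransform(corners, H_gt)
                     - cv2.perspectiveTransform(corners, H_est), axis=2)
print("mean corner transfer error (px):", err.mean())
```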
In order to assess the practical interest of a detector (and not only its precision in an ideal multi-view pipeline), some additional measurements of stability (under geometric and photometric changes) were added: does the number of detected features vary, does the quality of the estimated homography vary, etc.
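One such stability measure is repeatability. Below is a simplified, point-based sketch (the classic criterion of Mikolajczyk et al. uses region overlap instead, and also restricts counting to the shared field of view; the eps threshold here is illustrative).

```python
import cv2
import numpy as np

def repeatability(det, img1, img2, H_gt, eps=2.5):
    """Fraction of keypoints from img1 that, once projected into img2
    with the ground-truth homography, land within eps pixels of a
    keypoint detected in img2. Boundary handling is omitted."""
    kp1 = det.detect(img1, None)
    kp2 = det.detect(img2, None)
    if not kp1 or not kp2:
        return 0.0
    pts1 = np.float32([k.pt for k in kp1]).reshape(-1, 1, 2)
    proj = cv2.perspectiveTransform(pts1, H_gt).reshape(-1, 2)
    pts2 = np.float32([k.pt for k in kp2])
    # pairwise distances between projected and detected keypoints
    d = np.linalg.norm(proj[:, None, :] - pts2[None, :, :], axis=2)
    repeated = int((d.min(axis=1) < eps).sum())
    return repeated / min(len(kp1), len(kp2))

# e.g. repeatability(cv2.ORB_create(), img1, img2, H_gt)
```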
Incidentally, it happens that these detectors may also work (although that was not their design purpose) for object detection and tracking (in tracking-by-detection settings). In this case, their performance is classically evaluated on more-or-less standardized image datasets, and typically expressed in terms of precision (the probability of a correct answer, linked to the false alarm rate) and recall (the probability of finding an object when it is present). You can read, for example, Wikipedia on this topic.
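As a sketch of how precision and recall are typically computed for detection (a generic IoU-based matching scheme, not something prescribed by the answer above), with a detection counted as a true positive when its overlap with an unmatched ground-truth box exceeds a threshold:

```python
import numpy as np

def iou(a, b):
    # boxes as (x1, y1, x2, y2)
    x1, y1 = max(a[0], b[0]), max(a[1], b[1])
    x2, y2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, x2 - x1) * max(0, y2 - y1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)

def precision_recall(detections, ground_truth, thr=0.5):
    tp, matched = 0, set()
    for det in detections:
        for i, gt in enumerate(ground_truth):
            if i not in matched and iou(det, gt) >= thr:
                tp += 1
                matched.add(i)
                break
    fp = len(detections) - tp   # detections matching no object
    fn = len(ground_truth) - tp  # objects that were missed
    precision = tp / (tp + fp) if detections else 0.0
    recall = tp / (tp + fn) if ground_truth else 0.0
    return precision, recall
```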
Addendum: What exactly do I mean by incidentally?
Well, as written above, SIFT and the like were designed to match planar, textured image parts. This is why you always see examples with similar images from a graffiti dataset.
Their extension to detection and tracking was then developed in two different ways: