Histogram of Oriented Gradients object detection [closed]

1 Answer

Yes, HOG (Histogram of Oriented Gradients) can be used to detect any kind of object: to a computer, an image is just a grid of pixels, and you can extract features from it regardless of its contents. How effective HOG is at doing so is a separate question.

HOG, SIFT, and other such feature extractors are methods for pulling relevant information out of an image so that it can be described in a more meaningful way. When you want to detect an object or person in an image with thousands (or even millions) of pixels, simply feeding a vector of millions of numbers to a machine learning algorithm is inefficient, because:

1. It will take a large amount of time to complete.
2. There will be a lot of noisy information (background, blur, lighting and rotation changes) that we do not want to treat as important.

The HOG algorithm, specifically, creates histograms of edge orientations from certain patches in images. A patch may come from an object, a person, meaningless background, or anything else; the histogram is merely a way to describe that area using edge information. As mentioned previously, this information can then be fed to a machine learning algorithm, such as a classical support vector machine (SVM), to train a classifier able to distinguish one type of object from another.
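To make the idea of a histogram of edge orientations concrete, here is a minimal NumPy sketch for a single grayscale patch. It is a simplification, not the full HOG pipeline: it assumes one histogram per patch, 9 unsigned orientation bins, and a plain L2 normalization, and it leaves out the cell/block layout and bin interpolation of the original method. The function name `orientation_histogram` and its parameters are made up for this illustration.

```python
import numpy as np

def orientation_histogram(patch, n_bins=9):
    """Simplified HOG-style descriptor for one grayscale patch (illustrative only)."""
    patch = np.asarray(patch, dtype=float)
    # Finite-difference gradients along rows (gy) and columns (gx).
    gy, gx = np.gradient(patch)
    magnitude = np.hypot(gx, gy)
    # Unsigned gradient orientation, folded into [0, 180) degrees.
    angle = np.degrees(np.arctan2(gy, gx)) % 180.0
    # Every pixel votes for its orientation bin, weighted by gradient magnitude.
    hist, _ = np.histogram(angle, bins=n_bins, range=(0.0, 180.0), weights=magnitude)
    # L2-normalize so the descriptor does not depend on overall contrast.
    norm = np.linalg.norm(hist)
    return hist / norm if norm > 0 else hist

# A patch containing a single vertical edge: dark left half, bright right half.
patch = np.zeros((8, 8))
patch[:, 4:] = 1.0
print(orientation_histogram(patch))
```

In this sketch, the vertical-edge patch puts all of its weight in the first bin (gradient orientation near 0 degrees), and rescaling the patch's brightness or contrast leaves the normalized histogram unchanged. Vectors like this, computed for many labelled patches, are what you would hand to the classifier.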

HOG has had so much success with pedestrian detection because a person can vary greatly in color, clothing, and other factors, while the general edges of a pedestrian remain relatively constant, especially around the leg area. This does not mean that it cannot be used to detect other types of objects, but its success will vary depending on your particular application. The HOG paper shows in detail how these descriptors can be used for classification.

It is worthwhile to note that for several applications, the results obtained by HOG can be greatly improved using a pyramidal scheme. This works as follows: Instead of extracting a single HOG vector from an image, you can successively divide the image (or patch) into several sub-images, extracting from each of these smaller divisions an individual HOG vector. The process can then be repeated. In the end, you can obtain a final descriptor by concatenating all of the HOG vectors into a single vector, as shown in the following image.

[Image: Pyramidal HOG]

This has the advantage that at larger scales the HOG features provide more global information, while at smaller scales (that is, in smaller subdivisions) they provide more fine-grained detail. The disadvantage is that the final descriptor vector grows larger, so it takes more time both to extract and to train a given classifier on.
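The pyramidal scheme described above can be sketched in a few lines of NumPy. This is an illustration under assumptions, not a reference implementation: it assumes a grayscale image, splits it into a 1x1, 2x2, 4x4, ... grid at successive levels, and uses a simplified per-region orientation histogram (9 unsigned bins, L2-normalized). The name `pyramid_hog` and the `levels` and `n_bins` parameters are invented for this example.

```python
import numpy as np

def pyramid_hog(image, levels=3, n_bins=9):
    """Concatenate simplified orientation histograms over a spatial pyramid."""
    image = np.asarray(image, dtype=float)
    # Compute gradients once on the whole image.
    gy, gx = np.gradient(image)
    magnitude = np.hypot(gx, gy)
    angle = np.degrees(np.arctan2(gy, gx)) % 180.0  # unsigned orientation

    parts = []
    for level in range(levels):
        splits = 2 ** level  # level 0: whole image, level 1: 2x2 grid, ...
        mag_rows = np.array_split(magnitude, splits, axis=0)
        ang_rows = np.array_split(angle, splits, axis=0)
        for mag_row, ang_row in zip(mag_rows, ang_rows):
            mag_cells = np.array_split(mag_row, splits, axis=1)
            ang_cells = np.array_split(ang_row, splits, axis=1)
            for mag, ang in zip(mag_cells, ang_cells):
                # One magnitude-weighted orientation histogram per sub-image.
                hist, _ = np.histogram(ang, bins=n_bins, range=(0.0, 180.0), weights=mag)
                norm = np.linalg.norm(hist)
                parts.append(hist / norm if norm > 0 else hist)
    # Final descriptor: all histograms from all levels in a single vector.
    return np.concatenate(parts)

image = np.random.default_rng(0).random((64, 64))
print(pyramid_hog(image, levels=3).shape)
```

With 9 bins and 3 levels, the descriptor has 9 * (1 + 4 + 16) = 189 entries, compared with 9 for a single histogram, which shows how quickly the vector grows as levels are added.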

In short: Yes, you can use them.


answered Oct 06 '22 08:10

Pablo Lluch


Tags: image-processing, opencv

rish
