I'm working in a project. A part of project consist to integrate the HOG people detector of OpenCV with a camera streaming . Currently It's working the camera and the basic HOG detector (CPP detectMultiScale -> http://docs.opencv.org/modules/gpu/doc/object_detection.html). But don't work very well... The detections are very noising and the algorithm isn't very accuracy... Why? My camera image is 640 x 480 pixels. The snippet code I'm using is: <pre class="prettyprint"><code>std::vector<cv::Rect> found, found_filtered; cv::HOGDescriptor hog; hog.setSVMDetector(cv::HOGDescriptor::getDefaultPeopleDetector()); hog.detectMultiScale(image, found, 0, cv::Size(8,8), cv::Size(32,32), 1.05, 2); </code></pre> Why don't work properly? What need for improve the accuracy? Is necessary some image size particular? PS: Do you know some precise people detection algorithm, faster and developed in cpp ??

The size of the default people detector is 64x128, that mean that the people you would want to detect have to be atleast 64x128. For your camera resolution that would mean that a person would have to take up quite some space before getting properly detected. Depending on your specific situation, you could try your hand at training your own HOG Descriptor, with a smaller size. You could take a look at this answer and the referenced library if you want to train your own HOG Descriptor. For the Parameters: win_stride: Given your input image has a size of 640 x 480, and the defaultpeopleDetector has a window size of 64x128, you can fit the HOG Detection window ( the 64x128 window) multiple times in the input image. The winstride tells HOG to move the detection window a certain amount each time. How does this work: Hog places the detection window on the top left of your input image. and moves the detection window each time by the win_stride. Like this (small win_stride): <img src="https://i.stack.imgur.com/Bvs2W.png" alt="enter image description here"> or like this (large win_stride) <img src="https://i.stack.imgur.com/HAhXR.png" alt="enter image description here"> A smaller winstride should improve accuracy, but decreases preformance, and the other way around padding Padding adds a certain amount of extra pixels on each side of the input image. That way the detection window is placed a bit outside the input image. It's because of that padding that HOG can detect people who are very close to the edge of the input image. group_threshold The group_treshold determines a value by when detected parts should be placed in a group. Low value provides no result grouping, a higher value provides result grouping if the amount of treshold has been found inside the detection windows. (in my own experience, I have never needed to change the default value) I hope this makes a bit of sense for you. I've been working with HOG for the past few weeks, and read alot of papers, but I lost some of the references, so I can't link you the pages where this info comes from, I'm sorry.

Improving accuracy OpenCV HOG people detector

Tags:

c++

opencv

detection

I'm working in a project. A part of project consist to integrate the HOG people detector of OpenCV with a camera streaming .

Currently It's working the camera and the basic HOG detector (CPP detectMultiScale -> http://docs.opencv.org/modules/gpu/doc/object_detection.html). But don't work very well... The detections are very noising and the algorithm isn't very accuracy...

Why?

My camera image is 640 x 480 pixels.

The snippet code I'm using is:

std::vector<cv::Rect> found, found_filtered;
cv::HOGDescriptor hog;
hog.setSVMDetector(cv::HOGDescriptor::getDefaultPeopleDetector());
hog.detectMultiScale(image, found, 0, cv::Size(8,8), cv::Size(32,32), 1.05, 2);

Why don't work properly? What need for improve the accuracy? Is necessary some image size particular?

PS: Do you know some precise people detection algorithm, faster and developed in cpp ??

835

asked Oct 28 '14 11:10

Ricardo

1 Answers

The size of the default people detector is 64x128, that mean that the people you would want to detect have to be atleast 64x128. For your camera resolution that would mean that a person would have to take up quite some space before getting properly detected.

Depending on your specific situation, you could try your hand at training your own HOG Descriptor, with a smaller size. You could take a look at this answer and the referenced library if you want to train your own HOG Descriptor.

For the Parameters:

win_stride: Given your input image has a size of 640 x 480, and the defaultpeopleDetector has a window size of 64x128, you can fit the HOG Detection window ( the 64x128 window) multiple times in the input image. The winstride tells HOG to move the detection window a certain amount each time. How does this work: Hog places the detection window on the top left of your input image. and moves the detection window each time by the win_stride.

Like this (small win_stride): enter image description here

or like this (large win_stride) enter image description here

A smaller winstride should improve accuracy, but decreases preformance, and the other way around

padding Padding adds a certain amount of extra pixels on each side of the input image. That way the detection window is placed a bit outside the input image. It's because of that padding that HOG can detect people who are very close to the edge of the input image.

group_threshold The group_treshold determines a value by when detected parts should be placed in a group. Low value provides no result grouping, a higher value provides result grouping if the amount of treshold has been found inside the detection windows. (in my own experience, I have never needed to change the default value)

I hope this makes a bit of sense for you. I've been working with HOG for the past few weeks, and read alot of papers, but I lost some of the references, so I can't link you the pages where this info comes from, I'm sorry.

117

answered Oct 17 '22 06:10

Timmynator0

Related questions
                            
                                how to determine base of a number?
                            
                                Return reference from class to this
                            
                                What Does an OS Actually Do?
                            
                                How to Run Only One Instance of Application
                            
                                Detecting if casting an int to an enum results into a non-enumerated value
                            
                                Static variable in a Header File
                            
                                Is the code in "The C++ Programming Language Third Edition" on page 854 correct?
                            
                                C++: How to pass a generic function name?
                            
                                libarchive - Extract to specified directory
                            
                                Why is C++ numeric_limits<enum_type>::max() == 0?
                            
                                rgb to yuv420 algorithm efficiency
                            
                                `dynamic_cast` from Base to Derived
                            
                                How to compile OpenCV with libjpeg-turbo?
                            
                                Incrementing Pointers
                            
                                Create a default constructor in C++
                            
                                Returning a pointer of a local variable C++
                            
                                Make Qt application not to quit when last window is closed
                            
                                DIfference in structs?
                            
                                How does a 32 bit processor support 64 bit integers?
                            
                                Embedding Python3 in Qt 5

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With