In OpenCV's haar cascade files, what are the "left" and "right" values, and how does this refer to the "threshold" value? Thanks! Just for reference, here's the structure of the files: <pre class="prettyprint"><code><haarcascade_frontalface_alt type_id="opencv-haar-classifier"> <size>20 20</size> <stages> <_>  <trees> <_>  <_>  <feature> <rects> <_>3 7 14 4 -1.</_> <_>3 9 14 2 2.</_></rects> <tilted>0</tilted></feature> <threshold>4.0141958743333817e-003</threshold> <left_val>0.0337941907346249</left_val> <right_val>0.8378106951713562</right_val></_></_> <_> </code></pre>

The "left" and "right" refer to the gradient values of a particular shape. These particular shapes are not specifically a left rectangle and a right rectangle. Instead, it just refers to sections of a particular configuration (sometimes more than one section if there are more than 2). There is a diagram in the David Haar paper which helps explain this. Here is an ascii representation (= is filled, - unfilled): <pre class="prettyprint"><code>==== ==-- =--= ==== ==-- =--= ---- ==-- =--= ---- ==-- =--= </code></pre> Overall, the naming is bad convention. Instead, it should be named "gradient top", "gradient bottom" (2), "gradient left", "gradient right" (2), "gradient left", "gradient center", "gradient bottom" (3), respectively. Rotated, edge, and other shapes should be named to uniquely identify the sections.

What do the "left" and "right" values mean in the haar cascade xml files?

Tags:

opencv

In OpenCV's haar cascade files, what are the "left" and "right" values, and how does this refer to the "threshold" value? Thanks!

Just for reference, here's the structure of the files:

<haarcascade_frontalface_alt type_id="opencv-haar-classifier">
  <size>20 20</size>
  <stages>
    <_>
      <!-- stage 0 -->
      <trees>
        <_>
          <!-- tree 0 -->
          <_>
            <!-- root node -->
            <feature>
              <rects>
                <_>3 7 14 4 -1.</_>
                <_>3 9 14 2 2.</_></rects>
              <tilted>0</tilted></feature>
            <threshold>4.0141958743333817e-003</threshold>
            <left_val>0.0337941907346249</left_val>
            <right_val>0.8378106951713562</right_val></_></_>
        <_>

510

asked Jun 11 '09 00:06

user117046

2 Answers

The "left" and "right" refer to the gradient values of a particular shape. These particular shapes are not specifically a left rectangle and a right rectangle. Instead, it just refers to sections of a particular configuration (sometimes more than one section if there are more than 2). There is a diagram in the David Haar paper which helps explain this.

Here is an ascii representation (= is filled, - unfilled):

====    ==--   =--=
====    ==--   =--=
----    ==--   =--=
----    ==--   =--=

Overall, the naming is bad convention. Instead, it should be named "gradient top", "gradient bottom" (2), "gradient left", "gradient right" (2), "gradient left", "gradient center", "gradient bottom" (3), respectively. Rotated, edge, and other shapes should be named to uniquely identify the sections.

115

answered Oct 13 '22 23:10

user117046

In the source code of OpenCV, you will find cvhaar.cpp that gives some insight into how Haar cascade works. Unfortunately, this is essentially no commentary, nor does the documentation help much. Here's my understanding of how it works.

In the function icvEvalHidHaarClassifier(), the sum is computed for the the features of a single CvHidHaarTreeNode.

If this sum is less than the threshold, the "left" node is followed, and the process is repeated. Otherwise, the "right" node is followed, again repeating. This is reflected by the following statement:

idx = sum < t ? node->left : node->right;

The loop is broken when the "left" or "right" node is a negative value. In this case, the sum is no longer computed for this feature, but the threshold value for that feature is returned as the result of the classifier.

I put "left" and "right" in quotes because, as you say, they have nothing to do with the feature position. Instead, they reflect which way the cascade "falls": below the threshold, the cascade falls left, above the threshold, it falls right.

Let us now step back to the representation of these nodes. In the XML, you will see the representation of the nodes not as indexes, but as values:

<left_val>0.0337941907346249</left_val>
<right_val>0.8378106951713562</right_val>

These numbers are in fact node names that are looked up using cvGetFileNodeByName(). I don't know exactly how this works inside OpenCV, but now I hope you at least have a better idea how the cascade works.

answered Oct 14 '22 00:10

Paul Lammertsma

Related questions
                            
                                Reusing models from grabcut in OpenCV
                            
                                Save float array to image (with EXR format)
                            
                                How to solve 'does not name a type' during opencv compiling using mingw32-make?
                            
                                OpenCV python canny Required argument 'threshold2' (pos 4) not found
                            
                                python , opencv, image array to binary
                            
                                When i try to build apk(s) on android studio 3 it gives me error
                            
                                comparing HOG feature vectors without SVM
                            
                                Opencv Python open dng format
                            
                                Why is random video seeks with OpenCV slow?
                            
                                How to extract a specific section of an image using OpenCV in Python?
                            
                                How to find shortest path in skeletonized maze image?
                            
                                Undefined reference to `cv::String::deallocate()' error in OpenCV 3.4.3 [duplicate]
                            
                                Using OpenCV Hough Tranform for line detection in 2D point cloud
                            
                                get coordinates of 4 corners of display screen on image
                            
                                How to detect Sudoku grid board in OpenCV
                            
                                Python opencv cv2.VideoCapture.read() getting stuck indefinitely after running the first time
                            
                                kivy camera application with opencv in android shows black screen
                            
                                How to I compute matching features between high resolution images?
                            
                                Image blur detection for iOS in Objective C
                            
                                How to obtain smooth histogram after scaling image?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What do the "left" and "right" values mean in the haar cascade xml files?

Tags:

opencv

user117046

People also ask

2 Answers

user117046

Paul Lammertsma

Recent Activity

Donate For Us