anchor box or bounding boxes in Yolo or Faster RCNN

Tags:

I don't know the difference between anchor box and bounding boxes, or proposal area. I am confused with these definitions. And I don't know the meaning of these boxes in the detection model, since the default length never changes! And finally, I confuse with the fact that RCNN series and Yolo series both output the prediction boxes location (x,y,w,h). Or output the delta position (ground truth_x - predicted_x)/prediction_w?

914

asked May 21 '18 14:05

Luv

2 Answers

Anchor Boxes: predefined landmark rectangles for bounding boxes to pick and use offsets to give location for a detected object

Bounding Box: predicted rectangle for a detected object relative to an anchor box

Basically the idea is comparable to landmarks used in object detection models like in Snapchat's camera. A set of nodes are pre-decided for the network on specific regions of the image based on how selfie portraits are characterised, the network learns how to offset the nodes relative to different faces fed into the network before a filter or mask is applied for some visual m*sturbation to really excite the user

143

answered Oct 18 '22 10:10

LiNKeR

Bounding Boxes Bounding boxes are boxes that are predicted by the network. These predicted boxes are overwritten on the input image so that you can visually understand what the position ans shape of rectangle are detected by the prediction. That is, they are rectangles you can see in this youtube video.

Anchor Boxes We can put some assumption on the shapes of bounding boxes. For example, if we want to detect humans, we should search humans with some vertical rectangular boxes. They are anchor boxes. The anchor boxes are fed to the network, before training and prediction, as a list of some numbers, which is a series of pairs of width and height:

anchors = [1.08, 1.19, 3.42, 4.41, 6.63, 11.38, 9.42, 5.11, 16.62, 10.52]

This list above defines 5 anchor boxes. We can feed arbitrary number of anchor boxes to the network.

These values are determined from the training data with some statistical procedure.

answered Oct 18 '22 10:10

spl

Related questions
                            
                                Why CIFAR-10 images are not displayed properly using matplotlib?
                            
                                How to get a score for cv2.CascadeClassifier.detectMultiScale()?
                            
                                Matching a curve pattern to the edges of an image
                            
                                Scene change/shot detection/image extraction using ffmpeg from video
                            
                                Detect all branches in a plant picture
                            
                                Find the best Region of Interest after edge detection in OpenCV
                            
                                3D reconstruction from two calibrated cameras - where is the error in this pipeline?
                            
                                solvePNP vs recoverPose by rotation composition: why translations are not same?
                            
                                How can I access my laptop's built-in infrared webcam using python?
                            
                                number of parameters in Caffe LENET or Imagenet models
                            
                                What's the best way of understanding opencv's warpperspective and warpaffine?
                            
                                Finding pits in an image
                            
                                Implementing log Gabor filter bank
                            
                                3D image rotation in python
                            
                                Binary Image Orientation
                            
                                Weird result from the Kuwahara filter
                            
                                How to configure Probabilistic Occupancy Map people detector
                            
                                Continued - Vehicle License Plate Detection
                            
                                How to detect Hotspots in an image
                            
                                Using get() and put() to access pixel values in OpenCV for Java

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

anchor box or bounding boxes in Yolo or Faster RCNN

Tags:

computer-vision

object-detection

yolo

Luv

People also ask

2 Answers

LiNKeR

spl

Recent Activity

Donate For Us