Understanding darknet's yolo.cfg config files

Tags:

I have searched around the internet but found very little information around this, I don't understand what each variable/value represents in yolo's .cfg files. So I was hoping some of you could help, I don't think I'm the only one having this problem, so if anyone knows 2 or 3 variables please post them so that people who needs such info in the future might find them.

The main one that I'd like to know are :

batch
subdivisions
decay
momentum
channels
filters
activation

765

asked May 17 '18 11:05

Reda Drissi

1 Answers

Here is my current understanding of some of the variables. Not necessarily correct though:

[net]

batch: That many images+labels are used in the forward pass to compute a gradient and update the weights via backpropagation.
subdivisions: The batch is subdivided in this many "blocks". The images of a block are ran in parallel on the gpu.
decay: Maybe a term to diminish the weights to avoid having large values. For stability reasons I guess.
channels: Better explained in this image :

On the left we have a single channel with 4x4 pixels, The reorganization layer reduces the size to half then creates 4 channels with adjacent pixels in different channels.

momentum: I guess the new gradient is computed by momentum * previous_gradient + (1-momentum) * gradient_of_current_batch. Makes the gradient more stable.
adam: Uses the adam optimizer? Doesn't work for me though
burn_in: For the first x batches, slowly increase the learning rate until its final value (your learning_rate parameter value). Use this to decide on a learning rate by monitoring until what value the loss decreases (before it starts to diverge).
policy=steps: Use the steps and scales parameters below to adjust the learning rate during training
steps=500,1000: Adjust the learning rate after 500 and 1000 batches
scales=0.1,0.2: After 500, multiply the LR by 0.1, then after 1000 multiply again by 0.2
angle: augment image by rotation up to this angle (in degree)

layers

filters: How many convolutional kernels there are in a layer.
activation: Activation function, relu, leaky relu, etc. See src/activations.h
stopbackward: Do backpropagation until this layer only. Put it in the panultimate convolution layer before the first yolo layer to train only the layers behind that, e.g. when using pretrained weights.
random: Put in the yolo layers. If set to 1 do data augmentation by resizing the images to different sizes every few batches. Use to generalize over object sizes.

Many things are more or less self-explanatory (size, stride, batch_normalize, max_batches, width, height). If you have more questions, feel free to comment.

Again, please keep in mind that I am not 100% certain about many of those.

answered Oct 16 '22 09:10

FelEnd

Related questions
                            
                                How to deal with "DNN module was not built with CUDA backend; switching to CPU" warning in C++?
                            
                                One stage vs two stage object detection
                            
                                How to convert Yolo format bounding box coordinates into OpenCV format
                            
                                How Yolo calculate P(Object) in the YOLO 9000
                            
                                Yolo v1 bounding boxes during training step
                            
                                How to convert bounding box (x1, y1, x2, y2) to YOLO Style (X, Y, W, H)
                            
                                Tracing back deprecated warning in pytorch
                            
                                KeyError: ''val_loss" when training model
                            
                                How can I download a specific part of Coco Dataset?
                            
                                anchor box or bounding boxes in Yolo or Faster RCNN
                            
                                Unsupported gpu architecture compute_30 on a CUDA 5 capable gpu
                            
                                How to reduce number of classes in YOLOv3 files?
                            
                                OpenCV 4.x+ requires enabled C++11 support compilation darknet fatal error
                            
                                How to convert Keras .h5 model to darknet yolo.weights format?
                            
                                Anchor Boxes in YOLO : How are they decided
                            
                                How many images(minimum) should be there in each classes for training YOLO?
                            
                                Extracting the license plate parallelogram from the surrounding bounding box?
                            
                                Using YOLO or other image recognition techniques to identify all alphanumeric text present in images
                            
                                YOLO object detection: how does the algorithm predict bounding boxes larger than a grid cell?
                            
                                Training a Keras model yields multiple optimizer errors

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Understanding darknet's yolo.cfg config files

Tags:

yolo

darknet