ImageNet images come in all different sizes, but neural networks need a fixed-size input.
One solution is to take the largest crop that fits in the image, centered on the image's center point. This works but has some drawbacks. Important parts of the object of interest are often cut out, and there are even cases where the correct object is missing entirely while an object belonging to a different class is still visible, so the model is trained on the wrong label for that image.
Another solution would be to use the entire image and zero-pad it so that every image has the same dimensions. This seems like it would interfere with training, though: the model might learn to look for vertical and horizontal black strips near the edges of images.
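To make the two options concrete, here is a minimal sketch of both using Pillow (the 224 and 512 sizes are only placeholder values; the actual sizes depend on the network and dataset):

```python
from PIL import Image

def center_crop(img: Image.Image, size: int = 224) -> Image.Image:
    """Cut a fixed-size window from the image center, with no rescaling.
    Anything outside the window is discarded."""
    left = (img.width - size) // 2
    top = (img.height - size) // 2
    return img.crop((left, top, left + size, top + size))

def zero_pad(img: Image.Image, size: int = 512) -> Image.Image:
    """Keep the whole image and paste it onto an all-zero (black) canvas,
    so every example ends up with the same dimensions. Assumes `size` is
    at least as large as the largest image."""
    canvas = Image.new("RGB", (size, size))
    canvas.paste(img, ((size - img.width) // 2, (size - img.height) // 2))
    return canvas
```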
What is commonly done?
The common approach combines proportional scaling with cropping. Proportional scaling, much like it sounds, means scaling an image toward the target size while maintaining its proportions (aspect ratio). Cropping then removes the excess along the longer dimension so that every image ends up with the same fixed size.
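As a minimal sketch of this scale-then-crop preprocessing with Pillow (the 256/224 sizes follow the convention used by many ImageNet models, but they are just example values):

```python
from PIL import Image

def scale_and_center_crop(path: str, crop: int = 224, short_side: int = 256) -> Image.Image:
    """Proportionally scale the shorter side to `short_side`, preserving the
    aspect ratio, then center-crop a fixed `crop` x `crop` square."""
    img = Image.open(path).convert("RGB")

    # Proportional scaling: shorter side -> short_side, proportions kept.
    scale = short_side / min(img.size)
    img = img.resize((round(img.width * scale), round(img.height * scale)),
                     Image.BILINEAR)

    # Cropping: cut the central crop x crop square out of the scaled image.
    left = (img.width - crop) // 2
    top = (img.height - crop) // 2
    return img.crop((left, top, left + crop, top + crop))
```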
There are several approaches. You could take a look at how the latest ImageNet networks, such as VGG and ResNet, are trained; their papers usually describe this preprocessing step in detail.
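For reference, here is a sketch of the kind of pipeline those recipes typically use, written with torchvision (the exact crop sizes and augmentations vary from paper to paper; the normalization constants are the usual ImageNet channel statistics):

```python
import torchvision.transforms as T

# Training: random scaled crop plus horizontal flip, as in many ImageNet recipes.
train_transform = T.Compose([
    T.RandomResizedCrop(224),    # random area/aspect-ratio crop, resized to 224x224
    T.RandomHorizontalFlip(),
    T.ToTensor(),
    T.Normalize(mean=[0.485, 0.456, 0.406],   # ImageNet channel means
                std=[0.229, 0.224, 0.225]),   # ImageNet channel stds
])

# Evaluation: proportional resize of the shorter side, then a center crop.
eval_transform = T.Compose([
    T.Resize(256),       # shorter side -> 256, aspect ratio preserved
    T.CenterCrop(224),
    T.ToTensor(),
    T.Normalize(mean=[0.485, 0.456, 0.406],
                std=[0.229, 0.224, 0.225]),
])
```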