Do image have to have the same size for deep learning?

1 Answers

Normally for deep learning this does not have to be the case. Convolutional Neural Networks do not depend on the image size and filters can be applied on all image sizes.

Still many frameworks and literally all papers use the same image sizes for training. In https://arxiv.org/pdf/1409.1556/ they used different sizes for evaluating the network. To achieve this you can use either resizing or crops or a combination of the both. Keep in mind that changing the aspect ratio is almost always a bad idea.

To choose a good image size it is important to note that a bigger image sizes will give you better accuracy normally. However all the filter take longer and the memory requirements rise with the image size. Additionally larger sizes yield diminishing improvements. I normally use 224x224, because it is often divisible through 2 and imagenet uses it too.

Finally the image size does not have to be square, but it is most of the time a good idea, because CNNs often cut the image size in half and often end up at something like 4x4 or 6x6. Doing this with a non square starting size will give you an akward ending size like 4x2 or 6x3.

182

answered Oct 01 '22 23:10

Thomas Pinetz

Related questions
                            
                                Why val_loss is different from training loss when use the same training data as validation data?
                            
                                How to train Tensorflow Object Detection images that do not contain objects?
                            
                                Normalization of input data in Keras
                            
                                Understanding when to call zero_grad() in pytorch, when training with multiple losses
                            
                                How to exactly add L1 regularisation to tensorflow error function
                            
                                Why is it possible to have low loss, but also very low accuracy, in a convolutional neural network?
                            
                                What's the difference between optimizer.compute_gradient() and tf.gradients() in tensorflow?
                            
                                Significance of auxiliary output in Multi-input and multi-output model using deep network
                            
                                Are there any computational efficiency differences between nn.functional() Vs nn.sequential() in PyTorch
                            
                                tf.control_dependencies(tf.get_collection(tf.GraphKeys.UPDATE_OPS)) in tensorflow
                            
                                Using Gekko's brain module, how do I determine how many layers and what type of layer to use to solve a deep learning problem?
                            
                                Are modern CNN (convolutional neural network) as DetectNet rotate invariant?
                            
                                How to deal with multi step time series forecasting in multivariate LSTM in keras
                            
                                TensorFlow - Read video frames from TFRecords file
                            
                                is it possible to implement dynamic class weights in keras?
                            
                                How to add parameters in module class in pytorch custom model?
                            
                                Compiling Caffe C++ Classification Example
                            
                                Generating LMDB for Caffe
                            
                                How to read images with different size in a TFRecord file
                            
                                Observations meaning - OpenAI Gym

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Do image have to have the same size for deep learning?

Tags:

deep-learning

Xianyu Wang

People also ask

1 Answers

Thomas Pinetz

Recent Activity

Donate For Us