 

Is there any particular reason why people pick 224x224 image size for imagenet experiments?

Is it that 224x224 gives better accuracy for some reason or just computational constraint? I would think that bigger picture should give better accuracy, no?

user10024395 asked Apr 16 '17 06:04



1 Answer

Well, bigger images contain more information, but not all of it is relevant. The size of your input matters because the bigger the input, the more parameters your network will have to handle. More parameters lead to several problems: first, you'll need more computing power; then you may need more data to train on, since a lot of parameters and not enough samples can lead to overfitting, especially with CNNs. The choice of 224x224 in AlexNet also allowed the authors to apply some data augmentation.
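To see how input size drives parameter count, here is a small sketch. The downsampling factor, channel count, and FC width below are illustrative assumptions (not taken from any specific published architecture): a CNN that downsamples by 32 and feeds a 256-channel feature map into a 4096-unit fully connected layer.

```python
# Hypothetical illustration: how the first fully connected layer's
# parameter count grows with input size. Assumes the network downsamples
# the input by a factor of 32 (e.g. five 2x2 pooling stages) and ends in
# a 256-channel feature map feeding a 4096-unit FC layer -- all numbers
# chosen for illustration only.

def fc_params(input_size, downsample=32, channels=256, fc_units=4096):
    """Weights in the first FC layer (biases ignored)."""
    fmap = input_size // downsample      # spatial side of the last feature map
    return fmap * fmap * channels * fc_units

for size in (224, 448):
    print(size, fc_params(size))
```

Note that doubling the input side quadruples the FC parameter count, which is why larger inputs demand more compute and more data.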

For instance, if you have a 512x512 image and want to recognize an object in it, it would be better to resample it to 256x256, take smaller patches of 224x224 or 200x200, apply some data augmentation, and then train. You could also use patches of 400x400 with data augmentation, provided that you have enough data.
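The crop-based augmentation described above can be sketched as follows, assuming the image has already been resampled to 256x256 and is held as a NumPy array; the random horizontal flip is a common companion step:

```python
# Minimal sketch: take random 224x224 patches (with an optional mirror)
# from a 256x256 image, as in crop-based data augmentation.
import numpy as np

def random_crop(img, crop=224, rng=None):
    """Return a random crop x crop patch from an HxWxC array,
    mirrored horizontally half the time."""
    rng = rng or np.random.default_rng()
    h, w = img.shape[:2]
    y = rng.integers(0, h - crop + 1)
    x = rng.integers(0, w - crop + 1)
    patch = img[y:y + crop, x:x + crop]
    if rng.random() < 0.5:
        patch = patch[:, ::-1]           # horizontal mirror
    return patch

image = np.zeros((256, 256, 3), dtype=np.uint8)  # stand-in for a resampled image
patch = random_crop(image)
print(patch.shape)  # (224, 224, 3)
```

Each training pass then sees a slightly different view of the same image, which effectively multiplies the dataset without collecting new samples.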

Don't forget to do cross-validation so you can check if there's overfitting.
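A bare-bones k-fold split (indices only) is enough to check for overfitting: train on k-1 folds, validate on the held-out fold, and watch for a large gap between training and validation accuracy. This helper is a generic sketch, not tied to any particular framework:

```python
# Minimal k-fold index splitter for cross-validation.
def kfold_indices(n, k=5):
    """Yield (train_indices, val_indices) pairs over n samples in k folds."""
    folds = [list(range(i, n, k)) for i in range(k)]
    for i in range(k):
        val = folds[i]
        train = [j for f in folds[:i] + folds[i + 1:] for j in f]
        yield train, val

for train, val in kfold_indices(10, k=5):
    print(len(train), len(val))  # 8 2 on each of the five folds
```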

Lucas Ramos answered Oct 29 '22 03:10