Does anyone know the resolution of an image in the ImageNet dataset?
I'm sorry, but I couldn't find it on their website or in any of the papers.
Clocking in at 150 GB, ImageNet is quite a beast. It holds 1,281,167 images for training and 50,000 images for validation, organised in 1,000 categories.
ImageNet is a large dataset of annotated photographs intended for computer vision research.
ImageNet-21K dataset, which contains more pictures and classes, is used less frequently for pretraining, mainly due to its complexity, and underestimation of its added value compared to standard ImageNet-1K pretraining.
The ImageNet dataset contains 14,197,122 annotated images according to the WordNet hierarchy. Since 2010 the dataset is used in the ImageNet Large Scale Visual Recognition Challenge (ILSVRC), a benchmark in image classification and object detection.
The images vary in dimensions and resolution. Many applications resize / crop all of the images to 256x256 pixels.
The average image resolution on ImageNet is 469x387 pixels. Normally it's applied a pre-processing that samples them to 256x256 as @Prune said, but it depends on the task at hand.
In a brief check of a random ImageNet 2012 folder (Fish)... The largest image is 4288 x 2848 pixels. The smallest image is 75 x 56 pixels. This is representative of the aspect ratio range.
It depends of the picture but you can resize them. The smaller dimension of the picture should be at least 256 px and the aspect ratio should be intact.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With