I have two questions about how to load Imagenet datas. I downloaded ILSVRC2012 validation sets(Cause training sets are too large) but I have two problems. <ol> <li>I can't understand how can I find out the labels. There are only jpeg files with file names like "<code>ILSVRC2012_val_00000001.JPEG</code>" but there's no labels. How can I find them?</li> <li>As far as I know, Imagenet uses 224 * 224 pixel image and the problem is just "classification" not "detection", but ILSVRC2012 sets have much more and different pixel sizes. So, how can I get proper boxes for 224 * 224 pixels?</li> </ol>

It's in the Development kit (Task 1 & 2) The filename called "ILSVRC2012_validation_ground_truth.txt"

How can I find Imagenet data labels?

2 Answers

You will download three tar archives: one for training data, one for validation data, and one for test data.

Training data is contained in 1000 folders, one folder per class (each folder should contain 1,300 JPEG images). Validation data is a single folder with 50k JPEG images, look for the corresponding ILSVRC2012_validation_ground_truth.txt file in (as darren1231 mentioned, it needs to be downloaded separately as part of DevKit).

Test data is similar to validation data, but it does not have labels (labels are not provided to you because you need to submit your predicted labels to them, as part of the competition).

ImageNet images have variable resolution, 482x415 on average, and it's up to you how you want to process them to train your model. Most people process it as following: First downsize each image so that its shorter side is 256 pixels. Then crop a random 224x224 patch. Use those patches for training (you will get different crops each epoch). During test, do the same, but extract a center 224x224 patch, and use that for evaluating classification accuracy. Some people also use multiple patches for testing. Again, it's up to you, and you can use higher resolution if you like.

answered Sep 30 '22 16:09

MichaelSB

It's in the Development kit (Task 1 & 2) The filename called "ILSVRC2012_validation_ground_truth.txt"

answered Sep 30 '22 16:09

darren1231

Related questions
                            
                                "Solving Environment" during `conda install -c <my_channel> tensorflow` takes 3+ min but changing the name a bit reduces the time significantly
                            
                                Tensorflow warning: The graph couldn't be sorted in topological order?
                            
                                Loading Images in a Directory As Tensorflow Data set
                            
                                How can I use 100% of VRAM on a secondary GPU from a single process on windows 10?
                            
                                How to control when to compute evaluation vs training using the Estimator API of tensorflow?
                            
                                What has to be inside tf.distribute.Strategy.scope()?
                            
                                Differences between CV2 image processing and tf.image processing
                            
                                Creating many feature columns in Tensorflow
                            
                                Error when building seq2seq model with tensorflow
                            
                                Is there a function to extract image patches in PyTorch?
                            
                                How to visualize a tensor summary in tensorboard
                            
                                How do I fix a dimension error in TensorFlow?
                            
                                TensorFlow freeze_graph.py: The name 'save/Const:0' refers to a Tensor which does not exist
                            
                                Why should preprocessing be done on CPU rather than GPU?
                            
                                Understanding LSTM model using tensorflow for sentiment analysis
                            
                                How to handle large amouts of data in tensorflow?
                            
                                Correct way of doing data augmentation in TensorFlow with the dataset api?
                            
                                How can I combine ImageDataGenerator with TensorFlow datasets in TF2?
                            
                                .predict() runs only on CPU even though GPU is available
                            
                                where is the ./configure of TensorFlow and how to enable the GPU support?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How can I find Imagenet data labels?

Tags:

tensorflow

deep-learning

imagenet

Curious_man

People also ask

2 Answers

MichaelSB

darren1231

Recent Activity

Donate For Us