We are planning to build image classifiers using Google TensorFlow. What are the minimum and the optimum requirements for training a custom image classifier with a deep convolutional neural network? Specifically: <ul> <li>how many images per class should be provided at a minimum?</li> <li>do we need to appx. provide the same amount of training images per class or can the amount per class be disparate?</li> <li>what is the impact of wrong image data in the training data? E.g. 500 images of a tennis shoe and 50 of other shoes. </li> <li>is it possible to train a classifier with much more classes than the recently published inception-v3 model? Let's say: 30.000.</li> </ul>

Minimum requirements for Google TensorFlow image classifier

2 Answers

"how many images per class should be provided at a minimum?"

Depends how you train.

If you train a new model from scratch, purely supervised, the MNIST and CIFAR tasks give a rule of thumb for the number of images: these seem to work OK with about 5,000 images per class.
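To make the rule of thumb concrete, here is a small arithmetic sketch. It assumes the standard training splits (60,000 images for MNIST, 50,000 for CIFAR-10, 10 classes each) and reads "CIFAR" as CIFAR-10; the helper names are hypothetical and only serve the illustration.

```python
# Standard training-set sizes (assumption: "CIFAR" means CIFAR-10).
BENCHMARKS = {
    "MNIST": {"train_images": 60_000, "classes": 10},
    "CIFAR-10": {"train_images": 50_000, "classes": 10},
}


def images_per_class(train_images: int, classes: int) -> int:
    """Average number of training images available per class."""
    return train_images // classes


def total_images_needed(per_class: int, num_classes: int) -> int:
    """Total labeled images implied by a per-class budget."""
    return per_class * num_classes


if __name__ == "__main__":
    for name, spec in BENCHMARKS.items():
        print(name, images_per_class(**spec), "images per class")
    # The same budget applied to the 30,000 classes from the question.
    print(total_images_needed(5_000, 30_000), "images in total")
```

Applied naively to 30,000 classes, the same per-class budget implies a very large labeled set, which is one reason the pretraining and semi-supervised options are worth considering.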

You can probably bootstrap your network by beginning with a model trained on ImageNet. This model will already have good features, so it should be able to learn to classify new categories without as many labeled examples. I don't think this is well-studied enough to tell you a specific number.

If you also train on unlabeled data (semi-supervised learning), you may need only about 100 labeled images per class. There is a lot of recent research on this topic, though it has not yet been scaled to tasks as large as ImageNet. Simple to implement:

http://arxiv.org/abs/1507.00677

Complicated to implement:

http://arxiv.org/abs/1507.02672
http://arxiv.org/abs/1511.06390
http://arxiv.org/abs/1511.06440

"do we need to appx. provide the same amount of training images per class or can the amount per class be disparate?"

It should work with different numbers of examples per class.
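If the imbalance does turn out to hurt, one common way to compensate is to weight each class's loss inversely to its frequency. This is not something the answer prescribes; the sketch below is only an illustration, and the function name and the "cat"/"dog" labels are hypothetical.

```python
from collections import Counter


def inverse_frequency_weights(labels):
    """Return a weight per class, inversely proportional to its count.

    Weights are scaled so that the weighted class counts sum to the
    number of examples, i.e. a balanced data set gets weight 1.0 everywhere.
    """
    counts = Counter(labels)
    num_examples = len(labels)
    num_classes = len(counts)
    return {
        cls: num_examples / (num_classes * count)
        for cls, count in counts.items()
    }


if __name__ == "__main__":
    # Hypothetical unbalanced data set: 900 "cat" images, 100 "dog" images.
    labels = ["cat"] * 900 + ["dog"] * 100
    print(inverse_frequency_weights(labels))
```

The resulting dictionary could then be passed to whatever per-class weighting mechanism the training code offers.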

"what is the impact of wrong image data in the training data? E.g. 500 images of a tennis shoe and 50 of other shoes."

You should use the label smoothing technique described in this paper:

http://arxiv.org/abs/1512.00567

Smooth the labels based on your estimate of the label error rate.
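As a minimal sketch of what label smoothing does to a training target, the function below mixes the one-hot label with a uniform distribution over the classes, which is the form used in the paper linked above. Setting `epsilon` equal to the estimated label error rate is an assumption made here for illustration, and `smooth_labels` is a hypothetical helper, not a TensorFlow API.

```python
def smooth_labels(true_class, num_classes, epsilon):
    """Return a smoothed target distribution for one example.

    The true class gets (1 - epsilon) + epsilon / num_classes and every
    other class gets epsilon / num_classes, so the target still sums to 1.
    """
    if not 0.0 <= epsilon <= 1.0:
        raise ValueError("epsilon must be in [0, 1]")
    if not 0 <= true_class < num_classes:
        raise ValueError("true_class out of range")
    uniform_share = epsilon / num_classes
    target = [uniform_share] * num_classes
    target[true_class] += 1.0 - epsilon
    return target


if __name__ == "__main__":
    # The question's example: 50 wrong images among 550 labeled "tennis shoe".
    estimated_error_rate = 50 / 550  # assumption: use this directly as epsilon
    print(smooth_labels(true_class=0, num_classes=4, epsilon=estimated_error_rate))
```

The smoothed vector replaces the one-hot vector as the target of the cross-entropy loss, so the network is never pushed to be fully certain about a label that may be wrong.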

"is it possible to train a classifier with much more classes than the recently published inception-v3 model? Let's say: 30.000."

Yes

answered Sep 28 '22 08:09

Ian Goodfellow

"how many images per class should be provided at a minimum?"

"do we need to appx. provide the same amount of training images per class or can the amount per class be disparate?"

"what is the impact of wrong image data in the training data? E.g. 500 images of a tennis shoe and 50 of other shoes."

These three questions are not really TensorFlow-specific. The short answer is that it depends on how resilient your model is to unbalanced data sets and noisy labels.

"is it possible to train a classifier with much more classes than the recently published inception-v3 model? Let's say: 30.000."

Yes, definitely. This would mean a much larger classifier layer, so training might take longer. Other than that, TensorFlow imposes no limitations.
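To get a rough sense of how much larger the classifier layer becomes, the sketch below counts the weights and biases of a single fully connected output layer. It assumes a 2048-dimensional feature vector feeding that layer, as in Inception-v3's final pooled features; the function name is hypothetical.

```python
FEATURE_DIM = 2048  # assumption: size of the features fed to the classifier


def classifier_layer_params(feature_dim: int, num_classes: int) -> int:
    """Parameters of one dense layer: a weight matrix plus one bias per class."""
    return feature_dim * num_classes + num_classes


if __name__ == "__main__":
    for num_classes in (1_000, 30_000):
        params = classifier_layer_params(FEATURE_DIM, num_classes)
        print(f"{num_classes} classes: {params:,} parameters")
```

Under these assumptions the output layer grows from about 2 million to about 61 million parameters, which affects memory and training time but is not a hard limit.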

answered Sep 28 '22 06:09

keveman

Tags:

machine-learning

neural-network

tensorflow

classification

computer-vision

Jabb
