How to select batch size automatically to fit GPU?

I am training deep neural networks on a GPU. If the samples are too large, the batches too large, or the network too deep, I get an out-of-memory error. In that case, it is sometimes possible to use smaller batches and still train.

Is it possible to calculate the GPU memory required for training and determine beforehand what batch size to choose?

UPDATE

If I print the network summary, it displays the number of "trainable parameters". Can't I estimate from this value? For example, take this number, multiply it by the batch size, double it for gradients, and so on?

778

asked Jul 16 '17 20:07

Dims

2 Answers

No, it is not possible to do this automatically. If you want your batch to be as large as possible, you need to go through a lot of trial and error to find an appropriate size.
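The trial and error itself can at least be scripted. The following is a minimal sketch, not a framework feature: `find_max_batch_size` and the `fits` callback are hypothetical names, and the search assumes that whenever one batch size fits, every smaller one fits too. In practice `fits` would run a single training step and return False when the framework reports out of memory (for example by catching `tf.errors.ResourceExhaustedError` in TensorFlow).

```python
from typing import Callable


def find_max_batch_size(fits: Callable[[int], bool], start: int = 1, limit: int = 65536) -> int:
    """Return the largest batch size in [start, limit] for which fits() is True.

    Returns 0 if even `start` does not fit. Assumes monotonicity:
    if a batch size fits, all smaller batch sizes fit as well.
    """
    if not fits(start):
        return 0

    low = start          # largest batch size known to fit
    high = start * 2     # next candidate to try

    # Phase 1: keep doubling until a candidate fails or exceeds the limit.
    while high <= limit and fits(high):
        low = high
        high *= 2

    # `high` is now an exclusive upper bound (it failed or lies above the limit).
    high = min(high, limit + 1)

    # Phase 2: binary search between the last success and the first failure.
    while high - low > 1:
        mid = (low + high) // 2
        if fits(mid):
            low = mid
        else:
            high = mid
    return low


if __name__ == "__main__":
    # Stand-in for a real training step: pretend anything above 37 runs out of memory.
    print(find_max_batch_size(lambda batch_size: batch_size <= 37))
```

Each call to `fits` costs one real training step, so the doubling phase keeps the number of attempts logarithmic in the final batch size.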

Stanford's CNN class provides some guidance on how to estimate the memory size, but all of its suggestions relate to CNNs (I am not sure what you are training).
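As a rough illustration of that kind of estimate, here is a back-of-the-envelope sketch. It assumes 4-byte floats, roughly three copies of every parameter (weights, gradients, one optimizer buffer such as momentum) and two copies of every activation (forward value and its gradient). The function names and the default multipliers are assumptions for illustration only; real frameworks add workspace and fragmentation overhead that this ignores. Note that only the activation term grows with batch size, so the count of trainable parameters alone is not enough.

```python
def estimate_training_memory_bytes(num_params, activations_per_sample, batch_size,
                                   bytes_per_value=4, param_copies=3, activation_copies=2):
    """Rough training memory estimate in bytes (ignores framework overhead)."""
    # Parameter memory does not depend on the batch size.
    param_bytes = num_params * bytes_per_value * param_copies
    # Activation memory scales linearly with the batch size.
    activation_bytes = activations_per_sample * batch_size * bytes_per_value * activation_copies
    return param_bytes + activation_bytes


def max_batch_size_for_budget(num_params, activations_per_sample, budget_bytes,
                              bytes_per_value=4, param_copies=3, activation_copies=2):
    """Largest batch size whose estimate stays within budget_bytes (0 if none)."""
    if activations_per_sample <= 0:
        raise ValueError("activations_per_sample must be positive")
    param_bytes = num_params * bytes_per_value * param_copies
    per_sample_bytes = activations_per_sample * bytes_per_value * activation_copies
    free_bytes = budget_bytes - param_bytes
    if free_bytes <= 0:
        return 0
    return free_bytes // per_sample_bytes


if __name__ == "__main__":
    # Hypothetical network: 1M parameters, 2M activation values per sample.
    print(estimate_training_memory_bytes(1_000_000, 2_000_000, batch_size=32))
    print(max_batch_size_for_budget(1_000_000, 2_000_000, budget_bytes=1_000_000_000))
```

The number of activation values per sample has to be summed over the layer output shapes, which the network summary also lists. Treat the result as a starting point for trial and error rather than a guarantee.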

190

answered Oct 03 '22 02:10

Salvador Dali

PyTorch Lightning recently added a feature called "auto batch size" for exactly this! It computes the maximum batch size that fits into the memory of your GPU :)

More info can be found here.

Original PR: https://github.com/PyTorchLightning/pytorch-lightning/pull/1638
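A minimal sketch of how this can be invoked, assuming a Lightning 2.x install where the tuner lives in `lightning.pytorch.tuner` (older releases exposed it through a `Trainer` argument instead, so check the version you have). The wrapper name `suggest_batch_size` is hypothetical, and the model is assumed to be a `LightningModule` that stores a `batch_size` attribute and uses it in its `train_dataloader`.

```python
def suggest_batch_size(model, mode="power"):
    """Ask Lightning's tuner for a batch size that fits in memory.

    Assumes `model` is a LightningModule exposing a `batch_size` attribute
    (directly or via hparams) that its train_dataloader() reads.
    """
    # Imported lazily so this file can be loaded without Lightning installed.
    from lightning.pytorch import Trainer
    from lightning.pytorch.tuner import Tuner

    trainer = Trainer(max_epochs=1)
    tuner = Tuner(trainer)
    # mode="power" doubles the batch size until it fails;
    # mode="binsearch" refines the result with a binary search.
    return tuner.scale_batch_size(model, mode=mode)
```

The tuner also writes the value it finds back to the model's `batch_size` attribute, so a subsequent `trainer.fit(model)` picks it up.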

answered Oct 03 '22 02:10

Niels

Tags: out-of-memory, tensorflow, deep-learning, gpu, keras
