I'm looking at the performance and GPU usage during training of a CNN model with Keras+TensorFlow. Similar to this question, I'm having a hard time understanding the combined use of Keras model.fit's steps_per_epoch and the TensorFlow Dataset API's .batch(): I set a certain batch size on the input pipeline with dataset = dataset.batch(batch_size) and later I use

fit = model.fit(dataset, epochs=num_epochs, steps_per_epoch=training_set_size//batch_size)

but I see that one can actually set any number of steps per epoch, even more than training_set_size//batch_size. From the documentation I understand that in Keras an epoch is not necessarily a full pass over the training set, as it usually is, but I'm still a bit confused and not entirely sure I'm using this correctly.
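Roughly, my setup looks like the following sketch. The dataset, model, and sizes here are simplified placeholders, not my real pipeline, and I'm assuming .repeat() on the dataset so that any steps_per_epoch value can be served:

```python
import tensorflow as tf

# Placeholder sizes, not my real ones
training_set_size = 60000
batch_size = 32
num_epochs = 5

# Toy stand-in for my real input pipeline (MNIST images)
(x_train, y_train), _ = tf.keras.datasets.mnist.load_data()
x_train = x_train[..., None].astype("float32") / 255.0

dataset = tf.data.Dataset.from_tensor_slices((x_train, y_train))
# .repeat() assumed so the pipeline can serve any number of steps
dataset = dataset.shuffle(10000).batch(batch_size).repeat()

model = tf.keras.Sequential([
    tf.keras.layers.Conv2D(16, 3, activation="relu", input_shape=(28, 28, 1)),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(10, activation="softmax"),
])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy")

fit = model.fit(dataset, epochs=num_epochs,
                steps_per_epoch=training_set_size // batch_size)
```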
Is dataset.batch(batch_size) + steps_per_epoch=training_set_size//batch_size defining a minibatch SGD that runs over the entire training set in minibatches of batch_size samples? Are epochs larger than one pass over the training set if steps_per_epoch is set to more than training_set_size//batch_size?
As with most machine learning models, artificial neural networks built with the TensorFlow library are trained using the fit method. Among other parameters, fit takes the x values of the training data, the y values of the training data (or a dataset that yields both together), the number of epochs, and steps_per_epoch.
steps_per_epoch: Total number of steps (batches of samples) to yield from generator before declaring one epoch finished and starting the next epoch. It should typically be equal to the number of samples of your dataset divided by the batch size.
When you need to customize what fit() does, you should override the training step function of the Model class, train_step. This is the function that fit() calls for every batch of data. You will then be able to call fit() as usual, and it will run your own learning algorithm.
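A minimal sketch of that pattern, following the TF 2.x Keras guide on customizing fit() (the (x, y) structure of the batch is an assumption; newer Keras versions expose slightly different helper names):

```python
import tensorflow as tf

class CustomModel(tf.keras.Model):
    def train_step(self, data):
        # fit() calls this once per batch; `data` is whatever the
        # dataset yields, assumed here to be an (x, y) tuple.
        x, y = data
        with tf.GradientTape() as tape:
            y_pred = self(x, training=True)
            loss = self.compiled_loss(y, y_pred)
        grads = tape.gradient(loss, self.trainable_variables)
        self.optimizer.apply_gradients(zip(grads, self.trainable_variables))
        self.compiled_metrics.update_state(y, y_pred)
        return {m.name: m.result() for m in self.metrics}
```

After compiling a CustomModel as usual, model.fit(dataset, ...) runs this step for every batch it draws.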
steps_per_epoch is the number of batches of your chosen batch size that are run through the network in one epoch.
You have set your steps_per_epoch to training_set_size//batch_size for a good reason: it ensures that all of the data is trained on in one epoch, provided the size divides exactly (if not, the // operator rounds down and the leftover samples do not fit into any step of that epoch). That is to say, if you had a batch size of 10 and a training set size of 30, then steps_per_epoch = 3 ensures all data are used.
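To make the rounding concrete, a quick check with made-up sizes that do not divide evenly:

```python
training_set_size = 35   # hypothetical size that does not divide evenly
batch_size = 10

steps_per_epoch = training_set_size // batch_size
print(steps_per_epoch)                  # 3 -> only 30 samples covered by full steps
print(training_set_size % batch_size)   # 5 samples fall outside the 3 full batches
```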
And to quote your question:
"Are epochs larger than one pass over the training set if steps_per_epoch is set to more than training_set_size//batch_size?"
Yes, assuming the input pipeline can keep serving batches (for example via dataset.repeat()): some data will then be passed through more than once in the same epoch. If instead the dataset is finite and gets exhausted, Keras interrupts training and warns that the input ran out of data.
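Here is a small sketch of that behaviour with made-up sizes, assuming a repeating pipeline:

```python
import tensorflow as tf

samples = tf.data.Dataset.range(30)    # 30 "samples"
dataset = samples.batch(10).repeat()   # repeat so extra steps can be served

# 5 steps per "epoch" > 30 // 10 = 3 batches per pass, so the
# first two batches are revisited within the same epoch.
for step, batch in enumerate(dataset.take(5)):
    print(step, batch.numpy())
```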