I was reading the Data Loading section of the TF performance guide. For prefetch, it says:
The tf.data API provides a software pipelining mechanism through the tf.data.Dataset.prefetch transformation, which can be used to decouple the time when data is produced from the time when data is consumed. In particular, the transformation uses a background thread and an internal buffer to prefetch elements from the input dataset ahead of the time they are requested. The number of elements to prefetch should be equal to (or possibly greater than) the number of batches consumed by a single training step. You could either manually tune this value, or set it to tf.data.experimental.AUTOTUNE which will prompt the tf.data runtime to tune the value dynamically at runtime.
What is AUTOTUNE doing internally? Which algorithms or heuristics are being applied?
Additionally, in practice, what kind of manual tuning is done?
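For context, here is a minimal sketch of the two variants the guide describes; the pipeline itself is a made-up placeholder:

```python
import tensorflow as tf

dataset = tf.data.Dataset.range(1000).map(lambda x: x * 2).batch(32)

# Manual tuning: pick a fixed buffer size, e.g. one batch ahead.
manual = dataset.prefetch(buffer_size=1)

# Dynamic tuning: let the tf.data runtime pick the buffer size.
auto = dataset.prefetch(buffer_size=tf.data.experimental.AUTOTUNE)
```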
Note that the prefetch transformation provides benefits any time there is an opportunity to overlap the work of a "producer" with the work of a "consumer."
If a TensorFlow operation has both CPU and GPU implementations, by default the GPU device is prioritized when the operation is assigned. For example, tf.matmul has both CPU and GPU kernels, and on a system with devices CPU:0 and GPU:0, the GPU:0 device is selected to run tf.matmul unless it is explicitly requested to run on another device.
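A minimal sketch of that default placement, assuming a machine where GPU:0 is visible:

```python
import tensorflow as tf

a = tf.constant([[1.0, 2.0], [3.0, 4.0]])
b = tf.constant([[1.0, 1.0], [0.0, 1.0]])

# With both CPU and GPU kernels available, matmul lands on GPU:0 by default.
c = tf.matmul(a, b)
print(c.device)  # e.g. "/job:localhost/replica:0/task:0/device:GPU:0"

# Explicit placement overrides the default.
with tf.device("/CPU:0"):
    d = tf.matmul(a, b)
print(d.device)  # ".../device:CPU:0"
```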
With that knowledge, from_tensors makes a dataset where each input tensor is like a row of your dataset, and from_tensor_slices makes a dataset where each input tensor is a column of your data; so in the latter case all tensors must be the same length, and the elements (rows) of the resulting dataset are tuples with one element from each column.
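A small sketch of that difference:

```python
import tensorflow as tf

features = tf.constant([[1, 2], [3, 4], [5, 6]])  # shape (3, 2)
labels = tf.constant([0, 1, 0])                   # shape (3,)

# from_tensors: a single element containing the whole (features, labels) pair.
ds_whole = tf.data.Dataset.from_tensors((features, labels))
print(len(list(ds_whole)))  # 1

# from_tensor_slices: three elements, one (row, label) tuple per slice.
ds_rows = tf.data.Dataset.from_tensor_slices((features, labels))
for row, label in ds_rows:
    print(row.numpy(), label.numpy())
```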
The AUTOTUNE value in the tf.data module can be used to configure a dataset (for example, one feeding a pre-trained model) for performance. Buffered prefetching is used to ensure that data can be read from disk without I/O becoming blocking.
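A sketch of that pattern, loosely following the TF tutorials; the file glob and parse function here are placeholders, not part of any real dataset:

```python
import tensorflow as tf

AUTOTUNE = tf.data.experimental.AUTOTUNE

def parse_example(path):
    # Placeholder: read one record from disk (decode as needed).
    return tf.io.read_file(path)

dataset = (
    tf.data.Dataset.list_files("data/*.tfrecord")      # hypothetical glob
    .map(parse_example, num_parallel_calls=AUTOTUNE)   # parallel decode
    .cache()            # keep decoded records in memory after the first epoch
    .prefetch(AUTOTUNE) # overlap disk I/O with training
)
```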
tf.data builds a performance model of the input pipeline and runs an optimization algorithm to find a good allocation of its CPU budget across all parameters specified as AUTOTUNE. While the input pipeline is running, tf.data tracks the time spent in each operation, so that these times can be fed into the optimization algorithm.
The tf.data.experimental.OptimizationOptions object gives some control over how autotuning behaves.
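A sketch of setting those options; the attribute names vary across TF versions (these are from the 2.x OptimizationOptions API, so treat them as assumptions for your version):

```python
import tensorflow as tf

dataset = tf.data.Dataset.range(1000).map(lambda x: x + 1)

options = tf.data.Options()
# On earlier TF 2.x releases, autotuning knobs live on OptimizationOptions:
options.experimental_optimization.autotune = True          # enable autotuning
options.experimental_optimization.autotune_cpu_budget = 4  # assumed cap on cores used
dataset = dataset.with_options(options)
```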