During training, my GPU utilization is around 40%, and the TensorFlow profiler shows that a data-copy operation is taking a lot of time (see attached picture). I presume that the "MemcpyHtoD" operation is copying each batch from CPU to GPU and is blocking the GPU from being used. Is there any way to prefetch data to the GPU, or are there other problems I am not seeing?
Here is the code for dataset:
X_placeholder = tf.placeholder(tf.float32, data.train.X.shape)
y_placeholder = tf.placeholder(tf.float32, data.train.y[label].shape)
dataset = tf.data.Dataset.from_tensor_slices({"X": X_placeholder,
                                              "y": y_placeholder})
dataset = dataset.repeat(1000)
dataset = dataset.batch(1000)
dataset = dataset.prefetch(2)
iterator = dataset.make_initializable_iterator()
next_element = iterator.get_next()
If a TensorFlow operation has both CPU and GPU implementations, TensorFlow will automatically place the operation to run on a GPU device first. If you have more than one GPU, the GPU with the lowest ID will be selected by default. However, TensorFlow does not place operations into multiple GPUs automatically.
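As an aside, you can check where each op actually lands by enabling device-placement logging. A minimal sketch, written against the tf.compat.v1 surface (present in late TF 1.x and TF 2.x) so it runs in graph mode either way; the constant names are just for illustration:

```python
import tensorflow as tf

tf1 = tf.compat.v1
tf1.disable_eager_execution()  # run in graph mode, as in the question's code

# Log each op's assigned device to stderr when the session is created.
config = tf1.ConfigProto(log_device_placement=True)
with tf1.Session(config=config) as sess:
    a = tf1.constant([1.0, 2.0], name="a")
    b = tf1.constant([3.0, 4.0], name="b")
    # If a GPU is available, this add is placed on /gpu:0 by default.
    out = sess.run(a + b)
    print(out)  # [4. 6.]
```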
Prefetching to a single GPU:

tf.data.experimental.prefetch_to_device places the prefetch buffer directly on the GPU, but it must be the last transformation in the pipeline. You can avoid that restriction by explicitly copying to the GPU with tf.data.experimental.copy_to_device(...) and then prefetching. This also lets you incorporate further tricks to optimize Dataset pipeline performance (e.g. by overriding the threadpool distribution). There is also a tf.data.experimental.AUTOTUNE option for prefetching, which allows the tf.data runtime to automatically tune the prefetch buffer size based on your system and environment. In the end, you might end up doing something like this:
dataset = dataset.apply(tf.data.experimental.copy_to_device("/gpu:0"))
dataset = dataset.prefetch(tf.data.experimental.AUTOTUNE)
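Putting it together with a pipeline like the one in the question, here is a minimal end-to-end sketch. It uses the tf.compat.v1 surface for graph mode, toy random data in place of data.train, and "/cpu:0" as the target device so it runs anywhere; swap in "/gpu:0" on a GPU machine:

```python
import numpy as np
import tensorflow as tf

tf1 = tf.compat.v1
tf1.disable_eager_execution()

device = "/cpu:0"  # use "/gpu:0" on a machine with a GPU

# Toy stand-ins for data.train.X and data.train.y[label]
X = np.random.rand(100, 4).astype(np.float32)
y = np.random.rand(100, 1).astype(np.float32)

dataset = tf1.data.Dataset.from_tensor_slices({"X": X, "y": y})
dataset = dataset.repeat(10)
dataset = dataset.batch(32)
# Copy batches to the target device, then keep a prefetch buffer there.
dataset = dataset.apply(tf.data.experimental.copy_to_device(device))
with tf.device(device):
    dataset = dataset.prefetch(tf.data.experimental.AUTOTUNE)
    iterator = tf1.data.make_initializable_iterator(dataset)
    next_element = iterator.get_next()

with tf1.Session() as sess:
    sess.run(iterator.initializer)
    batch = sess.run(next_element)
    print(batch["X"].shape)  # (32, 4)
```

Note that the prefetch and the iterator are created inside tf.device(device), so the buffered batches already live on the target device when get_next() is consumed.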