I'm trying to understand where the bottlenecks are in my input_fn, which is built with tf.data.Dataset, so I figured I'd use tf.profiler. However, it only shows the iterator op. How can I get the profiler to report the ops inside my Dataset pipeline instead?
import tensorflow as tf

# input_fn is defined elsewhere and returns a tf.data.Dataset
dataset = input_fn()
iterator = dataset.make_one_shot_iterator()
minibatch = iterator.get_next()
run_metadata = tf.RunMetadata()
with tf.Session() as session:
    features, labels = session.run(
        minibatch,
        options=tf.RunOptions(trace_level=tf.RunOptions.FULL_TRACE),
        run_metadata=run_metadata)
tf.profiler.advise(tf.get_default_graph(), run_metadata)
Output:
checkers {
  key: "AcceleratorUtilizationChecker"
  value {
  }
}
checkers {
  key: "ExpensiveOperationChecker"
  value {
    reports: "top 1 operation type: IteratorGetNext, cpu: 79.89sec, accelerator: 0us, total: 79.89sec (99.96%)\ntop 2 operation type: OneShotIterator, cpu: 27.92ms, accelerator: 0us, total: 27.92ms (0.03%)\ntop 3 operation type: _retval_IteratorGetNext_3_3, cpu: 57us, accelerator: 0us, total: 57us (0.00%)"
    reports: "top 1 graph node: IteratorGetNext, cpu: 79.89sec, accelerator: 0us, total: 79.89sec\ntop 2 graph node: OneShotIterator, cpu: 27.92ms, accelerator: 0us, total: 27.92ms"
    reports: "<ipython-input-2-c5f67ba0356f>:49:<module>, cpu: 79.89sec, accelerator: 0us, total: 79.89sec\n<ipython-input-2-c5f67ba0356f>:48:<module>, cpu: 27.92ms, accelerator: 0us, total: 27.92ms"
  }
}
checkers {
  key: "OperationChecker"
  value {
  }
}
There are two distinct ways to create a dataset: a data source constructs a Dataset from data stored in memory or in one or more files, while a data transformation constructs a dataset from one or more existing tf.data.Dataset objects. To create an input pipeline, you must start with a data source.
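For example (a minimal sketch; the range/map/batch pipeline here is purely illustrative):

import tensorflow as tf

# Data source: builds a Dataset from scratch.
dataset = tf.data.Dataset.range(10)

# Data transformations: each builds a new Dataset from an existing one.
dataset = dataset.map(lambda x: x * 2)
dataset = dataset.batch(4)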
tf.data's repeat() transformation can be used to duplicate data: it repeats the dataset's elements, a given number of times when passed a count, or indefinitely when called with no argument.
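For instance, repeating a three-element dataset twice yields its elements in order, two times over:

import tensorflow as tf

dataset = tf.data.Dataset.range(3).repeat(2)
iterator = dataset.make_one_shot_iterator()
next_element = iterator.get_next()
with tf.Session() as session:
    for _ in range(6):
        print(session.run(next_element))  # prints 0 1 2 0 1 2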
The tf.data API enables you to build complex input pipelines from simple, reusable pieces. For example, the pipeline for an image model might aggregate data from files in a distributed file system, apply random perturbations to each image, and merge randomly selected images into a batch for training.
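A sketch of such an image pipeline (the glob pattern, image size, and batch size below are illustrative assumptions):

import tensorflow as tf

def parse_image(path):
    # Read, decode, and randomly perturb one image.
    image = tf.image.decode_jpeg(tf.read_file(path), channels=3)
    image = tf.image.random_flip_left_right(image)
    return tf.image.resize_images(image, [224, 224])

dataset = tf.data.Dataset.list_files('images/*.jpg')  # assumed file layout
dataset = dataset.map(parse_image)
dataset = dataset.shuffle(1000).batch(32)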
If all of your input data fits in memory, the simplest way to create a Dataset from it is to convert it to tf.Tensor objects and use Dataset.from_tensor_slices().
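For instance, with small in-memory NumPy arrays:

import numpy as np
import tensorflow as tf

features = np.random.rand(100, 4).astype(np.float32)
labels = np.random.randint(2, size=100)

# Each dataset element is one (feature_row, label) pair.
dataset = tf.data.Dataset.from_tensor_slices((features, labels))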
Looks like tf.data profiling simply wasn't implemented in earlier TensorFlow versions; it seems to have been added in 1.14. This snippet:
import tensorflow as tf

# Toy pipeline: three dataset ops we'd like to see in the profile.
dataset = tf.data.Dataset.range(100)
dataset = dataset.shuffle(30)
dataset = dataset.repeat()
iterator = dataset.make_one_shot_iterator()
minibatch = iterator.get_next()

run_metadata = tf.RunMetadata()
options = tf.RunOptions(trace_level=tf.RunOptions.FULL_TRACE)
with tf.Session() as session:
    session.run(minibatch, options=options, run_metadata=run_metadata)
tf.profiler.advise(tf.get_default_graph(), run_metadata)
Outputs:
Parsing Inputs...
ExpensiveOperationChecker:
top 1 operation type: OneShotIterator, cpu: 3.01ms, accelerator: 0us, total: 3.01ms (87.19%)
top 2 operation type: IteratorGetNext, cpu: 440us, accelerator: 0us, total: 440us (12.75%)
top 3 operation type: _retval_IteratorGetNext_0_0, cpu: 2us, accelerator: 0us, total: 2us (0.06%)
top 1 graph node: OneShotIterator, cpu: 3.01ms, accelerator: 0us, total: 3.01ms
top 2 graph node: IteratorGetNext, cpu: 440us, accelerator: 0us, total: 440us
test.py:7:<module>, cpu: 3.01ms, accelerator: 0us, total: 3.01ms
OperationChecker:
AcceleratorUtilizationChecker:
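If advise is too coarse, the same run_metadata can also be fed to tf.profiler.profile to print per-node timings directly (a sketch, appended after the snippet above):

# Rank all graph nodes by CPU time, reusing the run_metadata captured above.
opts = (tf.profiler.ProfileOptionBuilder(
            tf.profiler.ProfileOptionBuilder.time_and_memory())
        .order_by('micros')
        .build())
tf.profiler.profile(tf.get_default_graph(),
                    run_meta=run_metadata,
                    cmd='scope',
                    options=opts)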