I'm doubting whether TensorFlow is correctly configured on my GPU box, since training a simple linear regression model (batch size = 32, 1500 input features, 150 output variables) is about 100x slower per iteration on my fancy GPU machine than on my laptop.
I'm using a Titan X with a modern CPU, etc. nvidia-smi says I'm only at 10% GPU utilization, but I expect that's because of the small batch sizes. I'm not using a feed_dict to move data into the computation graph; everything comes in via tf.decode_csv and tf.train.shuffle_batch.
Does anyone have any recommendations for how to easily test whether my install is correct? Are there any simple speed benchmarks? The speed difference between my laptop and the GPU machine is so dramatic that I suspect things aren't configured properly.
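For reference, the input pipeline looks roughly like this (just a sketch; the filename, column layout, and queue capacities below are placeholders, not my exact code):

```python
import tensorflow as tf

# Hypothetical CSV file; each row assumed to hold 1500 features + 150 targets, all floats.
filename_queue = tf.train.string_input_producer(["train.csv"])
reader = tf.TextLineReader()
_, line = reader.read(filename_queue)

record_defaults = [[0.0]] * (1500 + 150)
columns = tf.decode_csv(line, record_defaults=record_defaults)
features = tf.stack(columns[:1500])
targets = tf.stack(columns[1500:])

# Batch examples via a shuffling queue instead of feed_dict.
feature_batch, target_batch = tf.train.shuffle_batch(
    [features, targets],
    batch_size=32,
    capacity=10000,
    min_after_dequeue=1000)
```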
To run benchmarks on an iOS device, you need to build the app from source. Put the TensorFlow Lite model file in the benchmark_data directory of the source tree and modify the benchmark_params.json file. Those files are packaged into the app, and the app reads data from that directory.
In addition to Session(config=tf.ConfigProto(log_device_placement=True)), which is outlined in other answers as well as in the official TensorFlow documentation, you can try to assign a computation to the GPU and see whether you get an error. "/cpu:0": the CPU of your machine. "/gpu:0": the GPU of your machine, if you have one.
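For example, a minimal placement check along these lines (the constants and shapes are just illustrative):

```python
import tensorflow as tf

# Pin a small computation to the GPU; if no GPU is usable,
# running this graph will raise an error.
with tf.device('/gpu:0'):
    a = tf.constant([[1.0, 2.0], [3.0, 4.0]], name='a')
    b = tf.constant([[1.0, 1.0], [0.0, 1.0]], name='b')
    c = tf.matmul(a, b)

# log_device_placement prints which device each op was assigned to.
with tf.Session(config=tf.ConfigProto(log_device_placement=True)) as sess:
    print(sess.run(c))
```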
Try tensorflow/tensorflow/models/image/mnist/convolutional.py; that will print per-step timing. On a Tesla K40c it should take about 16 ms per step, versus about 120 ms per step for CPU-only on my 3-year-old machine.
Edit: this has moved to the models repository: https://github.com/tensorflow/models/blob/master/tutorials/image/mnist/convolutional.py. The convolutional.py file is now at models/tutorials/image/mnist/convolutional.py.
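If you want something even smaller than the MNIST script, a rough timing sketch like the following (matrix size and iteration count are arbitrary, and this is not an official benchmark) can confirm whether the GPU is actually doing the work:

```python
import time
import tensorflow as tf

def time_matmul(device, n=4000, iters=10):
    # Time a large matmul on the given device.
    tf.reset_default_graph()
    with tf.device(device):
        a = tf.random_normal([n, n])
        b = tf.random_normal([n, n])
        c = tf.matmul(a, b)
    with tf.Session() as sess:
        sess.run(c)  # warm-up run
        start = time.time()
        for _ in range(iters):
            sess.run(c)
        return (time.time() - start) / iters

print("GPU:", time_matmul('/gpu:0'))
print("CPU:", time_matmul('/cpu:0'))
```

The GPU timing should be dramatically lower than the CPU timing; if the two are comparable, the GPU is likely not being used.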