Why is my GPU slower than CPU when training LSTM/RNN models?

My machine has the following spec:

CPU: Xeon E5-1620 v4

GPU: Titan X (Pascal)

Ubuntu 16.04

Nvidia driver 375.26

CUDA toolkit 8.0

cuDNN 5.1

I've benchmarked the following Keras examples with TensorFlow as the backend:

SCRIPT NAME                  GPU       CPU
stateful_lstm.py             5sec      5sec
babi_rnn.py                  10sec     12sec
imdb_bidirectional_lstm.py   240sec    116sec
imdb_lstm.py                 113sec    106sec

My GPU is clearly outperforming my CPU in non-LSTM models:

SCRIPT NAME                  GPU       CPU
cifar10_cnn.py               12sec     123sec
imdb_cnn.py                  5sec      119sec
mnist_cnn.py                 3sec      47sec 

Has anyone else experienced this?

asked Jan 31 '17 by agsolid


3 Answers

If you use Keras, use CuDNNLSTM in place of LSTM, or CuDNNGRU in place of GRU. In my case (2 x Tesla M60), I am seeing a 10x performance boost. By the way, I am using a batch size of 128, as suggested by @Alexey Golyshev.
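
A minimal sketch of the swap, assuming Keras 2.0.9+ on the TensorFlow backend with a CUDA-capable GPU; the vocabulary size, layer widths and IMDB-style setup are placeholders, not taken from the benchmark scripts:

from keras.models import Sequential
from keras.layers import Embedding, Dense, CuDNNLSTM  # CuDNNGRU is the GRU counterpart

model = Sequential()
model.add(Embedding(20000, 128))           # placeholder vocabulary size and embedding dim
model.add(CuDNNLSTM(128))                  # drop-in for LSTM(128), backed by the cuDNN kernel
model.add(Dense(1, activation='sigmoid'))
model.compile(loss='binary_crossentropy', optimizer='adam', metrics=['accuracy'])
# model.fit(x_train, y_train, batch_size=128, epochs=2)  # batch size 128 per the suggestion above

Note that CuDNNLSTM only supports the default activations and has no recurrent dropout, so it is not a drop-in replacement for every LSTM configuration. In more recent TensorFlow/Keras 2.x releases the standard LSTM layer selects the cuDNN kernel automatically when its arguments allow it.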

answered by neurite


Your batch size is too small. Try increasing it (a sketch of where batch_size is set follows the tables below).

Results for my GTX1050Ti:

imdb_bidirectional_lstm.py
batch_size      time (sec)
32 (default)    252
64              131
96              87
128             66

imdb_lstm.py
batch_size      time (sec)
32 (default)    108
64              50
96              34
128             25
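
For completeness, a minimal sketch (hypothetical data and layer sizes, roughly mirroring the imdb_lstm.py setup) showing where batch_size is passed in a Keras training run:

import numpy as np
from keras.models import Sequential
from keras.layers import Embedding, LSTM, Dense

# fake IMDB-like data: 1000 sequences of 80 word indices, binary labels
x_train = np.random.randint(0, 20000, size=(1000, 80))
y_train = np.random.randint(0, 2, size=(1000,))

model = Sequential()
model.add(Embedding(20000, 128))
model.add(LSTM(128))
model.add(Dense(1, activation='sigmoid'))
model.compile(loss='binary_crossentropy', optimizer='adam', metrics=['accuracy'])

model.fit(x_train, y_train, batch_size=128, epochs=1)  # a larger batch_size keeps the GPU busier
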
answered by Alexey Golyshev


Just a tip.

Using a GPU pays off when:

1. your neural network model is big.
2. your batch size is big.

This is what I found from googling; a rough way to check it on your own machine is sketched below.
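
A rough timing sketch along those lines (layer sizes, data shapes and batch size are made up, not taken from any of the scripts above): run it once on the GPU and once with CUDA_VISIBLE_DEVICES="" and watch how the gap widens as the recurrent layer grows.

import time
import numpy as np
from keras.models import Sequential
from keras.layers import LSTM, Dense

# placeholder data: 2000 samples, 50 timesteps, 64 features
x = np.random.rand(2000, 50, 64).astype('float32')
y = np.random.randint(0, 2, size=(2000,))

for units in (32, 512):  # small vs. large recurrent layer
    model = Sequential([LSTM(units, input_shape=(50, 64)),
                        Dense(1, activation='sigmoid')])
    model.compile(loss='binary_crossentropy', optimizer='adam')
    start = time.time()
    model.fit(x, y, batch_size=128, epochs=1, verbose=0)
    print('%d units: %.1f sec' % (units, time.time() - start))
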

answered by Dane Lee