How to enable Keras with Theano to utilize multiple GPUs

Tags:

Setup:

Using a Amazon Linux system with a Nvidia GPU
I'm using Keras 1.0.1
Running Theano v0.8.2 backend
Using CUDA and CuDNN
THEANO_FLAGS="device=gpu,floatX=float32,lib.cnmem=1"

Everything works fine, but I run out of video memory on large models when I increase the batch size to speed up training. I figure moving to a 4 GPU system would in theory either improve total memory available or allow smaller batches to build faster, but observing the the nvidia stats, I can see only one GPU is used by default:

+------------------------------------------------------+ 
| NVIDIA-SMI 361.42     Driver Version: 361.42         |         
|-------------------------------+----------------------+----------------------+ 
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC | 
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |    
|===============================+======================+======================| 
|   0  GRID K520           Off  | 0000:00:03.0     Off |                  N/A | 
| N/A   44C    P0    45W / 125W |   3954MiB /  4095MiB |     94% Default      |
+-------------------------------+----------------------+----------------------+ 
|   1  GRID K520           Off  | 0000:00:04.0     Off |               N/A    | 
| N/A   28C    P8    17W / 125W |     11MiB /  4095MiB |        0% Default    |
+-------------------------------+----------------------+----------------------+ 
|   2  GRID K520           Off  | 0000:00:05.0     Off |               N/A    | 
| N/A   32C    P8    17W / 125W |     11MiB /  4095MiB |           0% Default |
+-------------------------------+----------------------+----------------------+ 
|   3  GRID K520           Off  | 0000:00:06.0     Off |                N/A   |     
| N/A   29C    P8    17W / 125W |     11MiB /  4095MiB |           0% Default |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+ 
| Processes:                                                       GPU Memory | 
|  GPU       PID  Type  Process name                               Usage      | 
|=============================================================================| 
|    0      9862    C   python34                                      3941MiB |

I know with raw Theano you can use manually multiple GPU's explicitly. Does Keras support use of multiple GPU's? If so, does it abstract it or do you need to map the GPU's to devices as in Theano and explicitly marshall computations to specific GPU's?

248

asked May 02 '16 22:05

Ray

1 Answers

Multi-GPU training is experimental ("The code is rather new and is still considered experimental at this point. It has been tested and seems to perform correctly in all cases observed, but make sure to double-check your results before publishing a paper or anything of the sort.") and hasn't been integrated into Keras yet. However, you can use multiple GPUs with Keras with the Tensorflow backend: https://blog.keras.io/keras-as-a-simplified-interface-to-tensorflow-tutorial.html#multi-gpu-and-distributed-training.

175

answered Oct 12 '22 17:10

1''

Related questions
                            
                                Keras does not use GPU - how to troubleshoot?
                            
                                Using random numbers with GPUs
                            
                                How does a graphics driver programmatically communicate from CPU to GPU?
                            
                                tensorflow code optimization strategy
                            
                                Are triangles a gpu restriction or are there other rendering pathways?
                            
                                nvidia-smi does not display memory usage [closed]
                            
                                Solving dense linear systems AX = B with CUDA
                            
                                How to get memory bandwidth from memory clock/memory speed
                            
                                Could not satisfy explicit device specification '/device:GPU:0' because no devices matching
                            
                                OpenGL GPU Memory cleanup, required?
                            
                                How to convert GpuMat to CvMat in OpenCV?
                            
                                How do I use Nvidia Multi-process Service (MPS) to run multiple non-MPI CUDA applications?
                            
                                Anaconda Integration with Cuda 9.0 shows Incompatible Package Error
                            
                                Is it possible to build an `nvidia/cuda`-based image on a server without a GPU?
                            
                                OpenGL (ES 2.0) VBO Performances in a Shared Memory Architecture
                            
                                MPI + GPU : how to mix the two techniques
                            
                                Why does CUDA float program get faster in full speed FP64 mode?
                            
                                Calculate average of pixels in the front buffer of the gpu without copying the front buffer back to system memory
                            
                                Speeding up rendering in SceneKit
                            
                                Specify either CPU or GPU for multiple models tensorflow java's job

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to enable Keras with Theano to utilize multiple GPUs

Tags:

gpu

keras

cudnn

theano

theano-cuda

Ray

People also ask

1 Answers

1''

Recent Activity

Donate For Us