How to effectively make use of a GPU for reinforcement learning?

Tags:

Recently i looked into reinforcement learning and there was one question bugging me, that i could not find an answer for: How is training effectively done using GPUs? To my understanding constant interaction with an environment is required, which for me seems like a huge bottleneck, since this task is often non-mathematical / non-parallelizable. Yet for example Alpha Go uses multiple TPUs/GPUs. So how are they doing it?

465

asked Mar 08 '18 13:03

Konstantin

1 Answers

Indeed, you will often have interactions with the environment in between learning steps, which will often be better off running on CPU than GPU. So, if your code for taking actions and your code for running an update / learning step are very fast (as in, for example, tabular RL algorithms), it won't be worth the effort of trying to get those on the GPU.

However, when you have a big neural network, that you need to go through whenever you select an action or run a learning step (as is the case in most of the Deep Reinforcement Learning approaches that are popular these days), the speedup of running these on GPU instead of CPU is often enough for it to be worth the effort of running them on GPU (even if it means you're quite regularly ''switching'' between CPU and GPU, and may need to copy some things from RAM to VRAM or the other way around).

143

answered Oct 16 '22 03:10

Dennis Soemers

Related questions
                            
                                How can I test for OpenCL compatibility?
                            
                                Compile OpenCV without GPU?
                            
                                How to install nvidia apex on Google Colab
                            
                                get the CUDA and CUDNN version on windows with Anaconda installe
                            
                                Minimum distances among a Euclidean distance matrix
                            
                                Does C# natively use GPU for graphics?
                            
                                How to match OpenCL devices with a specific GPU given PCI vendor, device and bus IDs in a multi-GPU system?
                            
                                why does a*b*a take longer than (a'*(a*b)')' when using gpuArray in Matlab scripts?
                            
                                pytorch delete model from gpu
                            
                                How to free up all memory pytorch is taken from gpu memory
                            
                                How to find epsilon, min and max constants for CUDA?
                            
                                Julia in Google Colab
                            
                                What is the best way to handle FBOs in OpenGL?
                            
                                Error: OOM when allocating tensor with shape
                            
                                Multiple processes launching CUDA kernels in parallel
                            
                                Any particular function to initialize GPU other than the first cudaMalloc call?
                            
                                keras version to use with tensorflow-gpu 1.4
                            
                                CuDNNLSTM: Failed to call ThenRnnForward
                            
                                In OpenCL, what is the difference between platform, context, and device?
                            
                                How can I use GPU again on Google Colab after exceeding usage limit?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to effectively make use of a GPU for reinforcement learning?

Tags:

gpu

reinforcement-learning

Konstantin

People also ask

1 Answers

Dennis Soemers

Recent Activity

Donate For Us