I have some questions about using the torch.multiprocessing module. Let's say I have a torch.nn.Module called model and I call model.share_memory() on it.
What happens if two threads call forward(), i.e. model(input), at the same time? Is it safe, or should I use a Lock to make sure model is not accessed by multiple threads at once?
Similarly, what happens if two or more threads have an optimizer working on model.parameters() and they call optimizer.step() at the same time?
I ask these questions because I often see optimizer.step() being called on shared models without any lock mechanism (e.g. in RL implementations of A3C or ACER), and I wonder whether that is safe to do.
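Concretely, the pattern I'm asking about looks something like this (the model, loss, and hyperparameters here are placeholders I made up):

```python
import torch
import torch.multiprocessing as mp
import torch.nn as nn
import torch.optim as optim

def worker(model):
    # Each worker builds its own optimizer over the shared parameters.
    optimizer = optim.SGD(model.parameters(), lr=1e-2)
    for _ in range(100):
        inp = torch.randn(1, 10)
        loss = model(inp).sum()  # placeholder loss
        optimizer.zero_grad()
        loss.backward()          # gradients stay local to this process
        optimizer.step()         # updates the shared parameters, with no lock

if __name__ == "__main__":
    model = nn.Linear(10, 5)
    model.share_memory()         # move the parameters into shared memory
    workers = [mp.Process(target=worker, args=(model,)) for _ in range(2)]
    for p in workers:
        p.start()
    for p in workers:
        p.join()
```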
torch.multiprocessing is a drop-in replacement for Python's multiprocessing module. It supports the exact same operations, but extends it so that all tensors sent through a multiprocessing.Queue will have their data moved into shared memory and will only send a handle to another process.
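For example, here is a minimal sketch I wrote to illustrate that behavior (the function and variable names are my own): because the receiving process gets a handle to the same shared-memory storage, an in-place edit on its side is visible to the sender.

```python
import torch
import torch.multiprocessing as mp

def consumer(q):
    t = q.get()  # only a handle is received; the data lives in shared memory
    t += 1       # in-place update, visible to the producer

if __name__ == "__main__":
    q = mp.Queue()
    t = torch.zeros(3)
    p = mp.Process(target=consumer, args=(q,))
    p.start()
    q.put(t)     # the tensor's storage is moved into shared memory here
    p.join()
    print(t)     # tensor([1., 1., 1.])
```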
Python supports multiprocessing, i.e. running several processes simultaneously, so a program can split its work into many tasks that execute at the same time.
multiprocessing is a package that supports spawning processes using an API similar to the threading module. The multiprocessing package offers both local and remote concurrency, effectively side-stepping the Global Interpreter Lock by using subprocesses instead of threads.
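The basic API mirrors threading; this is essentially the introductory example from the Python docs:

```python
from multiprocessing import Process

def f(name):
    print('hello', name)

if __name__ == '__main__':
    p = Process(target=f, args=('bob',))
    p.start()
    p.join()
```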
It doesn't have to be guarded by locks: the workers run in separate processes and update the shared parameters asynchronously, and A3C-style training deliberately tolerates such lock-free (Hogwild-style) updates. Quoting from the docs:
Using torch.multiprocessing, it is possible to train a model asynchronously, with parameters either shared all the time, or being periodically synchronized. In the first case, we recommend sending over the whole model object, while in the latter, we advise to only send the state_dict().
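So in the periodic-synchronization case, a worker would receive only the state_dict() and load it into its own local copy, roughly like this (a sketch under my own naming, not code from the docs):

```python
import torch.multiprocessing as mp
import torch.nn as nn

def worker(q):
    local_model = nn.Linear(10, 5)      # the worker's own copy of the model
    state = q.get()                     # receive only the state_dict
    local_model.load_state_dict(state)  # synchronize with the sender's weights

if __name__ == "__main__":
    model = nn.Linear(10, 5)
    q = mp.Queue()
    p = mp.Process(target=worker, args=(q,))
    p.start()
    q.put(model.state_dict())  # the trainer would resend this periodically
    p.join()
```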