How to use CUDA streams in PyTorch?

Tags: python, pytorch

I want to use CUDA streams in PyTorch to run some computations in parallel, but I don't know how to do it. For instance, if there are two tasks, A and B, that need to be parallelized, I want to do something like the following (pseudocode):

stream0 = torch.get_stream()
stream1 = torch.get_stream()
with torch.now_stream(stream0):
    # task A
with torch.now_stream(stream1):
    # task B
torch.synchronize()
# get A and B's answer

How can I achieve this in real Python code?

asked Sep 25 '18 12:09

gasoon

1 Answer

import torch

s1 = torch.cuda.Stream()
s2 = torch.cuda.Stream()
# Initialise CUDA tensors here, e.g.:
A = torch.rand(1000, 1000, device='cuda')
B = torch.rand(1000, 1000, device='cuda')
# Wait for the above tensors to initialise.
torch.cuda.synchronize()
# Work queued inside each block is issued on that stream.
with torch.cuda.stream(s1):
    C = torch.mm(A, A)
with torch.cuda.stream(s2):
    D = torch.mm(B, B)
# Wait for C and D to be computed.
torch.cuda.synchronize()
# Do stuff with C and D.
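
If you would rather not synchronize the whole device, a possible variant of the snippet above is to make each side stream wait on the current stream and then synchronize only those two streams. This is a minimal sketch, not the only way to do it: the function name square_on_two_streams and the CPU fallback are my own additions for illustration, so the code still runs on a machine without a GPU. Whether the two matmuls actually overlap depends on the GPU and on how large the kernels are.

```python
import torch

def square_on_two_streams(n=1000):
    """Compute A @ A and B @ B, on separate CUDA streams when a GPU is present."""
    if not torch.cuda.is_available():
        # Fallback added for illustration only: no GPU, so run on the CPU.
        A = torch.rand(n, n)
        B = torch.rand(n, n)
        return torch.mm(A, A), torch.mm(B, B)

    s1 = torch.cuda.Stream()
    s2 = torch.cuda.Stream()
    A = torch.rand(n, n, device='cuda')
    B = torch.rand(n, n, device='cuda')

    # A and B were produced on the current stream, so make both
    # side streams wait for that work before they read the tensors.
    current = torch.cuda.current_stream()
    s1.wait_stream(current)
    s2.wait_stream(current)

    with torch.cuda.stream(s1):
        C = torch.mm(A, A)
    with torch.cuda.stream(s2):
        D = torch.mm(B, B)

    # Block the host only until the work queued on s1 and s2 has finished.
    s1.synchronize()
    s2.synchronize()
    return C, D

C, D = square_on_two_streams(256)
print(C.shape, D.shape)
```

Here wait_stream replaces the first torch.cuda.synchronize() and the per-stream synchronize() calls replace the second one, so unrelated work on other streams is not forced to finish.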

answered Sep 21 '22 10:09

Tomas
