Suppose I have a torch CUDA tensor and I want to apply some function like sin(), but I have explicitly defined the function F. How can I use parallel computation to apply F in PyTorch?
I think it is currently not possible to explicitly parallelize a custom function on a CUDA tensor. A possible workaround is to define your function the same way the built-in non-linear activation functions are defined, so you can feed a tensor forward through the net and through your function. The drawback is that this probably won't work as-is, because you would have to define a CUDA function and recompile PyTorch.
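That said, if F can be written as a composition of built-in torch operations, it already executes in parallel on the GPU, since each op launches its own CUDA kernel; a custom kernel is only needed for logic that cannot be expressed with tensor ops. A minimal sketch, assuming F is such a composition (the F below is a made-up example, not from the question):

```python
import torch

# Hypothetical user-defined function F, assumed to be a composition
# of existing torch tensor ops (sin, mul, add, ...). Each op launches
# a parallel CUDA kernel, so no explicit Python loop is needed.
def F(x):
    return torch.sin(x) * x + 1.0

device = "cuda" if torch.cuda.is_available() else "cpu"
x = torch.randn(1_000_000, device=device)
y = F(x)  # applied element-wise, in parallel, on the GPU

if device == "cuda":
    torch.cuda.synchronize()  # block until all queued kernels finish
```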