Why does the transposed matrix look different when converted to a pycuda.gpuarray?
Can you reproduce this? What could cause this? Am I using the wrong approach?
Example code
from pycuda import gpuarray
import pycuda.autoinit
import numpy

data = numpy.random.randn(2, 4).astype(numpy.float32)
data_gpu = gpuarray.to_gpu(data.T)

print("data\n", data)
print("data_gpu.get()\n", data_gpu.get())
print("data.T\n", data.T)
Output
data
[[ 0.70442784 0.08845157 -0.84840715 -1.81618035]
[ 0.55292499 0.54911566 0.54672164 0.05098847]]
data_gpu.get()
[[ 0.70442784 0.08845157]
[-0.84840715 -1.81618035]
[ 0.55292499 0.54911566]
[ 0.54672164 0.05098847]]
data.T
[[ 0.70442784 0.55292499]
[ 0.08845157 0.54911566]
[-0.84840715 0.54672164]
[-1.81618035 0.05098847]]
The basic reason is that numpy's transpose only creates a view, which has no effect on the underlying array storage, and it is that storage which PyCUDA accesses directly when copying to device memory. The solution is to use the copy method when doing the transpose, which creates an array with the data in transposed order in host memory, and then copy that to the device:
data_gpu = gpuarray.to_gpu(data.T.copy())
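For illustration, here is a minimal sketch of the full round trip (assuming PyCUDA and a CUDA device are available; the variable name host_transposed is just for clarity):
from pycuda import gpuarray
import pycuda.autoinit
import numpy

data = numpy.random.randn(2, 4).astype(numpy.float32)

# data.T is only a view; .copy() materialises the transposed layout
# in host memory before the transfer to the device.
host_transposed = data.T.copy()          # C-contiguous array of shape (4, 2)
data_gpu = gpuarray.to_gpu(host_transposed)

print(numpy.allclose(data_gpu.get(), data.T))   # expected: True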
In numpy, data.T doesn't do anything to the underlying 1D array. It simply swaps the strides to obtain the transpose, which makes it a constant-time and constant-memory operation.
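You can see this view behaviour directly in NumPy without any GPU involved; a small sketch:
import numpy

data = numpy.random.randn(2, 4).astype(numpy.float32)

print(data.shape, data.strides)       # (2, 4) (16, 4)
print(data.T.shape, data.T.strides)   # (4, 2) (4, 16) -- same buffer, swapped strides
print(data.T.base is data)            # True: the transpose is a view on data
print(data.T.flags['C_CONTIGUOUS'])   # False: not laid out row-major in memory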
It would appear that gpuarray.to_gpu() isn't respecting the strides and simply copies the underlying 1D array. That would produce exactly the behaviour you're observing.
In my view there is nothing wrong with your code. Rather, I would consider this a bug in pycuda.
I've googled around and found a thread that discusses this issue in detail.
As a workaround, you could try passing numpy.ascontiguousarray(data.T) to gpuarray.to_gpu(). This will, of course, create a second copy of the data in host RAM.
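A short sketch of that workaround (again assuming PyCUDA and a CUDA device are available):
from pycuda import gpuarray
import pycuda.autoinit
import numpy

data = numpy.random.randn(2, 4).astype(numpy.float32)

# ascontiguousarray copies the strided view into a fresh C-contiguous
# host buffer, which to_gpu() can then transfer element for element.
data_gpu = gpuarray.to_gpu(numpy.ascontiguousarray(data.T))

print(data_gpu.get())   # now matches data.T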