I am aware of how dynamic allocation works when 1D arrays are used, but how can it be done when 2D arrays are used?
myKernel<<<blocks, threads,sizeofSharedMemoryinBytes>>>();
....
__global__ void myKernel() {
__shared__ float sData[][];
.....
}
Say I want to allocate a 2D shared memory array:
__shared__ float sData[32][32];
How can it be done dynamically? Would it be:
myKernel<<< blocks, threads, sizeof(float)*32*32 >>>();
Dynamic memory allocation on the CPU/GPU: the shared memory of the GPU typically consists of 32 KB that has to be shared between all threads in one block. For single-precision floating-point vectors or matrices and 1024 threads per block, the maximum amount of shared memory per thread is 32 KB / (1024 * 4 B) = 8 elements. When the amount of memory that an individual thread uses is too large to fit in shared memory or in registers, it has to be placed in global memory instead.
There are two ways in which we can allocate shared memory: static and dynamic. If we know the amount of required shared memory at compile time, we can use static shared memory. Static shared memory always has to be declared inside a kernel, with the following syntax:
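A minimal sketch of a static declaration (the kernel name and the 32x32 tile size are only illustrative, echoing the question):

__global__ void staticSharedKernel() {
    // The dimensions must be compile-time constants, so a true 2D array is possible.
    __shared__ float sTile[32][32];
    // Assuming the block was launched with dim3 threads(32, 32):
    sTile[threadIdx.y][threadIdx.x] = 0.0f;
    // ...
}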
As you have correctly written, you have to specify the size of the dynamically allocated shared memory in the execution configuration of each kernel call (the third argument in <<<blocks, threads, sizeofSharedMemoryinBytes>>>). This specifies the number of bytes of shared memory that is dynamically allocated per block for this call, in addition to the statically allocated memory. IMHO there is no way to access such memory as a 2D array; you have to use a 1D array and index it like a 2D one. One last thing: don't forget the extern qualifier. So your code should look like this:
sizeofSharedMemoryinBytes = dimX * dimY * sizeof(float);
myKernel<<<blocks, threads, sizeofSharedMemoryinBytes>>>();
....
__global__ void myKernel() {
    // Dynamically allocated shared memory must be declared as an unsized extern 1D array.
    extern __shared__ float sData[];
    .....
    // Index the 1D buffer as a 2D array in row-major order: element (x, y) is at dimX * y + x.
    sData[dimX * y + x] = ...
}
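For completeness, here is a self-contained sketch of the whole pattern; the kernel name fillTile, the 32x32 tile size, and the fill values are illustrative assumptions, not part of the original question:

#include <cstdio>
#include <cuda_runtime.h>

// Kernel that uses dynamically allocated shared memory, indexed as a 2D tile.
__global__ void fillTile(float *out, int dimX, int dimY) {
    extern __shared__ float sData[];   // dimX * dimY floats, size supplied at launch
    int x = threadIdx.x;
    int y = threadIdx.y;
    if (x < dimX && y < dimY) {
        sData[dimX * y + x] = (float)(dimX * y + x);   // treat the 1D buffer as 2D
    }
    __syncthreads();
    if (x < dimX && y < dimY) {
        out[dimX * y + x] = sData[dimX * y + x];
    }
}

int main() {
    const int dimX = 32, dimY = 32;
    float *dOut;
    cudaMalloc(&dOut, dimX * dimY * sizeof(float));

    dim3 threads(dimX, dimY);   // one block of 32 x 32 threads
    size_t sizeofSharedMemoryinBytes = dimX * dimY * sizeof(float);
    fillTile<<<1, threads, sizeofSharedMemoryinBytes>>>(dOut, dimX, dimY);
    cudaDeviceSynchronize();

    float host[dimX * dimY];
    cudaMemcpy(host, dOut, sizeof(host), cudaMemcpyDeviceToHost);
    printf("element (x=3, y=5) = %f\n", host[dimX * 5 + 3]);   // prints 163.000000
    cudaFree(dOut);
    return 0;
}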