Consider the following code: <pre class="prettyprint"><code>__global__ void kernel(int *something) { extern __shared__ int shared_array[]; // Some operations on shared_array here. } </code></pre> Is it possible to set whole shared_array to some value - e.g. 0 - without explicitly addressing each cell in some thread?

You can efficiently initialize shared arrays in parallel like this <pre class="prettyprint"><code>// if SHARED_SIZE == blockDim.x, eliminate this loop for (int i = threadIdx.x; i < SHARED_SIZE; i += blockDim.x) shared_array[i] = INITIAL_VALUE; __syncthreads(); </code></pre>

No. Shared memory is uninitialised. You have to somehow initialise it yourself, one way or another... From CUDA C Programming Guide 3.2, Section B.2.4.2, paragraph 2: <blockquote> <code>__shared__</code> variables cannot have an initialization as part of their declaration. </blockquote> This also discards nontrivial default constructors for shared variables.

Is there a way of setting default value for shared memory array?

Tags:

cuda

Consider the following code:

__global__ void kernel(int *something) {
extern __shared__ int shared_array[];     

// Some operations on shared_array here.

}

Is it possible to set whole shared_array to some value - e.g. 0 - without explicitly addressing each cell in some thread?

353

asked Jun 25 '11 13:06

fsh

2 Answers

You can efficiently initialize shared arrays in parallel like this

// if SHARED_SIZE == blockDim.x, eliminate this loop
for (int i = threadIdx.x; i < SHARED_SIZE; i += blockDim.x) 
    shared_array[i] = INITIAL_VALUE;
__syncthreads();

answered Oct 15 '22 03:10

harrism

No. Shared memory is uninitialised. You have to somehow initialise it yourself, one way or another...

From CUDA C Programming Guide 3.2, Section B.2.4.2, paragraph 2:

__shared__ variables cannot have an initialization as part of their declaration.

This also discards nontrivial default constructors for shared variables.

answered Oct 15 '22 02:10

CygnusX1

Related questions
                            
                                Branch and predicated instructions
                            
                                What does "persistence mode" actually do which reduces CUDA startup time?
                            
                                How to separate CUDA code into multiple files
                            
                                Why is the constant memory size limited in CUDA?
                            
                                Get GPU memory usage programmatically
                            
                                Problems when running nvcc from command line
                            
                                Matrix multiplication on CPU (numpy) and GPU (gnumpy) give different results
                            
                                How is 2D Shared Memory arranged in CUDA
                            
                                CUDA allocate memory in __device__ function
                            
                                How to run CUDA without a GPU using a software implementation?
                            
                                How to Run a cuda code using remote Desktop?
                            
                                CUDA version X complains about not supporting gcc version Y - what to do?
                            
                                CUDA exp() expf() and __expf()
                            
                                How do I enable syntax highlighting of CUDA .cu files in Visual Studio 2010?
                            
                                Installing cuda 5 samples in Ubuntu 12.10
                            
                                Block reduction in CUDA
                            
                                Cuda Image average filter
                            
                                How to enable syntax highlighting for CUDA .cu and .cuh files in Vim?
                            
                                Random Number Generator in CUDA
                            
                                Error in cudaMemcpyToSymbol using CUDA 5

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With