I am using the CUDA SDK 4.0 and am encountering an issue that has taken me two days to whittle down into the following code.
#include <cuda.h>
#include <cuda_runtime.h>
int main(int argc, char **argv) {
    int *test;
    cudaError_t err;
    err = cudaSetDevice(1); err = cudaMallocHost(&test, 1024 * sizeof(int));
    err = cudaSetDevice(0); err = cudaFreeHost(test);
    return 0;
}
This throws the following error when calling cudaFreeHost:
First-chance exception at 0x000007fefd96aa7d in Test.exe: Microsoft C++ exception: cudaError_enum at memory location 0x0022f958.
The err value is cudaErrorInvalidValue.
The same error occurs for this variation:
err = cudaSetDevice( 0 ); err = cudaMallocHost(&test, 1024*sizeof(int));
err = cudaSetDevice( 1 ); err = cudaFreeHost(test);
The following variations don't throw the error:
err = cudaSetDevice( 0 ); err = cudaMallocHost(&test, 1024*sizeof(int));
err = cudaSetDevice( 0 ); err = cudaFreeHost(test);
and
err = cudaSetDevice( 1 ); err = cudaMallocHost(&test, 1024*sizeof(int));
err = cudaSetDevice( 1 ); err = cudaFreeHost(test);
I was under the impression that you only needed to call cudaSetDevice if you wanted to allocate memory on a specific GPU. In the example above I am only allocating pinned memory on the CPU.
Is this a bug or did I miss something in the manual?
Memory management with CUDA works much like it does in ordinary CPU programming: you allocate buffers on the host and on the device, transfer the input data to the device through the runtime API, transfer the results back to the host, and finally free everything you allocated.
cudaMalloc allocates memory on the device, much like malloc allocates memory on the host. Memory allocated with cudaMalloc must be freed with cudaFree.
cudaMallocHost allocates page-locked (pinned) memory on the host, which must be freed with cudaFreeHost.
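To make that cycle concrete, here is a minimal sketch of the allocate / transfer / free pattern described above; the buffer names and sizes are illustrative, not taken from the question.

#include <cuda_runtime.h>

int main(void) {
    const size_t n = 1024;
    int *h_buf;   // pinned host buffer
    int *d_buf;   // device buffer

    cudaMallocHost((void **)&h_buf, n * sizeof(int));  // page-locked host memory
    cudaMalloc((void **)&d_buf, n * sizeof(int));      // device memory

    for (size_t i = 0; i < n; ++i) h_buf[i] = (int)i;  // fill on the host

    cudaMemcpy(d_buf, h_buf, n * sizeof(int), cudaMemcpyHostToDevice);  // host -> device
    // ... launch kernels that work on d_buf ...
    cudaMemcpy(h_buf, d_buf, n * sizeof(int), cudaMemcpyDeviceToHost);  // device -> host

    cudaFree(d_buf);      // device memory is freed with cudaFree
    cudaFreeHost(h_buf);  // pinned host memory is freed with cudaFreeHost
    return 0;
}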
The CUDA in-kernel malloc() function allocates at least size bytes from the device heap and returns a pointer to the allocated memory or NULL if insufficient memory exists to fulfill the request.
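As a rough sketch of how that in-kernel allocator is used (device-side malloc requires compute capability 2.0 or later; the kernel name and heap size here are illustrative assumptions):

#include <cuda_runtime.h>

__global__ void scratch_kernel(void) {
    // Allocate a per-thread scratch buffer from the device heap.
    int *scratch = (int *)malloc(256 * sizeof(int));
    if (scratch == NULL) return;  // malloc returns NULL when the device heap is exhausted
    scratch[0] = threadIdx.x;
    free(scratch);                // must be released with the in-kernel free()
}

int main(void) {
    // Optionally enlarge the device heap before the first launch (8 MB here, arbitrary).
    cudaDeviceSetLimit(cudaLimitMallocHeapSize, 8 * 1024 * 1024);
    scratch_kernel<<<1, 32>>>();
    cudaDeviceSynchronize();
    return 0;
}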
I found the problem. cudaHostAlloc and cudaMallocHost ARE NOT THE SAME.
For anyone who encounters this problem, the solution is to use
cudaHostAlloc(&test, 1024*sizeof(int), cudaHostAllocPortable);
instead of
cudaMallocHost(&test, 1024*sizeof(int));
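Applied to the repro from the question, the portable variant looks roughly like this (error checking omitted): cudaHostAllocPortable marks the pinned allocation as usable by all CUDA contexts, not just the one that allocated it, which is why it can be freed after switching devices.

#include <cuda.h>
#include <cuda_runtime.h>

int main(int argc, char **argv) {
    int *test;
    cudaError_t err;
    // Portable pinned memory is visible to every CUDA context, so it can be
    // allocated under one device and freed under another.
    err = cudaSetDevice(1);
    err = cudaHostAlloc((void **)&test, 1024 * sizeof(int), cudaHostAllocPortable);
    err = cudaSetDevice(0);
    err = cudaFreeHost(test);   // no cudaErrorInvalidValue this time
    return 0;
}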