I have the following code:
main.cu:
#include "class.h"
int main() {}
class.h:
class Class {
    __global__
    void Function() {}
};
When I compile this code with the command nvcc -c main.cu -o main.o, I get the following warning and error:
class.h(3): warning: inline qualifier ignored for "global" function
class.h(3): error: illegal combination of memory qualifiers
I have a question about each of these messages. Why does it "ignore" the __global__ qualifier for the function, and why is the __global__ memory qualifier illegal in this context? I have read in the documentation that
E.2.10.2. Function Members
Static member functions cannot be __global__ functions.
However, my function is not a static member, as far as I know. Removing the __global__ line allows it to compile, and so does moving the __global__ and void Function(); lines into main.cu. If this actually ISN'T allowed, why does CUDA impose this limitation, and what is a way to get around it while still maintaining structured code?
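For reference, here is the file-scope version that compiles for me (the launch in main is added only to show that the kernel is usable):
// main.cu, with the kernel moved to file scope
__global__ void Function() {}

int main() {
    Function<<<1, 1>>>();  // launching from ordinary host code works
    cudaDeviceSynchronize();
    return 0;
}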
To clarify, I know of no other way to make classes with functions that can launch GPU kernels. It seems to me that kernels can only be created as free functions in main.cu. I am fairly new to CUDA programming, so I may just be missing some CUDA conventions that were unclear to me. If that is the case, please let me know so I can keep up with proper programming practice.
If you google "cuda global class member" you'll find a number of treatments of this, including SO questions like here and here; your question is arguably a duplicate of those. As a simple suggestion, you could wrap your CUDA kernels in host-callable class member functions to "keep up with proper programming practice."
My understanding is that you want to use CUDA kernels in an OOP fashion. If that is the case, the class structure below should work:
// myclass.h
class MyClass {
public:
    void call_kernel( ... );
};

// myclass.cu
__global__
void my_kernel( ... ) {
    // do some work
}

void MyClass::call_kernel( ... ) {
    // prepare data for the kernel, e.g. allocate device memory, copy from host to device, etc.
    // run the kernel
    my_kernel<<< ... >>>( ... );
    // copy results from device to host, clean up, etc.
}
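For concreteness, here is one way the sketch might be filled in; the element-doubling kernel and the parameter names are hypothetical, just to show the allocate/copy/launch/copy-back flow:
// myclass.h
class MyClass {
public:
    void call_kernel(const float* host_in, float* host_out, int n);
};

// myclass.cu
#include "myclass.h"
#include <cuda_runtime.h>

// hypothetical kernel: doubles each input element
__global__
void my_kernel(const float* in, float* out, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) out[i] = 2.0f * in[i];
}

void MyClass::call_kernel(const float* host_in, float* host_out, int n) {
    float *d_in, *d_out;
    cudaMalloc(&d_in, n * sizeof(float));    // allocate device memory
    cudaMalloc(&d_out, n * sizeof(float));
    cudaMemcpy(d_in, host_in, n * sizeof(float), cudaMemcpyHostToDevice);

    int block = 256;
    int grid = (n + block - 1) / block;
    my_kernel<<<grid, block>>>(d_in, d_out, n);  // run the kernel

    cudaMemcpy(host_out, d_out, n * sizeof(float), cudaMemcpyDeviceToHost);
    cudaFree(d_in);    // clean up
    cudaFree(d_out);
}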
Please note that if you have multiple classes containing kernel code, their source files should all use the .cu extension, and you should enable separate compilation.
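For example, a build with two such classes might look like the following; the file names are placeholders, and -rdc=true (relocatable device code) is what enables separate compilation:
nvcc -rdc=true -c myclass.cu -o myclass.o
nvcc -rdc=true -c otherclass.cu -o otherclass.o
nvcc -rdc=true main.cu myclass.o otherclass.o -o app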