Different ways to make kernel

Tags:

c++

opencl

In this tutorial there are two methods to run the kernel, and another one is mentioned in the comments. The first is:

cl::KernelFunctor simple_add(cl::Kernel(program,"simple_add"),queue,cl::NullRange,cl::NDRange(10),cl::NullRange);
simple_add(buffer_A,buffer_B,buffer_C);

However, I found out that KernelFunctor has been removed.

So I tried the alternative way:

cl::Kernel kernel_add=cl::Kernel(program,"simple_add");
kernel_add.setArg(0,buffer_A);
kernel_add.setArg(1,buffer_B);
kernel_add.setArg(2,buffer_C);
queue.enqueueNDRangeKernel(kernel_add,cl::NullRange,cl::NDRange(10),cl::NullRange);
queue.finish();

It compiles and runs successfully.

However, there is a third option in the comments:

cl::make_kernel simple_add(cl::Kernel(program,"simple_add"));
cl::EnqueueArgs eargs(queue,cl::NullRange,cl::NDRange(10),cl::NullRange);
simple_add(eargs, buffer_A,buffer_B,buffer_C).wait();

This does not compile; I think make_kernel needs template arguments. I'm new to OpenCL and didn't manage to fix the code.

My question is:

1. How should I modify the third snippet so that it compiles?

2. Which way is better, and why: option 2 or option 3?

asked Jan 25 '14 15:01

otisonoza

1 Answer

You can check the OpenCL C++ Bindings Specification for a detailed description of the cl::make_kernel API (in section 3.6.1), which includes an example of usage.

In your case, you could write something like this to create the kernel functor:

auto simple_add = cl::make_kernel<cl::Buffer&, cl::Buffer&, cl::Buffer&>(program, "simple_add");

Your second question is primarily opinion-based, so it is difficult to answer. One could argue that the kernel functor approach is simpler, as it allows you to 'call' the kernel almost as if it were an ordinary function and pass the arguments in a familiar manner. The alternative approach (option 2 in your question) is more explicit about setting arguments and enqueuing the kernel, and more closely mirrors how you would write the same code using the OpenCL C API. Which method you use is entirely down to personal preference.
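To make the comparison concrete without needing an OpenCL installation, here is a minimal stand-in sketch. It is not the OpenCL API: FakeKernel, FakeMakeKernel and styles_agree are hypothetical names, and plain ints take the place of the cl::Buffer arguments. It assumes that a kernel functor does little more than bind each argument in order and then enqueue the kernel, which is my understanding of the idea rather than a description of the actual bindings. It also shows why the template arguments are needed: the constructor receives only a name, so the argument types have to be supplied by the caller.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for cl::Kernel: it only records what it is given.
struct FakeKernel {
    std::string name;
    std::vector<int> args;  // stand-in for the bound buffers
    int launches = 0;

    explicit FakeKernel(std::string n) : name(std::move(n)) {}

    // Explicit style: bind one argument by index.
    void setArg(std::size_t index, int value) {
        if (args.size() <= index) args.resize(index + 1);
        args[index] = value;
    }

    // Stand-in for enqueueNDRangeKernel: here it just counts launches.
    void enqueue() { ++launches; }
};

// Hypothetical stand-in for cl::make_kernel. The constructor sees only a
// name, which says nothing about argument types, so the caller spells them out.
template <typename... Ts>
class FakeMakeKernel {
public:
    explicit FakeMakeKernel(std::string name) : kernel_(std::move(name)) {}

    // Functor style: bind every argument in order, then enqueue.
    FakeKernel& operator()(Ts... values) {
        std::size_t index = 0;
        // Pack expansion in a braced list evaluates left to right.
        int expand[] = {0, (kernel_.setArg(index++, values), 0)...};
        (void)expand;
        kernel_.enqueue();
        return kernel_;
    }

private:
    FakeKernel kernel_;
};

// Runs both styles and reports whether they ended up in the same state.
bool styles_agree() {
    // Option 2 style: explicit setArg calls followed by an enqueue.
    FakeKernel explicit_kernel("simple_add");
    explicit_kernel.setArg(0, 10);
    explicit_kernel.setArg(1, 20);
    explicit_kernel.setArg(2, 30);
    explicit_kernel.enqueue();

    // Option 3 style: one call, with the types given as template arguments.
    FakeMakeKernel<int, int, int> simple_add("simple_add");
    FakeKernel& functor_kernel = simple_add(10, 20, 30);

    return explicit_kernel.args == functor_kernel.args &&
           explicit_kernel.launches == functor_kernel.launches;
}
```

Calling styles_agree() from a main function should return true, since both paths bind the same three values and launch once. Under that assumption the choice between the two styles is about readability rather than behaviour.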

answered Oct 02 '22 23:10

jprice
