I'm trying to find ways to copy multidimensional arrays from host to device in opencl and thought an approach was to use an image... which can be 1, 2, or 3 dimensional objects. However I'm confused because when reading a pixle from an array, they are using vector datatypes. Normally I would think double pointer, but it doesn't sound like that is what is meant by vector datatypes. Anyway here are my questions: 1) What is actually meant to vector datatype, why wouldn't we just specify 2 or 3 indices when denoting pixel coordinates? It looks like a single value such as float2 is being used to denote coordinates, but that makes no sense to me. I'm looking at the function read_imageui and read_image. 2) Can the input image just be a subset of the entire image and sampler be the subset of the input image? I don't understand how the coordinates are actually specified here either since read_image() only seams to take a single value for input and a single value for sampler. 3) If doing linear algebra, should I just bite the bullet and translate 1-D array data from the buffer into multi-dim arrays in opencl? 4) I'm still interested in images, so even if what I want to do is not best for images, could you still explain questions 1 and 2? Thanks! EDIT I wanted to refine my question and ask, in the following khronos documentation they define... <pre class="prettyprint"><code> int4 read_imagei ( image2d_t image, sampler_t sampler, int2 coord) </code></pre> But nowhere can I find what image2d_t's definition or structure is supposed to be. The samething for sampler_t and int2 coord. They seem like structs to me or pointers to structs since opencl is supposed to be based on ansi c, but what are the fields of these structs or how do I note the coord with what looks like a scala?! I've seen the notation (int2)(x,y), but that's not ansi c, that looks like scala, haha. Things seem conflicting to me. Thanks again!

In general you can read from images in three different ways: <ol> <li>direct pixel access, no sampling</li> <li>sampling, normalized coordinates</li> <li>sampling, integer coordinates</li> </ol> The first one is what you want, that is, you pass integer pixel coordinates like (10, 43) and it will return the contents of the image at that point, with no filtering whatsoever, as if it were a memory buffer. You can use the <code>read_image*()</code> family of functions which take no <code>sampler_t</code> param. The second one is what most people want from images, you specify normalized image coords between 0 and 1, and the return value is the interpolated image color at the specified point (so if your coordinates specify a point in between pixels, the color is interpolated based on surrounding pixel colors). The interpolation, and the way out-of-bounds coordinates are handled, are defined by the configuration of the <code>sampler_t</code> parameter you pass to the function. The third one is the same as the second one, except the texture coordinates are not normalized, and the sampler needs to be configured accordingly. In some sense the third way is closer to the first, and the only additional feature it provides is the ability to handle out-of-bounds pixel coordinates (for instance, by wrapping or clamping them) instead of you doing it manually. Finally, the different versions of each function, e.g. <code>read_imagef</code>, <code>read_imagei</code>, <code>read_imageui</code> are to be used depending on the pixel format of your image. If it contains floats (in each channel), use <code>read_imagef</code>, if it contains signed integers (in each channel), use <code>read_imagei</code>, etc... Writing to an image on the other hand is straightforward, there are <code>write_image{f,i,ui}()</code> functions that take an image object, integer pixel coordinates and a pixel color, all very easy. Note that you cannot read and write to the same image in the same kernel! (I don't know if recent OpenCL versions have changed that). In general I would recommend using a buffer if you are not going to be using images as actual images (i.e. input textures that you sample or output textures that you write to only once at the end of your kernel). <hr> About the <code>image2d_t</code>, <code>sampler_t</code> types, they are OpenCL "pseudo-objects" that you can pass into a kernel from C (they are reserved types). You send your image or your sampler from the C side into clSetKernelArg, and the kernel gets back a sampler_t or an image2d_t in the kernel's parameter list (just like you pass in a buffer object and it gets a pointer). The objects themselves cannot be meaningfully manipulated inside the kernel, they are just handles that you can send into the read_image/write_image functions, along with a few others. <hr> As for the "actual" low-level difference between images and buffers, GPU's often have specially reserved texture memory that is highly optimized for "read often, write once" access patterns, with special texture sampling hardware and texture caches to optimize scatter reads, mipmaps, etc.. On the CPU there is probably no underlying difference between an image and a buffer, and your runtime likely implements both as memory arrays while enforcing image semantics.

How do images work in opencl kernel?

Tags:

c++

c

gpgpu

opencl

I'm trying to find ways to copy multidimensional arrays from host to device in opencl and thought an approach was to use an image... which can be 1, 2, or 3 dimensional objects. However I'm confused because when reading a pixle from an array, they are using vector datatypes. Normally I would think double pointer, but it doesn't sound like that is what is meant by vector datatypes. Anyway here are my questions:

1) What is actually meant to vector datatype, why wouldn't we just specify 2 or 3 indices when denoting pixel coordinates? It looks like a single value such as float2 is being used to denote coordinates, but that makes no sense to me. I'm looking at the function read_imageui and read_image.

2) Can the input image just be a subset of the entire image and sampler be the subset of the input image? I don't understand how the coordinates are actually specified here either since read_image() only seams to take a single value for input and a single value for sampler.

3) If doing linear algebra, should I just bite the bullet and translate 1-D array data from the buffer into multi-dim arrays in opencl?

4) I'm still interested in images, so even if what I want to do is not best for images, could you still explain questions 1 and 2?

Thanks!

EDIT I wanted to refine my question and ask, in the following khronos documentation they define...

 int4 read_imagei (
    image2d_t image,
    sampler_t sampler,
    int2 coord)

But nowhere can I find what image2d_t's definition or structure is supposed to be. The samething for sampler_t and int2 coord. They seem like structs to me or pointers to structs since opencl is supposed to be based on ansi c, but what are the fields of these structs or how do I note the coord with what looks like a scala?! I've seen the notation (int2)(x,y), but that's not ansi c, that looks like scala, haha. Things seem conflicting to me. Thanks again!

294

asked May 04 '15 01:05

Dr.Knowitall

1 Answers

In general you can read from images in three different ways:

direct pixel access, no sampling
sampling, normalized coordinates
sampling, integer coordinates

The first one is what you want, that is, you pass integer pixel coordinates like (10, 43) and it will return the contents of the image at that point, with no filtering whatsoever, as if it were a memory buffer. You can use the read_image*() family of functions which take no sampler_t param.

The second one is what most people want from images, you specify normalized image coords between 0 and 1, and the return value is the interpolated image color at the specified point (so if your coordinates specify a point in between pixels, the color is interpolated based on surrounding pixel colors). The interpolation, and the way out-of-bounds coordinates are handled, are defined by the configuration of the sampler_t parameter you pass to the function.

The third one is the same as the second one, except the texture coordinates are not normalized, and the sampler needs to be configured accordingly. In some sense the third way is closer to the first, and the only additional feature it provides is the ability to handle out-of-bounds pixel coordinates (for instance, by wrapping or clamping them) instead of you doing it manually.

Finally, the different versions of each function, e.g. read_imagef, read_imagei, read_imageui are to be used depending on the pixel format of your image. If it contains floats (in each channel), use read_imagef, if it contains signed integers (in each channel), use read_imagei, etc...

Writing to an image on the other hand is straightforward, there are write_image{f,i,ui}() functions that take an image object, integer pixel coordinates and a pixel color, all very easy.

Note that you cannot read and write to the same image in the same kernel! (I don't know if recent OpenCL versions have changed that). In general I would recommend using a buffer if you are not going to be using images as actual images (i.e. input textures that you sample or output textures that you write to only once at the end of your kernel).

About the image2d_t, sampler_t types, they are OpenCL "pseudo-objects" that you can pass into a kernel from C (they are reserved types). You send your image or your sampler from the C side into clSetKernelArg, and the kernel gets back a sampler_t or an image2d_t in the kernel's parameter list (just like you pass in a buffer object and it gets a pointer). The objects themselves cannot be meaningfully manipulated inside the kernel, they are just handles that you can send into the read_image/write_image functions, along with a few others.

As for the "actual" low-level difference between images and buffers, GPU's often have specially reserved texture memory that is highly optimized for "read often, write once" access patterns, with special texture sampling hardware and texture caches to optimize scatter reads, mipmaps, etc..

On the CPU there is probably no underlying difference between an image and a buffer, and your runtime likely implements both as memory arrays while enforcing image semantics.

187

answered Sep 19 '22 18:09

Thomas

Related questions
                            
                                Iterating boost::icl::interval_set
                            
                                Should a operator const Base&() be used for an inaccessible base class?
                            
                                Why is there a blas subroutine (ISAMAX) for argmax abs but none for argmax?
                            
                                Animating the color of QML rectangles after a button is clicked
                            
                                c++ Best Way to Share state between threads
                            
                                Applying a rule on how to increment an enum
                            
                                Pass by reference avoids object slicing
                            
                                How to properly declare a friend of a nested class of a template class?
                            
                                How to initialize a queue with n default values?
                            
                                Destruction order of class data members?
                            
                                Successive calls to RegGetValue return two different sizes for the same string
                            
                                Is there bare c++ 11 (or boost) replacement for InterlockedExchangePointer?
                            
                                Memory allocation crashes the OS. Who's to blame beside the OS [closed]
                            
                                C++ int with preceding 0 changes entire value
                            
                                Inherit and overload default constructor
                            
                                Binding r-value to l-value reference is non-standard Microsoft C++ extension
                            
                                How to set number of threads in C++
                            
                                c++ set and shared_ptr
                            
                                how to force recompile when changing Makefile flags?
                            
                                data structure advice on c++

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With