In OpenCL, does having multiple work dimensions in a work group/item provides a speedup? If so, please point to some code or link.
No, in fact it solws execution down, since the runtime has to perform the local and global id mapping.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With