In plain CUDA programs, threads can print messages if we include cuPrintf.h, but I can't find this explained anywhere for PyCUDA. How can I do this in PyCUDA?
On Compute Capability 2.0 and later GPUs, cuPrintf.h is discouraged in favor of CUDA's built-in printf(). To use it, #include <stdio.h> in the kernel source and call printf() just like on the host.
The PyCUDA wiki has a specific example of this.
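For reference, here is a minimal sketch of what that looks like from PyCUDA. It is an illustration in the same spirit as the wiki example, not a copy of it: the kernel name `say_hi`, the helper `run_printf_demo`, and the 4x4 block size are arbitrary choices of mine, and it assumes PyCUDA is installed on a machine with a Compute Capability 2.0+ GPU.

```python
# Sketch only: requires PyCUDA and an NVIDIA GPU of Compute Capability 2.0+.
# The names `say_hi` and `run_printf_demo` and the 4x4 block are illustrative.

# Raw string so that "\n" reaches the CUDA compiler unchanged.
KERNEL_SOURCE = r"""
#include <stdio.h>

__global__ void say_hi()
{
    // Device-side printf: each thread reports its own index.
    printf("I am thread %d.%d\n", threadIdx.x, threadIdx.y);
}
"""


def run_printf_demo():
    """Compile the kernel and launch a single 4x4 block of threads."""
    # Imported here so this file can still be loaded on a machine without a GPU.
    import pycuda.autoinit  # noqa: F401  (creates a CUDA context on import)
    import pycuda.driver as cuda
    from pycuda.compiler import SourceModule

    module = SourceModule(KERNEL_SOURCE)
    say_hi = module.get_function("say_hi")
    say_hi(block=(4, 4, 1))

    # Device printf output is buffered; synchronizing flushes it to stdout.
    cuda.Context.synchronize()
```

Calling `run_printf_demo()` should print one line per thread, in no guaranteed order. If nothing appears, the usual culprit is that the context was never synchronized before the program exited.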