I've been playing around with the NVIDIA profiler (nvprof) and there are two particular metrics which I do not understand:
inst_inter_thread_communication
Number of inter-thread communication instructions executed by non-predicated threads
inst_misc
Number of miscellaneous instructions executed by non-predicated threads
I'm just wondering what instructions would be inter-thread communication instructions and which instructions would fall under miscellaneous.
Reference: http://docs.nvidia.com/cuda/profiler-users-guide/#metrics-reference
The SASS instructions that fall into the two categories are as follows:
inst_inter_thread_communication
inst_misc
The document CUDA Binary Utilities section Instruction Set Reference contains a brief description of the SASS instructions. There is close to a 1:1 relationship between SASS and PTX so you can also review the PTX ISA manual.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With