I would like to extract profiling data from my GPU application in order to check its limits. The application runs on a remote server, so I have to use nvprof to create a file that I can import locally into the Visual Profiler. I've tried to create the file with
nvprof -o file_name <app> <params>
and with
nvprof --analysis-metrics --output-profile file_name <app> <params>
but when I import these files into the Visual Profiler, some fields in the Analysis section are empty: "insufficient global memory load data", "insufficient global memory store data", "insufficient kernel SM data", and so on. How can I generate a file (or several files) that contains all the information needed for the Analysis section? I compile the CUDA code with nvcc using the flags
-lineinfo -arch compute_20 -code sm_20 --ptxas-options=-v
These are some examples of empty fields: [screenshots not included]
Profiling overview: the Visual Profiler is a graphical profiling tool that displays a timeline of your application's CPU and GPU activity and includes an automated analysis engine to identify optimization opportunities.
nvprof is a lightweight command-line profiler, with no GUI, available for Linux, Windows, and Mac OS. It lets you collect and view profiling data for CUDA-related activity on both the CPU and the GPU, including kernel execution and memory transfers.
You can try adding a session instead of importing the nvprof file into the Visual Profiler. I ran into a similar problem. What I did was add a session according to the instructions here, and then you will be able to see all the information.
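If the application really can only be run on the remote server, another thing that may be worth trying is collecting two separate nvprof files there: one with the timeline only, and one with the events and metrics that the analysis system reads. As far as I know, the Visual Profiler import dialog accepts a timeline data file plus one or more event/metric data files in the same session. The sketch below only illustrates that idea. The output names timeline.nvprof and analysis.nvprof, the helper functions, and the DRY_RUN switch are all my own assumptions, and ./my_app param1 stands in for <app> <params>. Older toolkits spell -o as --output-profile, as in the question.

```shell
#!/bin/sh
# Sketch: collect two nvprof output files on the remote server.
# The file names (timeline.nvprof, analysis.nvprof) are hypothetical.
# DRY_RUN defaults to 1, so the commands are only printed;
# set DRY_RUN=0 to actually run them.
DRY_RUN="${DRY_RUN:-1}"

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

collect_profiles() {
    # Pass 1: timeline only, without events or metrics.
    run nvprof -o timeline.nvprof "$@"
    # Pass 2: events and metrics for the analysis system.
    # Kernels are replayed several times, so this pass is much slower.
    run nvprof --analysis-metrics -o analysis.nvprof "$@"
}

collect_profiles ./my_app param1
```

After copying both files to the local machine, the first would go in as the timeline data file and the second as an event/metric data file when importing. I have not checked this on the compute capability 2.0 setup from the question, so treat it as something to test rather than a confirmed fix.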