Export CUDA nvprof output to the Visual Profiler

Tags:

I would like to extract the data from my GPU application in order to check its limits. I have to use nvprof because the application runs on a remote server, so I should create a file to import locally in the Visual Profiler. I've tried to create the file with nvprof -o file_name <app> <params> and with nvprof --analysis-metrics --output-profile file_name <app> <params> but when I import these files on the Visual Profiler, in the Analysis section some fields are empty: "insufficient global memory load data", "insufficient global memory store data", "insufficient kernel SM data"... . How could I generate a file (or more) in order to have all the information for the Analysis section? I compile the cuda code with nvcc with the flags -lineinfo -arch compute_20 -code sm_20 --ptxas-options=-v. These are some examples of empty fields: enter image description here

562

asked Jan 21 '16 21:01

Stefano Sandonà

1 Answers

You can try to add a session instead of importing prof file into the visual profiler. I run into the similar problem. what I did is adding a session according to the instructions in here, and you will be able to see all the information.

170

answered Sep 28 '22 00:09

doody1986

Related questions
                            
                                A simple c++ HelloWorld with cuda
                            
                                Pycuda Blocks and Grids to work with big datas
                            
                                Could not load library cudnn_cnn_infer64_8.dll. Error code 126
                            
                                Operations on arbitrary value types
                            
                                How to transpose a matrix in CUDA/cublas?
                            
                                How to set CUDA compiler flags in Visual Studio 2010?
                            
                                CUDA: nvcc cannot be detected though installed
                            
                                Inter-block barrier on CUDA
                            
                                C++ 2.5 bytes (20-bit) integer
                            
                                math_functions.hpp not found when using CUDA with Eigen
                            
                                nvlink error when linking CUDA code against CUDA static library - CMake [duplicate]
                            
                                Simple console program will not exit if cudaMalloc is called
                            
                                What does NVIDIA GPU do with device memory 0x0?
                            
                                pycuda seems nondeterministic
                            
                                CUDA Primes Generation
                            
                                In CUDA, how to copy an array of device pointers to device memory?
                            
                                float1 vs float in CUDA

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Export CUDA nvprof output to the Visual Profiler

Tags:

cuda

nvprof

nvvp

Stefano Sandonà

People also ask

1 Answers

doody1986

Recent Activity

Donate For Us