How is GPU and memory utilization defined in nvidia-smi results?

Tags:

I am currently using a tool shipped with nvidia's driver 'nvidia-smi' for performance monitoring on GPU. When we use 'nvidia-smi -a', it will give the information of current GPU information, including GPU core and memory usage, temperature and so on like this:

==============NVSMI LOG==============

Timestamp : Tue

Feb 22 22:39:09 2011

Driver Version : 260.19.26

GPU 0:

    Product Name            : GeForce 8800 GTX
    PCI Device/Vendor ID    : 19110de
    PCI Location ID         : 0:4:0
    Board Serial            : 211561763875
    Display                 : Connected
    Temperature             : 55 C
    Fan Speed               : 47%
    Utilization
        GPU                 : 1%
        Memory              : 0%

I am curious about how are the GPU and memory Utilization defined? For example, GPU core's utilization is 47%. It means there are 47% of SMs active working? Or all the GPU cores are busy in 47% time while idle other 53% time? For memory, the utilization stands for the ratio between current bandwidth and max bandwidth, or the busy time ratio in last time unit?

746

asked Feb 23 '11 03:02

fflower

2 Answers

A post by a moderator on the NVIDIA forums says the GPU utilization and memory utilization figures are based on activity over the last second:

GPU busy is actually the percentage of time over the last second the SMs were busy, and the memory utilization is actually the percentage of bandwidth used during the last second. Full memory consumption statistics come with the next release.

188

answered Nov 05 '22 17:11

Matt

You can refer to this official API document: http://docs.nvidia.com/deploy/nvml-api/structnvmlUtilization__t.html#structnvmlUtilization__t

It says : "Percent of time over the past sample period during which one or more kernels was executing on the GPU."

answered Nov 05 '22 15:11

ackratos

Related questions
                            
                                Makefile for CUDA and C
                            
                                Solve small symmetric positive definite Ax = b on GPU only
                            
                                CUDA device stack and synchronization; SSY instruction
                            
                                cuda with mingw - updated
                            
                                undefined reference error for linking CUDA static or shared library with gcc
                            
                                Sorting algorithm with Cuda. Inside or outside kernels?
                            
                                GPU 2D shared memory dynamic allocation
                            
                                Uncrustify command for CUDA kernel
                            
                                CUDA: illegal combination of memory qualifiers
                            
                                Can I prefetch specific data to a specific cache level in a CUDA kernel?
                            
                                Where does Cuda kernel code reside on nvidia GPU?
                            
                                Best strategy for profiling memory usage of my code (open source) and 3rd party code(closed source)
                            
                                Tracking down cuda kernel register usage
                            
                                CUDA constant memory banks
                            
                                No CUDA-capable device is detected
                            
                                Is it possible to call cufft library calls in device function?
                            
                                Is it possible to have a persistent cuda kernel running and communicating with cpu asynchronously ?
                            
                                Is it possible to emulate a GPU for CUDA/OpenCL unit testing purposes?
                            
                                CUDA C using single precision flop on doubles
                            
                                CUDA without CUDA enabled gpu [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How is GPU and memory utilization defined in nvidia-smi results?

Tags:

cuda

gpu

nvidia

fflower

People also ask

2 Answers

Matt

ackratos

Recent Activity

Donate For Us