How to get usage of decoder only?

kungedefaxing · March 11, 2025, 2:33am

Hi, team!

Now I am using Tesla L4 card for both video decoding and inference.
As I know, they use different parts of the card, so I want to get the usage of them seperately.
Besides, I know there are four decoder units of Tesla L4 card, and can I get each usage of them?

Many thanks!

ddesouza · April 4, 2025, 9:15am

Hi @kungedefaxing, could you try the following command line in a terminal:

nvidia-smi dmon -s puct --gpm-metrics 30,31,32,33,34,35,36,37

Best regards,

Diego

kungedefaxing · April 10, 2025, 2:53am

Thanks a lot! Looks it is the solution!
Besides, do you know how to monitor the tensor cores usage only?
Because I want to monitor the usage of video stream decoding and model inference seperately

Looking forward to your reply!

ddesouza · April 11, 2025, 9:07am

Hi @kungedefaxing, you can select the options that you want listed here:

❯ nvidia-smi dmon -h

    GPU statistics are displayed in scrolling format with one line
    per sampling interval. Metrics to be monitored can be adjusted
    based on the width of terminal window. Monitoring is limited to
    a maximum of 16 devices. If no devices are specified, then up to
    first 16 supported devices under natural enumeration (starting
    with GPU index 0) are used for monitoring purpose.
    It is supported on Tesla, GRID, Quadro and limited GeForce products
    for Kepler or newer GPUs under x64 and ppc64 bare metal Linux.
    Note: On MIG-enabled GPUs, querying the utilization of encoder,
    decoder, jpeg, ofa, gpu, and memory is not currently supported.

    Usage: nvidia-smi dmon [options]

    Options include:
    [-i | --id]:          Comma separated Enumeration index, PCI bus ID or UUID
    [-d | --delay]:       Collection delay/interval in seconds [default=1sec]
    [-c | --count]:       Collect specified number of samples and exit
    [-s | --select]:      One or more metrics [default=puc]
                          Can be any of the following:
                              p - Power Usage and Temperature
                              u - Utilization
                              c - Proc and Mem Clocks
                              v - Power and Thermal Violations
                              m - FB, Bar1 and CC Protected Memory
                              e - ECC Errors and PCIe Replay errors
                              t - PCIe Rx and Tx Throughput
    [N/A | --gpm-metrics]: Comma-separated list of GPM metrics (no space in between) to watch
                           Available metrics:
                               Graphics Activity       = 1
                               SM Activity             = 2
                               SM Occupancy            = 3
                               Integer Activity        = 4
                               Tensor Activity         = 5
                               DFMA Tensor Activity    = 6
                               HMMA Tensor Activity    = 7
                               IMMA Tensor Activity    = 9
                               DRAM Activity           = 10
                               FP64 Activity           = 11
                               FP32 Activity           = 12
                               FP16 Activity           = 13
                               PCIe TX                 = 20
                               PCIe RX                 = 21
                               NVDEC 0-7 Activity      = 30-37
                               NVJPG 0-7 Activity      = 40-47
                               NVOFA 0 Activity        = 50
                               NVLink Total RX         = 60
                               NVLink Total TX         = 61
                               NVLink L0-17 RX         = 62,64,66,...,96
                               NVLink L0-17 TX         = 63,65,67,...,97

    [N/A | --gpm-options]: options of which level of GPM metrics to monitor:
                              d  - Display Device level GPM Metrics only
                              m  - Display MIG level GPM Metrics only
                              dm - Display both Device and MIG level GPM Metrics only
                              md - Display both Device and MIG level GPM Metrics only
    [-o | --options]:     One or more from the following:
                              D - Include Date (YYYYMMDD) in scrolling output
                              T - Include Time (HH:MM:SS) in scrolling output
    [-f | --filename]:    Log to a specified file, rather than to stdout
    [-h | --help]:        Display help information
    [N/A | --format]:     Output format specifiers:
                               csv - Format dmon output as a CSV
                               nounit - Remove units line from dmon output
                               noheader - Remove heading line from dmon output

Maybe you are interested in something this:

nvidia-smi dmon --gpm-metrics 5,6,7,9

Feel free to change as you wish.

kungedefaxing · June 29, 2025, 10:50am

Thanks a lot! That really helps!

kungedefaxing · July 7, 2025, 6:40am

Hi @ddesouza!

I have a question regarding decoder unit utilization on my RTX 5060 Ti GPU.

I have two decoder units in use, and by running the command you previously provided, I can monitor the usage of different decoder units. However, I noticed something unusual: while Decoder Unit 0 consistently shows 100% utilization, Decoder Units 1 through 7 always report 0% usage, even when decoding workloads are active.

Could you help explain why this imbalance occurs? Is this expected behavior, or could it indicate a configuration issue?

Looking forward to your insights.

Best regards,

ddesouza · July 7, 2025, 6:57am

Hi @kungedefaxing, although you can have multiple decoder sessions open in parallel (software), the NVIDIA RTX 5060 Ti GPU only has one NVDEC (hardware). NVIDIA L4 GPU, on the other hand, has four NVDECs. The command line I provided shows the hardware utilization. That is why it only shows one being used for the 5060 Ti. No matter how many decoder sessions you open, they will share the same NVDEC in this case.

system · July 21, 2025, 6:58am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
How to get usage of decoder only? Visual Profiler and nvprof	4	454	April 25, 2025
General queries Jetson Xavier NX tensorrt	6	501	October 18, 2021
Get tensor core usage through nvml System Management and Monitoring (NVML)	4	2369	December 17, 2022
GPU usage monitoring on TX1 Jetson TX1	10	19582	October 12, 2017
Inconsistent GPU utilization returned by nvidia-smi System Management and Monitoring (NVML)	0	1413	July 2, 2020
Some basic query General Topics and Other SDKs	0	302	November 21, 2020
Nvidia codec sdk for 4K@60fps video Video Processing & Optical Flow gstreamer	0	632	March 5, 2021
FFMPEG, NVDEC Load measure Jetson Nano ffmpeg	4	774	October 15, 2021
Why do I see GPU memory usage when using GStreamer for MP4 decoding on Jetson NX? Where is GPU utilized in hardware decoding? Jetson TX2 decoder	4	396	January 1, 2025
"tools" to monitor Tensor core usage System Management and Monitoring (NVML)	1	2316	December 19, 2022

How to get usage of decoder only?

Related topics