GPU latency


I have an application where I need to perform DNN inference with the lowest possible latency. However, image samples arrive with delays (~2 s) that cause the GPU to lower its clocks. When a new sample arrives, it is hit with some extra milliseconds of latency. I can solve the issue by setting fixed clocks with nvidia-smi, but that requires admin rights. Is there a better solution?


Fixing the clocks with nvidia-smi is the canonical solution; everything else is going to be hacky. There is no programmatic way for application code to configure the GPU's dynamic clock and power-state management.
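For reference, the nvidia-smi approach looks like the following (root/admin required; the clock value below is purely illustrative, query `nvidia-smi -q -d SUPPORTED_CLOCKS` to see what your particular GPU accepts):

```shell
# Enable persistence mode so the setting survives when no client is attached
sudo nvidia-smi -pm 1

# Lock the graphics clocks to a fixed range (MHz); 1500 is an example value
sudo nvidia-smi --lock-gpu-clocks=1500,1500

# Later, return the GPU to default dynamic clock management
sudo nvidia-smi --reset-gpu-clocks
```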

For example, you could keep sending the same image through your GPU kernel repeatedly until a new sample arrives, then send that one repeatedly, and so on. The problem with that: while the redundant activity keeps the GPU clocks high, it also imposes a worst-case added latency equal to one kernel execution time, since a new sample may arrive just after a redundant launch has started.

Using the same principle, you could instead keep issuing null (empty) kernels to the GPU while waiting for a new image sample, incurring at most the launch-and-execution latency of a null kernel, which is on the order of 3 to 5 microseconds on modern hardware.

Pondering it some more, I think the idea of constantly issuing null kernels to the GPU needs modification. You do not want your image-processing kernel queued up behind hundreds of null kernels already in the pipeline. To avoid this, issue a cudaDeviceSynchronize() after each null-kernel launch to "flush the pipeline". That adds roughly another 20 microseconds of delay per iteration, for a total of about 25 microseconds of worst-case added latency. I am quoting these numbers from memory, so it is best to run some experiments to determine the actual delay on your system.
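A minimal sketch of that keep-warm loop, meant to run on a dedicated host thread. The kernel name, the atomic flag, and the polling structure are my own illustration of the idea, not a fixed recipe:

```cuda
#include <cuda_runtime.h>
#include <atomic>

// Empty kernel: its only purpose is to generate enough GPU activity
// that power management does not drop the clocks while we wait.
__global__ void nullKernel() {}

// Hypothetical flag, set by the capture thread when a new image arrives.
std::atomic<bool> newSampleReady{false};

void keepWarmLoop()
{
    while (!newSampleReady.load(std::memory_order_acquire)) {
        nullKernel<<<1, 1>>>();
        // Flush the pipeline after every launch so the real inference
        // kernel never queues up behind a backlog of null kernels.
        // This adds ~20 us per iteration but bounds the worst-case
        // added latency at roughly one null-kernel round trip.
        cudaDeviceSynchronize();
    }
    newSampleReady.store(false, std::memory_order_release);
    // ...launch the real inference kernel here...
}
```

One design note: keeping this loop on its own host thread means the thread that receives images only has to set the flag, and the inference launch itself sees a GPU that is already at high clocks with an empty work queue.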