Advanced API Performance: SetStablePowerState

jwitsoe · June 28, 2022, 3:00pm

Originally published at: https://developer.nvidia.com/blog/advanced-api-performance-setstablepowerstate/

This post covers best practices for using SetStablePowerState on NVIDIA GPUs. To get a high and consistent frame rate in your applications, see all Advanced API Performance tips.

erez.eyal1 · February 2, 2023, 1:34pm

Thanks for the post!
After many tries, this use of nvidia-smi --lock functions gave us the persistent performance the project requires

It is very useful APIs when using the GPU as a DSP that need to work without any frequency lowering / idle behavior

erez.eyal1 · February 2, 2023, 1:39pm

Related to this post,
can you explain the different usage of the following Performance approaches:

Use “Nvidia Control Panel” GUI settings
Use nvidia-smi command line calls
Use nvml CUDA API

I prefers to use NVML to get the GPU persistent behavior, is it possible to get such persistent behavior only using nvml on GeForce RTX 3050 ?

rprescott · February 2, 2023, 6:26pm

Hi!

That’s great news.

Did you encounter differences between the article’s recommendations and your usage? I’d like to update it to match current uses. nvidia-smi isn’t guaranteed to be a fixed target, unfortunately and it’s been a while since I wrote this article!

Thanks, Ryan

rprescott · February 2, 2023, 6:31pm

Hello again-

I’m not familiar with what you’re referring to with 1/control panel, can you elaborate?

I’m also not certain about the differences between 2 and 3. I suspect on the backend they are poking the same driver components to achieve their goals, because there’s basically one place to set these things. Unfortunately, the best way in the near term to establish this is probably to do a bit of testing.

I’ll try to find the maintainers of these routes and report back if/when I get definitive answers.

Thanks, Ryan

erez.eyal1 · February 5, 2023, 7:19am

Hello,
This sequence from the post works perfectly to set GPU core & GPU memory clock frequencies

nvidia-smi --query-supported-clocks=timestamp,gpu_name,gpu_uuid,memory,graphics --format=csv`
nvidia-smi --lock-gpu-clocks=<core_clock_rate_from_csv>
nvidia-smi --lock-memory-clocks=<memory_clock_rate_from_csv>

erez.eyal1 · February 5, 2023, 7:26am

There is additional GPU performance option on:
Windows → “Nvidia Control Panel” → “3D setting” → “Manage 3D setting” → “Global Setting” / “Program Setting” → “Power management mode” = “Prefer maximum performance”

rprescott · February 6, 2023, 8:18pm

Ah, I would not rely on that setting. That’s more of a behavioral suggestion, it doesn’t pin the frequencies.

rprescott · February 6, 2023, 8:19pm

Excellent, so it looks like the commands are the same. Thanks!

daky · October 17, 2023, 2:42pm

I tried overclocking the gpu by locking the core clock (-lgc) above the default, but it isnt working. It only locks the core below the stock setting.
Anyone having similar issues or knows a solution?

rprescott · October 17, 2023, 3:00pm

Hi daky,

The utility is configured to work within design parameters as a safety precaution. There are other tools that will let you set clocks higher, but I wouldn’t recommend it. Changing settings outside of intended values can cause hardware failures, visual glitching, and other incorrect behavior.

Thanks, Ryan

daky · October 24, 2023, 8:24am

Hey, I did a bit more testing and I cant apply any overclocks, even within the predetermined buckets when the GPU is under load.

For example I can run nvidia-smi -lgc 2100 while idling and the command applies immediately. But if I put the GPU under load first (gaming, benchmarking) the core will raise to 2000 and if I run nvidia-smi -lgc 2100 then, nothing happens. The core will remain at 2000.
However lowering the value still works, so if run nvidia-smi -lgc 1800 the core will drop to 1800 immediately.

I have also tried raising the core delta via Afterburner and that applied without any issues and the GPU happily runs at 2100 so I doubt it is a safety issue since I have seen the GPU run at those numbers for a long time and all of the values are within the default bucket settings (2160).

rprescott · April 2, 2024, 10:02pm

Hi daky,

Sorry for the very late reply!

nvidia-smi is not a general overclocking utility with limited support on GeForce hardware. If you require changing clocks while applications are running, please use whatever means work for your needs. As I said previously, overclocking can lead to temporary misbehavior or permanent damage to hardware.

If you’re seeing the aforementioned behavior on RTX/Quadro or datacenter parts, please let us know.

Thanks,
Ryan

james.park3 · April 22, 2024, 8:49pm

If we lock the clocks, will they still be lowered if the GPU gets too hot? We don’t want to brick our GPUs by accident.

The article mentions using SetStablePowerState to get the stable GPU clock, but it also recommends not using SetStablePowerState because it doesn’t lock the memory clock. How can we query for the stable memory clock if we want to lock it? We can enumerate all the memory clock values, but that doesn’t tell us which one to pick.

hainguyen · June 6, 2024, 3:56am

Hi James,

Answering the first question: locking the clocks does not prevent the GPU’s safety mechanism from working. The GPU’s thermal control will override clock locking if GPU exceeds the safe operating temperature.

Working on the second part, will circle back once I have details.

Topic		Replies	Views
Stability Issues with GPU Inference on Older GPUs (e.g., 1080Ti) CUDA Programming and Performance	15	968	January 22, 2024
GPU overclocking tool CUDA Programming and Performance	33	80651	July 31, 2017
nvidia-smi not fully supported on GTX 1060 Linux	41	39174	January 17, 2018
Having Trouble OverClocking GTX 1070 CUDA Setup and Installation	22	33364	September 25, 2017
How to use CLI to set memory, CPU and powerlimit settings? Linux	4	40647	July 19, 2020
SM Clock on RTX A6000 never reaches max frequency CUDA Programming and Performance nvidia-smi	4	5036	February 18, 2022
Clock frequency management on non-proffesional CC 3.5 cards? CUDA Programming and Performance	8	2898	February 14, 2014
Kernel module option NVreg_RegistryDwords for PowerMizerEnable doesnt work on 530.41.03 Linux	20	3379	June 12, 2024
1080FE powersave frequency scaling CUDA Programming and Performance	4	1491	November 28, 2016
NVIDIA GPU 3090 performance mode setting CUDA Programming and Performance	18	638	October 21, 2024

Advanced API Performance: SetStablePowerState

Related topics