Hi,
I just upgraded my debian server from bullseye (11) to bookworm (12).
This resulted in new default debian nvidia drivers being used.
Debian 11: 470.182.03 → Debian 12: 525.105.17
With the new driver Idle power won’t go back to P8 with 7 Watts power usage instead it is consuming 20 Watts at idle now.
When i reboot the system it will stay at 7 Watts first but once i load my LLM Model it won’t idle at 7 Watts anymore.
The RTX 3090 card is connected to a headless system attached via external usb pci-e 2.0 1x breakout adapter from 16x slot no x.org or nouveau kernel driver running and persistence mode was on.
It got a LLM Model loaded not being used (0%).
Any ideas how i could fix this without going back to old drivers?
Which system dumps or other nvidia-smi outputs are needed to analyze this driver issue further?
Update: When i disable the LLM and then run “nvidia-smi -r” power is back to 7-8 Watts.
Display Mode : Disabled
Display Active : Disabled
Persistence Mode : Enabled
nvidia-smi output:
Sat Jun 17 22:40:38 2023
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 470.182.03 Driver Version: 470.1 82.03 CUDA Version: 11.4 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
| 0 NVIDIA GeForce ... On | 00000000:03:00.0 Off | N/A |
| 0% 39C P8 7W / 350W | 18520MiB / 24268MiB | 0% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=============================================================================|
| 0 N/A N/A 58715 C python 16952MiB |
+-----------------------------------------------------------------------------+
Sun Jun 18 18:47:44 2023
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 525.105.17 Driver Version: 525.105.17 CUDA Version: 12.0 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
| 0 NVIDIA GeForce ... On | 00000000:03:00.0 Off | N/A |
| 0% 56C P8 20W / 350W | 16955MiB / 24576MiB | 0% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=============================================================================|
| 0 N/A N/A 35226 C python 16952MiB |
+-----------------------------------------------------------------------------+