Hello everyone,
I’m looking for some help with an issue I’m having regarding overheating graphics cards (3080 Ti, Blower Design). I currently have three graphics cards installed in my system, and two of them are from the same manufacturers. Despite my efforts to adjust the fan control settings, the graphics cards continue to overheat.
The fan control has been changed over
DISPLAY=:0 XAUTHORITY=/run/user/125/gdm/Xauthority nvidia-settings -a '[gpu:0-2]/GPUFanControlState=1'
DISPLAY=:0 XAUTHORITY=/run/user/125/gdm/Xauthority nvidia-settings -a '[fan:0-4]/GPUTargetFanSpeed=100'
I’ve tried adjusting the fan control settings to increase the fan speed, but the settings seem to get lost every time I restart the system. I’m not sure what’s causing this issue, and I’m hoping someone can offer some advice on how to fix it. I also noticed the numbers are inconsistent …
I’ve also checked to make sure the airflow in my system is adequate. However, the cards still seem to run hot. The system runs on Ubuntu 22.04 LTS. Drivers are used by standard packaging.
If anyone has experienced a similar issue or has any suggestions on how to resolve it, I would greatly appreciate your help. Thank you in advance for your time and expertise!
Overheating before:
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 525.85.12 Driver Version: 525.85.12 CUDA Version: 12.0 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
| 0 NVIDIA GeForce ... Off | 00000000:01:00.0 Off | N/A |
| 38% 30C P2 344W / 350W| 3167MiB / 12288MiB | 100% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
| 1 NVIDIA GeForce ... Off | 00000000:02:00.0 Off | N/A |
|43% 89C P2 235W / 350W | 3342MiB / 12288MiB | 100% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
| 2 NVIDIA GeForce ... Off | 00000000:03:00.0 Off | N/A |
|59% 75C P2 247W / 250W | 3047MiB / 11264MiB | 100% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
After changing the fan control:
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 525.85.12 Driver Version: 525.85.12 CUDA Version: 12.0 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
| 0 NVIDIA GeForce ... Off | 00000000:01:00.0 Off | N/A |
| 55% 30C P8 37W / 350W | 38MiB / 12288MiB | 0% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
| 1 NVIDIA GeForce ... Off | 00000000:02:00.0 Off | N/A |
|100% 32C P8 42W / 350W | 214MiB / 12288MiB | 0% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
| 2 NVIDIA GeForce ... Off | 00000000:03:00.0 Off | N/A |
|100% 30C P8 10W / 250W | 6MiB / 11264MiB | 0% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
Running them again
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 525.85.12 Driver Version: 525.85.12 CUDA Version: 12.0 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
| 0 NVIDIA GeForce ... Off | 00000000:01:00.0 Off | N/A |
| 30% 89C P2 236W / 350W | 3167MiB / 12288MiB | 99% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
| 1 NVIDIA GeForce ... Off | 00000000:02:00.0 Off | N/A |
|100% 80C P2 327W / 350W | 3343MiB / 12288MiB | 100% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
| 2 NVIDIA GeForce ... Off | 00000000:03:00.0 Off | N/A |
|130% 84C P8 244W / 250W | 3047MiB / 11264MiB | 100% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+