I have two RTX 3090 GPUs, one from Gigabyte and one from Zotac. I am using them to run OpenACC/CUDA jobs on Linux Mint with the NVIDIA HPC SDK 23.1. After one application finishes running, nvidia-smi shows ERR! under fan speed and the GPU gets stuck in the P5 state. The only way to reset it is to restart the system. The error happens on the GPU in slot 0, i.e. the Gigabyte one. Any help?