We have installed two Tesla K80 GPUs for compute, but both stop working a couple of minutes after we log in to Ubuntu. Both K80s look fine at first, but the “nvidia-smi” command soon reports an error state for them. We can also see that the K80 temperatures climb extremely high just before the cards fail. The motherboard is an ASUS X99-E WS. Can anyone tell us where the problem is? Is the motherboard incompatible, or is the cooling inadequate (currently there are two fans mounted above the cards)?
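In case it helps anyone reproduce this, below is a rough sketch of how the temperatures could be logged until the cards drop out. It polls `nvidia-smi` with its `--query-gpu` and `--format=csv` options. The function names, the 5-second interval, and the 80 °C warning value are made up for illustration (the warning value is not an NVIDIA limit), and we are assuming a card in the error state reports some non-numeric text instead of a temperature.

```python
import csv
import io
import subprocess
import time

# Real nvidia-smi options; the selection of fields is our own choice.
QUERY = [
    "nvidia-smi",
    "--query-gpu=index,name,temperature.gpu",
    "--format=csv,noheader,nounits",
]

# Hypothetical warning level for this sketch, NOT an official K80 limit.
WARN_AT_C = 80


def parse_temps(text):
    """Turn nvidia-smi CSV output into (index, name, temp_c) tuples.

    temp_c is None when the field is not a number, which is what we
    assume happens once a GPU has entered the error state.
    """
    rows = []
    for fields in csv.reader(io.StringIO(text), skipinitialspace=True):
        if len(fields) != 3:
            continue  # skip blank or unexpected lines
        index, name, temp = (f.strip() for f in fields)
        try:
            temp_c = int(temp)
        except ValueError:
            temp_c = None
        rows.append((index, name, temp_c))
    return rows


def read_temps():
    """Run nvidia-smi once and return the parsed rows."""
    result = subprocess.run(QUERY, capture_output=True, text=True)
    return parse_temps(result.stdout)


if __name__ == "__main__":
    # Poll every 5 seconds (arbitrary interval) until interrupted.
    while True:
        stamp = time.strftime("%H:%M:%S")
        for index, name, temp_c in read_temps():
            if temp_c is None:
                print(f"{stamp} GPU {index} ({name}): no temperature reported")
            elif temp_c >= WARN_AT_C:
                print(f"{stamp} GPU {index} ({name}): {temp_c} C  <-- hot")
            else:
                print(f"{stamp} GPU {index} ({name}): {temp_c} C")
        time.sleep(5)
```

Running this from a terminal right after boot should show how quickly each K80 heats up and at roughly what reading it stops responding.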