Dear forum,
I am using four Tesla K40c cards in a CentOS 6 workstation, and I have a question about the nvidia-smi output.
It always reports the warning shown on the last line of this output:
+------------------------------------------------------+
| NVIDIA-SMI 340.65     Driver Version: 340.65         |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  Tesla K40c          Off  | 0000:02:00.0     Off |                    0 |
| 23%   34C    P0    62W / 235W |     23MiB / 11519MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+
|   1  Tesla K40c          Off  | 0000:03:00.0     Off |                    0 |
| 23%   30C    P0    61W / 235W |     23MiB / 11519MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+
|   2  Tesla K40c          Off  | 0000:82:00.0     Off |                    0 |
| 23%   30C    P0    61W / 235W |     23MiB / 11519MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+
|   3  Tesla K40c          Off  | 0000:83:00.0     Off |                    0 |
| 23%   29C    P0    67W / 235W |     23MiB / 11519MiB |     77%      Default |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Compute processes:                                               GPU Memory |
|  GPU       PID  Process name                                     Usage      |
|=============================================================================|
|  No running compute processes found                                         |
+-----------------------------------------------------------------------------+
WARNING: infoROM is corrupted at gpu 0000:03:00.0
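I reinstalled the OS, changed the CUDA driver version, and tried a different server, but the warning persists.
The warning comes from only one card (bus ID 0000:03:00.0, which is GPU 1 in the table above). That card otherwise seems to work fine.
Does anyone know what this warning means?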
I would like to fix it if possible.
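
For anyone who wants to check their own machines, here is a minimal Python sketch of how the bus ID in the warning can be matched to a GPU index in the table. It assumes the text layout printed by this driver version (340.65), and other versions may format the table differently. The function name `corrupted_gpu_indices` and both regular expressions are only illustrative; they are not part of any NVIDIA tool.

```python
import re

# Assumed row layout: "|   1  Tesla K40c   Off  | 0000:03:00.0   Off | ... |"
ROW = re.compile(
    r"^\|\s+(\d+)\s+.*?\|\s+"
    r"([0-9A-Fa-f]{4}:[0-9A-Fa-f]{2}:[0-9A-Fa-f]{2}\.[0-9A-Fa-f])\s"
)
# Assumed warning wording, copied from the output above.
WARN = re.compile(r"infoROM is corrupted at gpu\s+(\S+)")


def corrupted_gpu_indices(smi_text):
    """Return the GPU indices whose bus ID appears in an infoROM warning."""
    bus_to_index = {}
    flagged = []
    for line in smi_text.splitlines():
        row = ROW.match(line)
        if row:
            bus_to_index[row.group(2).lower()] = int(row.group(1))
        warn = WARN.search(line)
        if warn:
            flagged.append(warn.group(1).lower())
    # Ignore warnings whose bus ID does not appear in the table.
    return [bus_to_index[bus] for bus in flagged if bus in bus_to_index]


# Trimmed sample taken from the output in this post.
sample = """\
|   0  Tesla K40c          Off  | 0000:02:00.0     Off |                    0 |
| 23%   34C    P0    62W / 235W |     23MiB / 11519MiB |      0%      Default |
|   1  Tesla K40c          Off  | 0000:03:00.0     Off |                    0 |
| 23%   30C    P0    61W / 235W |     23MiB / 11519MiB |      0%      Default |
WARNING: infoROM is corrupted at gpu 0000:03:00.0
"""

print(corrupted_gpu_indices(sample))
```

To run it on a live system, the text could come from `subprocess.check_output(["nvidia-smi"])` instead of the sample string. I have not checked whether the warning is printed to stdout or stderr.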