Hi all,
Here is what I tried, and my GPUs are throwing an unknown error. Is there a way to disable persistence mode and still use the GPUs?
[root@p95a30 ~]# nvidia-smi -pm 0
Disabled persistence mode for GPU 00000004:04:00.0.
Disabled persistence mode for GPU 00000004:05:00.0.
Disabled persistence mode for GPU 00000035:03:00.0.
Disabled persistence mode for GPU 00000035:04:00.0.
All done.
[root@p95a30 ~]# nvidia-smi
Tue Sep 18 15:21:45 2018
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 396.44                 Driver Version: 396.44                    |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  Tesla V100-SXM2...  Off  | 00000004:04:00.0 Off |                    0 |
| N/A   25C    P0    49W / 300W |       Unknown Error  |      0%      Default |
+-------------------------------+----------------------+----------------------+
|   1  Tesla V100-SXM2...  Off  | 00000004:05:00.0 Off |                    0 |
| N/A   25C    P0    48W / 300W |       Unknown Error  |      0%      Default |
+-------------------------------+----------------------+----------------------+
|   2  Tesla V100-SXM2...  Off  | 00000035:03:00.0 Off |                    0 |
| N/A   25C    P0    47W / 300W |       Unknown Error  |      0%      Default |
+-------------------------------+----------------------+----------------------+
|   3  Tesla V100-SXM2...  Off  | 00000035:04:00.0 Off |                    0 |
| N/A   25C    P0    48W / 300W |       Unknown Error  |      0%      Default |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                       GPU Memory |
|  GPU       PID   Type   Process name                             Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+