Hi,
I’m not sure where to go with this, but hopefully someone can point me in the right direction.
I’m trying to view GPU usage stats with Data Center GPU Manager.
I'm running what I am led to believe is a basic test of the GPU, but it reports Max GPU Memory Used as 0 bytes.
dcgmi stats --jstart demojob -g 486
dcgmi stats --jstop demojob -g 486
dcgmi stats --job demojob -g 486
Successfully retrieved statistics for job: demojob.
+------------------------------------------------------------------------------+
| Summary                                                                      |
+====================================+=========================================+
|-----  Execution Stats  ------------+-----------------------------------------|
| Start Time                         | Mon Jan 20 17:04:19 2020                |
| End Time                           | Mon Jan 20 17:04:37 2020                |
| Total Execution Time (sec)         | 17.6                                    |
| No. of Processes                   | 0                                       |
+-----  Performance Stats  ----------+-----------------------------------------+
| Energy Consumed (Joules)           | 379                                     |
| Power Usage (Watts)                | Avg: 23.7039, Max: N/A, Min: N/A        |
| Max GPU Memory Used (bytes)        | 0                                       |
| Clocks and PCIe Performance        | Available per GPU in verbose mode       |
+-----  Event Stats  ----------------+-----------------------------------------+
| Single Bit ECC Errors              | Not Specified                           |
| Double Bit ECC Errors              | Not Specified                           |
| PCIe Replay Warnings               | Not Specified                           |
| Critical XID Errors                | 0                                       |
+-----  Slowdown Stats  -------------+-----------------------------------------+
| Due to - Power (%)                 | 0                                       |
|        - Thermal (%)               | 0                                       |
|        - Reliability (%)           | Not Supported                           |
|        - Board Limit (%)           | Not Supported                           |
|        - Low Utilization (%)       | Not Supported                           |
|        - Sync Boost (%)            | 0                                       |
+-----  Overall Health  -------------+-----------------------------------------+
| Overall Health                     | Healthy                                 |
+------------------------------------+-----------------------------------------+
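For anyone comparing runs, here is a minimal Python sketch that pulls the two relevant fields out of a summary like the one above. The names `parse_summary` and `diagnose` are hypothetical helpers, not part of DCGM, and the reading that "0 processes" explains "0 bytes" is only an assumption: it may simply be that nothing ran on the GPU between --jstart and --jstop.

```python
import re

# Excerpt of the summary printed above (only the rows of interest).
SAMPLE = """\
| No. of Processes                   | 0                                       |
| Energy Consumed (Joules)           | 379                                     |
| Max GPU Memory Used (bytes)        | 0                                       |
"""

# A data row looks like "| name | value |"; border rows do not match.
ROW = re.compile(r"^\|\s*(.+?)\s*\|\s*(.+?)\s*\|$")


def parse_summary(text):
    """Return {field name: value string} for every two-cell row."""
    fields = {}
    for line in text.splitlines():
        match = ROW.match(line.strip())
        if match:
            fields[match.group(1)] = match.group(2)
    return fields


def _as_int(value):
    """Convert a cell to int, treating non-numeric cells (e.g. N/A) as 0."""
    return int(value) if value.isdigit() else 0


def diagnose(fields):
    """Give a rough, assumption-laden reading of the two fields."""
    procs = _as_int(fields.get("No. of Processes", "0"))
    mem = _as_int(fields.get("Max GPU Memory Used (bytes)", "0"))
    if procs == 0:
        return "no GPU process was recorded during the job window"
    if mem == 0:
        return "processes were recorded but no memory usage was reported"
    return f"{procs} process(es), peak memory {mem} bytes"


if __name__ == "__main__":
    print(diagnose(parse_summary(SAMPLE)))
```

Feeding it the full output of `dcgmi stats --job demojob -g 486` should work the same way, since border and section-header rows are skipped.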
The card is a Titan V:
nvidia-smi
Mon Jan 20 17:06:54 2020
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 390.46                 Driver Version: 390.46                    |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  TITAN V             Off  | 00000000:65:00.0 Off |                  N/A |
| 28%   26C    P8    23W / 250W |      0MiB / 12064MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+
Maybe I am not using the correct method to test DCGM. If so, is there a simple way to show that it does record memory usage?
Many thanks in advance,