Hi all,
I have a workstation with two GPUs installed (Ubuntu 16.04, CUDA 10, cuDNN 7.4.2).
I’m trying to run YOLOv3 with darknet. It runs as expected on GPU 0, but on GPU 1 it fails unless I run it with sudo.
For example:
⟫ ./darknet -i 0 detector test cfg/coco.data cfg/yolov3.cfg yolov3.weights data/dog.jpg
[expected output]
⟫ sudo ./darknet -i 1 detector test cfg/coco.data cfg/yolov3.cfg yolov3.weights data/dog.jpg
[expected output]
⟫ ./darknet -i 1 detector test cfg/coco.data cfg/yolov3.cfg yolov3.weights data/dog.jpg
CUDA Error: invalid device ordinal
darknet: ./src/cuda.c:36: check_error: Assertion `0' failed.
Aborted (core dumped)
How can I run on GPU 1 without sudo?
⟫ nvidia-smi
Fri Feb 1 13:02:54 2019
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 410.79       Driver Version: 410.79       CUDA Version: 10.0     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  Quadro M4000        Off  | 00000000:02:00.0  On |                  N/A |
| 46%   35C    P8    13W / 120W |    183MiB /  8126MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+
|   1  Tesla K40c          Off  | 00000000:81:00.0 Off |                    0 |
| 23%   40C    P8    23W / 235W |      0MiB / 11441MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                       GPU Memory |
|  GPU       PID   Type   Process name                             Usage      |
|=============================================================================|
|    0      1557      G   /usr/lib/xorg/Xorg                           137MiB |
|    0      2373      G   compiz                                        42MiB |
+-----------------------------------------------------------------------------+
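
Since the failure only shows up without sudo, one thing that might be worth checking is whether my user can open the NVIDIA device nodes. This is only a guess on my part, not a confirmed cause. Below is a minimal Python sketch that lists the nodes and whether the current user can read and write them. The `/dev/nvidia*` pattern is the usual location on Linux, and the helper name `device_node_access` is just something I made up for illustration.

```python
import glob
import os
import stat


def device_node_access(pattern="/dev/nvidia*"):
    """Return (path, mode, readable, writable) for each path matching pattern.

    The default pattern is an assumption about where the driver creates
    its device nodes; adjust it if your system differs.
    """
    results = []
    for path in sorted(glob.glob(pattern)):
        # Human-readable permission string, e.g. 'crw-rw-rw-'
        mode = stat.filemode(os.stat(path).st_mode)
        # Access checks are done for the user running this script
        readable = os.access(path, os.R_OK)
        writable = os.access(path, os.W_OK)
        results.append((path, mode, readable, writable))
    return results


if __name__ == "__main__":
    for path, mode, readable, writable in device_node_access():
        print(f"{path}  {mode}  read={readable}  write={writable}")
```

If I run this as my normal user and again with sudo, a difference for one of the nodes (for example `/dev/nvidia1`) would at least point toward a permissions problem rather than something in darknet itself.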