Hi everyone. I installed DIGITS on my AWS cloud server with a K80 GPU, but when I try to train a model, this message appears:
Started BatchTransformer thread 87
Loading mean file from: /workspace/jobs/20190818-080628-28c8/train_db/mean.binaryproto
Loading mean file from: /workspace/jobs/20190818-080628-28c8/train_db/mean.binaryproto
Loading mean file from: /workspace/jobs/20190818-080628-28c8/train_db/mean.binaryproto
Data Reader threads: 3, out queues: 12, depth: 10
{0} Starting 3 internal thread(s) on device 0
Started internal thread 91 on device 0, rank 0
Opened lmdb /workspace/jobs/20190818-080628-28c8/train_db/features
Started internal thread 92 on device 0, rank 0
Opened lmdb /workspace/jobs/20190818-080628-28c8/train_db/features
Started internal thread 93 on device 0, rank 0
Opened lmdb /workspace/jobs/20190818-080628-28c8/train_db/features
Output data size: 10, 3, 384, 1248
Parser threads: 3 (auto)
Transformer threads: 4 (auto)
Started internal thread 78 on device 0, rank 0
Started internal thread 79 on device 0, rank 0
Started internal thread 82 on device 0, rank 0
Started internal thread 80 on device 0, rank 0
Check failed: error == cudaSuccess (209 vs. 0) no kernel image is available for execution on the device
I have CUDA 10.1 and NVIDIA-SMI 418 installed:
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 418.87.00    Driver Version: 418.87.00    CUDA Version: 10.1     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  Tesla K80           On   | 00000000:00:1E.0 Off |                    0 |
| N/A   64C    P0    58W / 149W |     92MiB / 11441MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes:                                                       GPU Memory |
|  GPU       PID   Type   Process name                             Usage      |
|=============================================================================|
|    0      1849      C   python                                      81MiB   |
+-----------------------------------------------------------------------------+
Any ideas?