One of my colleagues is able to build with PGI19.4 on his local workstation but is encountering a crash at runtime. I was able to reproduce the same error with pgaccelinfo:
nint0112:~/BUILD21/> /common/pgi/linux86-64/19.4/bin/pgaccelinfo -v
CUDA Driver Version: 10020
NVRM version: NVIDIA UNIX x86_64 Kernel Module 440.64 Fri Feb 21 01:17:26 UTC 2020
could not initialize CUDA runtime, error code=999
No accelerators found.
Check the permissions on your CUDA device
Interestingly, nvidia-smi does not indicate any problems
nint0112:~/BUILD21> nvidia-smi
Wed Mar 18 23:19:42 2020
±----------------------------------------------------------------------------+
| NVIDIA-SMI 440.64 Driver Version: 440.64 CUDA Version: 10.2 |
|-------------------------------±---------------------±---------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
|===============================+======================+======================|
| 0 Quadro P2000 Off | 00000000:21:00.0 Off | N/A |
| 52% 45C P0 19W / 75W | 325MiB / 5050MiB | 0% Default |
±------------------------------±---------------------±---------------------+
It looks like the local nvidia drivers are quite new, version 440.64 with CUDA 10.2. I don’t think this should be a problem for a PGI19.4 executable. Is this correct? I saw some other postings that mentioned some file permission issues, but I don’t see any problems in this regard.
int0112:~/BUILD21> ls -lah /dev/nvidia0
crw-rw-rw- 1 root root 195, 0 Mär 18 08:22 /dev/nvidia0
Are you aware of any other workarounds for this issue?
Thanks,
David