Hi,
My CUDA installation does not recognize the Tesla P4 card. Here is the output for nvaccelinfo -v
$ nvaccelinfo -v
NVRM version: NVIDIA UNIX x86_64 Kernel Module 535.104.05 Sat Aug 19 01:15:15 UTC 2023
could not initialize CUDA runtime, error code=34
No accelerators found.
Check the permissions on your CUDA device
nvidia-smi does recognize the correct card. nvaccelinfo -v, however, doesn’t.
As per the advice in one of the prior posts, I have blacklisted nouveau and set LD_LIBRARY_PATH so that libcuda.so may be found
$ echo $LD_LIBRARY_PATH
/usr/local/cuda-12.2/lib64:/opt/nvidia/hpc_sdk/Linux_x86_64/23.7/REDIST/cuda/12.2/targets/x86_64-linux/lib/stubs:/opt/nvidia/hpc_sdk/Linux_x86_64/23.7/cuda/12.2/targets/x86_64-linux/lib/stubs
I have installed the NVIDIA driver, CUDA Toolkit and the HPC SDK
$ nvcc --version
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2023 NVIDIA Corporation
Built on Tue_Aug_15_22:02:13_PDT_2023
Cuda compilation tools, release 12.2, V12.2.140
Build cuda_12.2.r12.2/compiler.33191640_0
$ inxi -G
Graphics:
Device-1: NVIDIA GP104GL [Tesla P4] driver: nvidia v: 535.104.05
Device-2: Matrox Systems G200eR2 driver: mgag200 v: kernel
Display: server: X.Org v: 1.21.1.4 driver: X: loaded: modesetting,nvidia
unloaded: fbdev,nouveau,vesa gpu: mgag200 resolution: 2048x1152~60Hz
OpenGL: renderer: N/A v: N/A
Any ideas?
Cheers,
Jyoti