I am attempting to install cuDNN to use with CUDA. CUDA is already installed when I run
nvcc --version
I receive:
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2017 NVIDIA Corporation
Built on Fri_Nov__3_21:07:56_CDT_2017
Cuda compilation tools, release 9.1, V9.1.85
step of testing the cuDNN install I receive the following error:
rm -rf *o
rm -rf mnistCUDNN
Linking agains cublasLt = false
CUDA VERSION: 9010
TARGET ARCH: x86_64
HOST_ARCH: x86_64
TARGET OS: linux
SMS: 30 35 50 53 60 61 62 70
/bin/sh: 1: /usr/local/cuda/bin/nvcc: not found
>>> WARNING - FreeImage is not set up correctly. Please ensure FreeImage is set up correctly. <<<
So it looks like there is an issue with a root file in
/usr/local/cuda/bin/
. After checking the
/usr/local/cuda/
directory it only has ‘include’ and ‘lib64’ directories, was there a problem with CUDA install? Did a directory get deleted somewhere along the way?
The driver reports, via nvidia-smi, the latest CUDA version it supports: 10.1
The compiler reports what CUDA version it belongs to: 9.1
Generally speaking, newer drivers can support older CUDA versions. So a driver that supports versions up to and including 10.1 also supports CUDA 9.x and CUDA 8.x, for example. Example from the machine on which I am writing this post:
C:\Users\Norbert\My Programs>nvcc --version
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2016 NVIDIA Corporation
Built on Mon_Jan__9_17:32:33_CST_2017
Cuda compilation tools, release 8.0, V8.0.60
C:\Users\Norbert\My Programs>nvidia-smi
Tue Oct 08 13:22:55 2019
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 431.02 Driver Version: 431.02 CUDA Version: 10.2 |
|-------------------------------+----------------------+----------------------+
| GPU Name TCC/WDDM | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
|===============================+======================+======================|
| 0 Quadro P2000 WDDM | 00000000:01:00.0 On | N/A |
| 82% 80C P0 62W / 75W | 1030MiB / 5120MiB | 94% Default |
+-------------------------------+----------------------+----------------------+
Hi I was able to fix the make issue but manually copying the nvcc file to the cuda folder. Now when I compile and test I get the following output:
Testing single precision
Loading image data/one_28x28.pgm
Performing forward propagation ...
Testing cudnnGetConvolutionForwardAlgorithm ...
Fastest algorithm is Algo 0
Testing cudnnFindConvolutionForwardAlgorithm ...
^^^^ CUDNN_STATUS_SUCCESS for Algo 0: 0.010688 time requiring 0 memory
^^^^ CUDNN_STATUS_SUCCESS for Algo 2: 0.031968 time requiring 57600 memory
^^^^ CUDNN_STATUS_SUCCESS for Algo 1: 0.040896 time requiring 3464 memory
^^^^ CUDNN_STATUS_SUCCESS for Algo 7: 0.051840 time requiring 2057744 memory
^^^^ CUDNN_STATUS_SUCCESS for Algo 4: 0.069632 time requiring 207360 memory
Cublas failure
Error code 0
gemv.h:77
Aborting...
I’ve tried with all 4 cuDNN download options and have also tried two archived releases as well and they all report the same error.
I’ve seen other posts on the error but no good solutions. I am running a GeForce 2080 Ti, with driver 430.50 and Cuda 9.1.85. Is the issue the version of Cuda, does it need to be 10.0 or greater?
I already indicated if it were me, I would switch to CUDA 10.0:
In fact, if it were me, I would do a fresh load of the OS and start with a clean slate.
Thats a result of a ubuntu-provided install method, not a NVIDIA-provided install method. When you use the CUDNN package, it expects you to use a NVIDIA-provided proper install of CUDA.