Tensorflow Error after Confirming CUDA/Cudnn Installation being Successful

wvictor · August 30, 2018, 4:13pm

Hello All,

I have been browsing webpages and posts for days and have tried every solution I could find, with no success.
The system settings:
GPU: Tesla M60; OS: Ubuntu-16-04; Nvidia Driver: 375.26; CUDA-ToolKit: 8.0; Cudnn: tried both 6.0 and 7.2
Tensorflow-GPU: V1.10.0 installed through Anaconda

I have passed deviceQuery and bandwidthTest for CUDA, as well as mnist sample for cudnn. However, when trying to start a tensorflow session, I encountered the following error:
“Internal: cudaGetDevice() failed. Status: CUDA driver version is insufficient for CUDA runtime version”.

I have ensured both CUDA driver and runtime version are listed as 8.0 through the nvidia-smi command. I have also checked that CUDA 8.0 is compatible with Nvidia driver 375 (in fact, I installed the nvidia driver through CUDA’s runfile). Could anyone kindly provide some insight into this problem?

wvictor · August 30, 2018, 4:25pm

Meanwhile, several questions that I couldn’t figure out despite searching through online information:

1. Is there a restriction on which CUDA versions are compatible with the Tesla M60 card?
Tesla M60 has compute capability 5.3, thus the Maxwell architecture. The guide here: https://docs.nvidia.com/cuda/maxwell-compatibility-guide/index.html only mentions compatibility up to CUDA 7.0, with no concrete specification of supported CUDA versions. I believe CUDA 8.0 would be compatible, since I have successfully run tensorflow on a GTX 750-Ti with CUDA 8.0 on Windows, and the 750-Ti also has compute capability 5.3.

2. Installation methods for the Nvidia driver?
This is the most confusing part, as 3 methods are widely mentioned without any comparison:
a). Install the Nvidia driver first, through the Nvidia driver download webpage.
b). Install the Nvidia driver first, through sudo apt-get (for Ubuntu).
c). Install the Nvidia driver through CUDA’s runfile.
I understand that b), compared with a), lags behind in available versions. However, some posts suggest that certain errors are caused by one method but not the other. Essentially, how should we choose among these 3 methods?

3. Version compatibility list between Nvidia driver, CUDA, and cudnn?
It’s interesting that I couldn’t find any definitive list on official webpages, as such information is vital. I could only find a list in one forum post indicating the latest nvidia driver version that each CUDA version supports. It would definitely be helpful if such information could be obtained in an organized manner from the official source.

Thanks a lot to anyone who can address these questions.

Robert_Crovella · August 30, 2018, 4:45pm

The tensorflow you are using is built against CUDA 9 and therefore won’t work with your CUDA 8 setup.

Tesla M60 is not compute capability 5.3. It is 5.2.

There are multiple methods to install a GPU driver.
Here is a recent driver for Tesla M60 on Ubuntu 16.04:

https://www.nvidia.com/drivers/results/136950

(you can always find drivers by using the driver wizard at Official Drivers | NVIDIA)
(you can always find CUDA toolkit installers at http://www.nvidia.com/getcuda)
(and note that the install guides are linked from the download pages; I suggest reading a CUDA Linux install guide)

It will work with CUDA 9 and you should be able to install and use CUDA 9 on Tesla M60 if you wish.

The official source for driver/cuda toolkit compatibility is Table 1 here:

https://docs.nvidia.com/cuda/cuda-toolkit-release-notes/index.html#major-components

For CUDNN compatibility, just study the options on the download page for CUDNN. They indicate which CUDA versions they were built for or support.
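
As a rough illustration of how that table is used, the sketch below compares a driver version string against the minimum Linux driver for a given CUDA toolkit. The numbers in `MIN_LINUX_DRIVER` are recalled from Table 1 and should be treated as assumptions to verify there; the function names are made up for this example.

```python
import re

# Minimum Linux driver per CUDA toolkit release.
# ASSUMPTION: values recalled from Table 1 of the CUDA release notes;
# check the official table before relying on them.
MIN_LINUX_DRIVER = {
    "8.0": (375, 26),
    "9.0": (384, 81),
    "9.2": (396, 26),
}


def parse_driver_version(text):
    """Turn a string such as '375.26' into a comparable tuple (375, 26)."""
    match = re.search(r"(\d+)\.(\d+)", text)
    if match is None:
        raise ValueError("no driver version found in %r" % text)
    return (int(match.group(1)), int(match.group(2)))


def driver_supports(driver_text, cuda_version):
    """Return True if the driver meets the minimum for the given CUDA release."""
    return parse_driver_version(driver_text) >= MIN_LINUX_DRIVER[cuda_version]


# The driver from the original post against the two toolkits discussed here.
print(driver_supports("375.26", "8.0"))  # True
print(driver_supports("375.26", "9.0"))  # False
```

The driver string can be taken from the header of the nvidia-smi output. With driver 375.26 the check passes for CUDA 8.0 but fails for CUDA 9.0, which is consistent with a CUDA 9 build of tensorflow reporting that the driver version is insufficient.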

wvictor · August 30, 2018, 7:36pm

Hello txbob, thanks for your reply! How can I find out that tensorflow 1.10.0 is not compatible with CUDA versions before 9.0? Thanks!

Robert_Crovella · August 30, 2018, 8:01pm

https://www.tensorflow.org/install/install_sources#tested_source_configurations

Version                 CPU/GPU   Python Version   Compiler   Build Tools    cuDNN   CUDA
tensorflow-1.10.0       CPU       2.7, 3.3-3.6     GCC 4.8    Bazel 0.15.0   N/A     N/A
tensorflow_gpu-1.10.0   GPU       2.7, 3.3-3.6     GCC 4.8    Bazel 0.15.0   7       9

To be clear, I didn’t say “tensorflow 1.10.0 is not compatible with CUDA versions before 9.0”, I said “The tensorflow you are using is built against CUDA 9 and therefore won’t work with your CUDA 8 setup.”

The version of TF you are using installed via Anaconda is built against CUDA 9. TF can be built (from sources) against different versions of CUDA, but the prebuilt binaries from trusted sources are usually built according to the table already given above.
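
One way to see which CUDA version a prebuilt binary expects is to look at the versioned library names it links against. The sketch below is only an illustration: it assumes the text comes from running `ldd` on the tensorflow shared library inside the Anaconda environment (in 1.x releases this is, to my knowledge, `_pywrap_tensorflow_internal.so`), and the helper name and sample text are hypothetical.

```python
import re

# Matches names such as libcudart.so.9.0 or libcudnn.so.7 in ldd-style output.
CUDA_LIB_PATTERN = re.compile(r"lib(cudart|cublas|cudnn)\.so\.(\d+(?:\.\d+)*)")


def linked_cuda_libs(ldd_output):
    """Map each CUDA library name found in the text to its version suffix."""
    return {name: version for name, version in CUDA_LIB_PATTERN.findall(ldd_output)}


# Hypothetical sample text, not taken from the system in this thread.
sample = """
    libcublas.so.9.0 => not found
    libcudart.so.9.0 => not found
    libcudnn.so.7 => not found
"""

print(linked_cuda_libs(sample))
```

A `9.0` suffix on `libcudart` would indicate a build against CUDA 9.0, so a CUDA 8.0 installation, which provides `libcudart.so.8.0`, cannot satisfy it.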

Note that tensorflow is not a NVIDIA product.

wvictor · August 30, 2018, 8:59pm

Hello txbob, thanks again for your prompt reply.

Yet after I installed nvidia driver 396 and CUDA 9.0 and built the samples, I encountered the following error in bandwidthTest:
Running on…

Device 0: Tesla M60
Quick Mode

CUDA error at bandwidthTest.cu:730 code=46(cudaErrorDevicesUnavailable) “cudaEventCreate(&start)”

I have seen this error before, whenever I install the nvidia driver NOT from the CUDA runfile (which is why I asked what the difference is between installing the driver from CUDA’s runfile and installing it another way). I am also wondering why installing the Nvidia driver through the CUDA runfile prompts me to install gcc and make, while installing it through the deb file from the official driver download page does not (does it automatically install both gcc and make?).

An online search for this code 46 error also turned up no solution that resolves my case.

Update: After I restarted from scratch, I installed CUDA 9.0 and NVidia driver with the runfile (384.81) and cudnn V7. I then proceeded as before: anaconda 5.2 and tensorflow-gpu 1.10.0.
Yet the same error occurs again:
Internal: cudaGetDevice() failed. Status: CUDA driver version is insufficient for CUDA runtime version

Any insights would be highly appreciated!
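
One thing that could be checked at this point (an assumption, not something confirmed in this thread) is which CUDA runtime the Anaconda environment actually uses, since conda can install its own `cudatoolkit` and `cudnn` packages independently of the system-wide CUDA installation. The sketch below pulls those versions out of the text printed by `conda list`; the helper name is hypothetical.

```python
def conda_package_versions(conda_list_output,
                           names=("cudatoolkit", "cudnn", "tensorflow-gpu")):
    """Return {package: version} for the named packages in `conda list` text."""
    found = {}
    for line in conda_list_output.splitlines():
        line = line.strip()
        # Skip blank lines and the '#' header lines that conda prints.
        if not line or line.startswith("#"):
            continue
        fields = line.split()
        # Columns are: name, version, build, and optionally channel.
        if len(fields) >= 2 and fields[0] in names:
            found[fields[0]] = fields[1]
    return found
```

The input can be captured with `subprocess.check_output(["conda", "list"]).decode()` from inside the activated environment. If a `cudatoolkit` version newer than the system CUDA shows up, the driver would need to meet the minimum for that newer version rather than for the one installed from the runfile.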

wvictor · September 4, 2018, 4:45pm

I have now tried with CUDA 9.0 and still have no success. Are there any more insights into what could be causing the “CUDA driver version is insufficient for CUDA runtime version” error?

I have been stuck here for a while, so any support would be appreciated!
