I have updated the drivers for my GTX 970 under Ubuntu 16.04 from version 367.57 to version 375.26. Now when I run nvidia-smi I get:
Failed to initialize NVML: Driver/library version mismatch
And programs using TensorFlow don't find the GPU anymore; I get these errors:
E tensorflow/stream_executor/cuda/cuda_driver.cc:509] failed call to cuInit: CUDA_ERROR_NO_DEVICE
I tensorflow/stream_executor/cuda/cuda_diagnostics.cc:158] retrieving CUDA diagnostic information for host: Ono-Sendai
I tensorflow/stream_executor/cuda/cuda_diagnostics.cc:165] hostname: Ono-Sendai
I tensorflow/stream_executor/cuda/cuda_diagnostics.cc:189] libcuda reported version is: 375.26.0
I tensorflow/stream_executor/cuda/cuda_diagnostics.cc:363] driver version file contents: """NVRM version: NVIDIA UNIX x86_64 Kernel Module 367.57 Mon Oct 3 20:37:01 PDT 2016
GCC version: gcc version 5.4.0 20160609 (Ubuntu 5.4.0-6ubuntu1~16.04.4)
"""
I tensorflow/stream_executor/cuda/cuda_diagnostics.cc:193] kernel reported version is: 367.57.0
E tensorflow/stream_executor/cuda/cuda_diagnostics.cc:303] kernel version 367.57.0 does not match DSO version 375.26.0 -- cannot find working devices in this configuration
After I reverted to version 367.57 of the drivers, the errors went away, and nvidia-smi and TensorFlow work fine. So what should I do to use the most recent driver, 375.26? Perhaps I need to install a more recent version of the CUDA toolkit and/or cuDNN?
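For reference, here is how I compared the two versions the log complains about; the library path is an assumption from my setup and may differ depending on the install method:

    # kernel side: version reported by the loaded NVIDIA module
    cat /proc/driver/nvidia/version
    # user-space side: version of the installed libcuda (path is an assumption)
    ls -l /usr/lib/x86_64-linux-gnu/libcuda.so*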
Install them correctly. The correct install method depends on how the previous driver was installed; you'll get a sense of this if you read the Linux install guide.
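For example, a rough way to tell how the current driver got onto the system (the exact paths are assumptions and may vary):

    # a repository (.deb) install shows up as packages
    dpkg -l | grep -i nvidia
    # a runfile install usually leaves the installer's uninstaller and log behind
    ls /usr/bin/nvidia-uninstall /var/log/nvidia-installer.log 2>/dev/null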
Mmm, it seems the container must be rebuilt on a driver change, which makes sense.
Full procedure was (rough command sketch below the list):
1. install CUDA (which automatically installs the unwanted newest driver)
2. remove the new driver with apt-get remove nvidia-driverxxxx
3. install the old driver from the runfile
4. rebuild the container
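Roughly, the commands looked like this; the package name, runfile name, and image tag are assumptions, so adjust them to your versions and setup:

    # 1. install CUDA from the NVIDIA apt repo (pulls in the newest driver as a dependency)
    sudo apt-get install cuda
    # 2. remove the unwanted newer driver it brought in (package name is an assumption)
    sudo apt-get remove --purge 'nvidia-375*'
    # 3. reinstall the older driver from its runfile (file name is an assumption)
    sudo sh NVIDIA-Linux-x86_64-367.57.run
    # 4. rebuild the GPU container so it matches the installed driver (image tag is an assumption)
    docker build -t tf-gpu .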