Can't install any nvidia driver for Quadro K3100M on Ubuntu 22

gpaladin · February 5, 2024, 10:17am

System:
Host: xxx Kernel: 6.5.0-15-generic x86_64 bits: 64 Desktop: GNOME 42.9
Distro: Ubuntu 22.04.3 LTS (Jammy Jellyfish)
Graphics:
Device-1: Intel 4th Gen Core Processor Integrated Graphics driver: i915
v: kernel
Device-2: NVIDIA GK104GLM [Quadro K3100M] driver: N/A
Device-3: Chicony HP HD Webcam type: USB driver: uvcvideo
Display: wayland server: X.Org v: 1.22.1.1 with: Xwayland v: 22.1.1
compositor: gnome-shell driver: X: loaded: modesetting,nvidia
unloaded: fbdev,nouveau,vesa gpu: i915 resolution: 1920x1080~60Hz
OpenGL: renderer: Mesa Intel HD Graphics 4600 (HSW GT2)
v: 4.6 Mesa 23.0.4-0ubuntu1~22.04.1

Problem:

The system installs cleanly with the X.Org nouveau driver.

The other two options offered are
nvidia-driver-390
nvidia-driver-418-server

Both installations fail with an error code.

What to do?

generix · February 5, 2024, 10:19am

Please install the 470 driver using Software & Updates; this is the correct driver for your Kepler-based GPU.

gpaladin · February 5, 2024, 4:30pm

Thank you!

These are available options:

I'm not sure how to get to driver 470 from Software & Updates.

gpaladin · February 5, 2024, 5:10pm

I tried like this:

$ sudo apt install nvidia-driver-470

then:

$ nvidia-smi
Mon Feb  5 18:05:42 2024       
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 470.223.02   Driver Version: 470.223.02   CUDA Version: 11.4     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  Quadro K3100M       Off  | 00000000:01:00.0 Off |                  N/A |
| N/A   45C    P8     3W /  N/A |      5MiB /  4037MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+
                                                                               
+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|    0   N/A  N/A      1646      G   /usr/lib/xorg/Xorg                  2MiB |
+-----------------------------------------------------------------------------+

then:

$ nvtop

with output:

nvtop window never shows any readings.

Does this look right?

How to test the GPU?

generix · February 5, 2024, 6:44pm

The nvidia gpu is in offload mode
https://http.download.nvidia.com/XFree86/Linux-x86_64/550.40.07/README/primerenderoffload.html
Try running
__NV_PRIME_RENDER_OFFLOAD=1 __GLX_VENDOR_LIBRARY_NAME=nvidia glxgears
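
For anyone who wants to start programs this way from a script, here is a minimal sketch in Python. It only sets the two environment variables from the command above and launches the program. The helper names offload_env and run_offloaded are made up for illustration and are not part of any NVIDIA tool.

```python
import os
import subprocess


def offload_env(base=None):
    """Return a copy of the environment with the two PRIME render offload variables set."""
    env = dict(os.environ if base is None else base)
    env["__NV_PRIME_RENDER_OFFLOAD"] = "1"
    env["__GLX_VENDOR_LIBRARY_NAME"] = "nvidia"
    return env


def run_offloaded(cmd):
    """Start cmd (a list such as ["glxgears"]) with the offload variables and return the Popen handle."""
    return subprocess.Popen(cmd, env=offload_env())


# Hypothetical usage, assuming glxgears is installed:
# run_offloaded(["glxgears"]).wait()
```

Programs started without these variables should keep rendering on the Intel GPU, which is the default in offload mode.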

gpaladin · February 5, 2024, 6:51pm

Running synchronized to the vertical refresh.  The framerate should be
approximately the same as the monitor refresh rate.
302 frames in 5.0 seconds = 60.283 FPS
301 frames in 5.0 seconds = 60.007 FPS
301 frames in 5.0 seconds = 60.003 FPS

generix · February 5, 2024, 7:25pm

Watch nvidia-smi or nvtop while glxgears is running; the process should show up there.
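
If reading the table by eye is awkward, the check can be scripted. The following is a rough sketch that assumes the "Processes:" table layout printed by the 470 driver earlier in this thread; other driver versions may format the table differently. The function names are invented for this example.

```python
import re

# Matches one row of the "Processes:" table, e.g.
# |    0   N/A  N/A      1646      G   /usr/lib/xorg/Xorg                  2MiB |
ROW = re.compile(
    r"^\|\s+(?P<gpu>\d+)\s+\S+\s+\S+\s+(?P<pid>\d+)\s+(?P<type>\S+)\s+"
    r"(?P<name>.+?)\s+(?P<mem>\d+)MiB\s+\|$"
)


def gpu_processes(smi_text):
    """Return (gpu, pid, type, name, mem_mib) tuples found in nvidia-smi text output."""
    rows = []
    for line in smi_text.splitlines():
        m = ROW.match(line.rstrip())
        if m:
            rows.append(
                (int(m["gpu"]), int(m["pid"]), m["type"], m["name"], int(m["mem"]))
            )
    return rows


def is_on_gpu(smi_text, name):
    """True if any listed process name contains the given string."""
    return any(name in row[3] for row in gpu_processes(smi_text))
```

To use it, capture the output of nvidia-smi (for example with subprocess.run(["nvidia-smi"], capture_output=True, text=True).stdout) while glxgears is running and call is_on_gpu(output, "glxgears").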

gpaladin · February 5, 2024, 9:02pm

This is the current status:

cogwheels are turning
first terminal window is showing ~60 FPS
nvidia-smi to me looks the same
nvtop shows some activity

gpaladin · February 5, 2024, 10:04pm

After:

sudo apt install nvidia-cuda-toolkit

nvidia-smi returns:

Command 'nvidia-smi' not found, but can be installed with:
sudo apt install nvidia-utils-390         # version 390.157-0ubuntu0.22.04.2, or
sudo apt install nvidia-utils-418-server  # version 418.226.00-0ubuntu5~0.22.04.1
sudo apt install nvidia-utils-450-server  # version 450.248.02-0ubuntu0.22.04.1
sudo apt install nvidia-utils-470         # version 470.223.02-0ubuntu0.22.04.1
sudo apt install nvidia-utils-470-server  # version 470.223.02-0ubuntu0.22.04.1
sudo apt install nvidia-utils-525         # version 525.147.05-0ubuntu0.22.04.1
sudo apt install nvidia-utils-525-server  # version 525.147.05-0ubuntu0.22.04.1
sudo apt install nvidia-utils-535         # version 535.129.03-0ubuntu0.22.04.1
sudo apt install nvidia-utils-535-server  # version 535.129.03-0ubuntu0.22.04.1
sudo apt install nvidia-utils-510         # version 510.60.02-0ubuntu1
sudo apt install nvidia-utils-510-server  # version 510.47.03-0ubuntu3

nvtop returns: no GPU to monitor

but Software & Updates shows that a driver is installed:

after:

sudo apt install nvidia-utils-470

nvidia-smi and nvtop show data as before.
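
For reference, the package that worked here is the one whose number matches the major version of the installed driver (470.223.02 in the earlier nvidia-smi output). A small sketch of that matching follows, assuming Ubuntu's nvidia-utils-<branch> naming as shown in the suggestion list above. The function name is hypothetical.

```python
import re


def utils_package_for(driver_version, server=False):
    """Map a driver version such as "470.223.02" to the matching nvidia-utils package name."""
    m = re.match(r"(\d+)\.", driver_version)
    if not m:
        raise ValueError(f"unrecognized driver version: {driver_version!r}")
    suffix = "-server" if server else ""
    return f"nvidia-utils-{m.group(1)}{suffix}"


# The driver version reported earlier in this thread:
print(utils_package_for("470.223.02"))
```

This only builds the name; it does not check that the package exists in the configured repositories.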

but:

python3 -c "import tensorflow as tf; print(tf.config.list_physical_devices('GPU'))"

returns:

2024-02-05 23:10:40.063925: E external/local_xla/xla/stream_executor/cuda/cuda_dnn.cc:9261] Unable to register cuDNN factory: Attempting to register factory for plugin cuDNN when one has already been registered
2024-02-05 23:10:40.063972: E external/local_xla/xla/stream_executor/cuda/cuda_fft.cc:607] Unable to register cuFFT factory: Attempting to register factory for plugin cuFFT when one has already been registered
2024-02-05 23:10:40.064924: E external/local_xla/xla/stream_executor/cuda/cuda_blas.cc:1515] Unable to register cuBLAS factory: Attempting to register factory for plugin cuBLAS when one has already been registered
2024-02-05 23:10:40.070361: I tensorflow/core/platform/cpu_feature_guard.cc:182] This TensorFlow binary is optimized to use available CPU instructions in performance-critical operations.
To enable the following instructions: AVX2 FMA, in other operations, rebuild TensorFlow with the appropriate compiler flags.
2024-02-05 23:10:40.798184: W tensorflow/compiler/tf2tensorrt/utils/py_utils.cc:38] TF-TRT Warning: Could not find TensorRT
2024-02-05 23:10:41.390577: I external/local_xla/xla/stream_executor/cuda/cuda_executor.cc:901] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero. See more at https://github.com/torvalds/linux/blob/v6.0/Documentation/ABI/testing/sysfs-bus-pci#L344-L355
2024-02-05 23:10:41.424517: I external/local_xla/xla/stream_executor/cuda/cuda_executor.cc:901] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero. See more at https://github.com/torvalds/linux/blob/v6.0/Documentation/ABI/testing/sysfs-bus-pci#L344-L355
2024-02-05 23:10:41.425007: I external/local_xla/xla/stream_executor/cuda/cuda_executor.cc:901] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero. See more at https://github.com/torvalds/linux/blob/v6.0/Documentation/ABI/testing/sysfs-bus-pci#L344-L355
2024-02-05 23:10:41.425333: I tensorflow/core/common_runtime/gpu/gpu_device.cc:2298] Ignoring visible gpu device (device: 0, name: Quadro K3100M, pci bus id: 0000:01:00.0, compute capability: 3.0) with Cuda compute capability 3.0. The minimum required Cuda capability is 3.5.

generix · February 6, 2024, 8:49am

You can see in nvidia-smi output clearly that glxgears is running on the nvidia gpu.

gpaladin · February 7, 2024, 10:00am

Yes, now I see it too. Thank you!

Any comment on the tensorflow issues or should I open another topic for that one?

generix · February 7, 2024, 10:06am

The CUDA compute capability (cc) of your GPU is 3.0, but the TensorFlow version you’re using requires cc 3.5 at minimum, so it can’t use your GPU. You will need either an older TensorFlow version or a build recompiled for cc 3.0.
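
The comparison TensorFlow makes can be illustrated with a short sketch. It pulls the compute capability out of the "Ignoring visible gpu device" log line posted above and compares it with the 3.5 minimum that the same line reports. The names MIN_CC, parse_cc and usable are made up for this example; this is not TensorFlow's own code.

```python
import re

# Minimum compute capability stated in the log line from this thread
MIN_CC = (3, 5)


def parse_cc(log_line):
    """Extract (major, minor) from text containing "compute capability: X.Y", or None."""
    m = re.search(r"compute capability: (\d+)\.(\d+)", log_line)
    return (int(m.group(1)), int(m.group(2))) if m else None


def usable(cc, minimum=MIN_CC):
    """Tuples compare element by element, so (3, 0) >= (3, 5) is False."""
    return cc >= minimum


line = ("Ignoring visible gpu device (device: 0, name: Quadro K3100M, "
        "pci bus id: 0000:01:00.0, compute capability: 3.0) with Cuda "
        "compute capability 3.0. The minimum required Cuda capability is 3.5.")
print(parse_cc(line), usable(parse_cc(line)))
```

With the K3100M's value of 3.0 the check fails, which matches the empty GPU list TensorFlow returned.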

