can't install new driver, cannot unload module

dehanlazuardi · July 30, 2018, 7:20am

The installer says ‘An NVIDIA kernel module ‘nvidia-uvm’ appears to already be loaded in your kernel.’
I tried to unload it but got ‘operation is not permitted’ (even as root, via su).

The output of lsmod | grep nvidia is:

nvidia_drm 45056 0
drm_kms_helper 151552 1 nvidia_drm
drm 352256 3 nvidia_drm,drm_kms_helper
nvidia_modeset 790528 1 nvidia_drm
nvidia_uvm 647168 0
nvidia 12304384 2 nvidia_modeset,nvidia_uvm

I am using CentOS 7, running on a virtual machine. I can’t restart the server, since everyone uses the same server.

The problem started when I accidentally installed a new NVIDIA driver while installing CUDA.
I can’t run nvidia-smi.

I already uninstalled CUDA and the NVIDIA driver,

but I cannot install a new driver.

What should I do?

generix · July 30, 2018, 8:56am

Stop any Xserver, stop the nvidia-persistenced, then unload modules using sudo modprobe -r nvidia
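
A minimal sketch of the unload step, assuming the modules are held only by each other and not by a running process. The helper name nvidia_unload_order is hypothetical; it reads lsmod text and prints the NVIDIA modules so that each one appears only after the modules that use it (for example nvidia_drm before nvidia_modeset, and nvidia last).

```shell
#!/usr/bin/env bash
# Hypothetical helper: read `lsmod` output on stdin and print the nvidia*
# modules in an order where every module comes after the modules using it.
nvidia_unload_order() {
  awk '
    # Column 1 is the module name, column 4 the comma-separated users (may be empty).
    $1 ~ /^nvidia/ { name[++n] = $1; users[$1] = $4; loaded[$1] = 1 }
    END {
      left = n
      while (left > 0) {
        progressed = 0
        for (k = 1; k <= n; k++) {
          m = name[k]
          if (!loaded[m]) continue
          blocked = 0
          cnt = split(users[m], u, ",")
          for (i = 1; i <= cnt; i++)
            if ((u[i] in loaded) && loaded[u[i]]) blocked = 1
          if (!blocked) { print m; loaded[m] = 0; left--; progressed = 1 }
        }
        if (!progressed) break   # give up on a dependency cycle
      }
    }'
}

# Example use (needs root and permission to unload modules):
#   lsmod | nvidia_unload_order | while read -r m; do sudo modprobe -r "$m"; done
```

If a module still reports a non-zero use count with no module listed next to it, something other than another module is holding it, and this ordering alone will not help.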

LostDog · July 30, 2018, 2:51pm

You may also want to try update-initramfs to make sure nothing is getting added at boot from that.

dehanlazuardi · July 31, 2018, 1:41am

How can I list any X server that I am using?

I tried
‘service kdm stop’
‘service gdm stop’
‘service lightdm stop’

but I always get
‘failed to stop, service is not loaded’

I access this server from my browser through JupyterHub, which is installed on the server, and I use the terminal that JupyterHub provides. So if I stop the X server, should I ssh to the server, or can I still use JupyterHub?

Sorry, I’m new to this OS :)

generix · July 31, 2018, 9:01am

use
ps a |grep X
to check for running Xservers.
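
A small sketch of the same check that avoids matching the grep process itself. The helper name filter_x_servers is hypothetical, and the list of server names (X, Xorg, Xwayland, Xvfb) is an assumption about what might be running, not an exhaustive list.

```shell
#!/usr/bin/env bash
# Hypothetical helper: keep only "PID COMMAND" lines whose command is an X server.
filter_x_servers() {
  awk '$2 == "X" || $2 == "Xorg" || $2 == "Xwayland" || $2 == "Xvfb"'
}

# Example use: list all processes as "PID COMMAND" without a header, then filter.
#   ps -e -o pid= -o comm= | filter_x_servers
#
# Any process that has the GPU device nodes open also keeps the modules busy:
#   sudo fuser -v /dev/nvidia*
```

Empty output from the filter means no X server with one of those names is running.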

dehanlazuardi · July 31, 2018, 9:08am

The output of ‘ps axu |grep X’ is
root 756 0.0 0.0 9036 892 pts/3 S+ 08:33 0:00 grep --color=auto X

What X server do I use?

generix · July 31, 2018, 9:10am

none.
So there’s something different keeping the modules from being unloaded. Please run nvidia-bug-report.sh as root and attach the resulting .gz file to your post. Hovering the mouse over an existing post will reveal a paperclip icon.

dehanlazuardi · August 1, 2018, 1:18am

I already uninstalled the NVIDIA driver and cannot install it again. Can I download the file separately and run it on my machine?

dehanlazuardi · August 1, 2018, 2:23am

What if I blacklist the nvidia module, install the new driver, then whitelist the module again?

dehanlazuardi · August 10, 2018, 2:19am

So in the end I did a fresh install of my VM with Ubuntu (I used CentOS before). But when I run
lsmod | grep nvidia

nvidia_uvm, nvidia_drm and nvidia are still loaded.

Why would they load, even on a freshly installed OS?
Please help me, I’m stuck with this.

My server’s condition:

No one is using the GPU except me.
There are several VMs on the server, so I cannot restart the server.

Should I unload the modules from the main computer, or is restarting the server unavoidable?
Or is there another solution?

muchlisinadi · August 10, 2018, 2:26am

I have the same problem as him… and I am still struggling with it…

LostDog · August 11, 2018, 12:45am

Have you updated the initramfs?

burnbitter · August 11, 2018, 4:52am

It’s possible that module unloading has not been compiled into your kernel. Do you have the config file in /proc? It would be /proc/config.gz - check it to see if it has module unloading selected…
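
A minimal sketch of that check. The helper name module_unload_enabled is hypothetical; CONFIG_MODULE_UNLOAD is the kernel option that controls module unloading. Not every kernel exposes /proc/config.gz, so the example comments also show /boot/config-$(uname -r), which many distributions ship instead (an assumption about your system).

```shell
#!/usr/bin/env bash
# Hypothetical helper: succeed if the kernel config text on stdin
# has module unloading enabled.
module_unload_enabled() {
  grep -q '^CONFIG_MODULE_UNLOAD=y'
}

# Example use, depending on which config file your kernel provides:
#   zcat /proc/config.gz | module_unload_enabled && echo "unloading supported"
#   module_unload_enabled < "/boot/config-$(uname -r)" && echo "unloading supported"
```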

david9xqqb · January 5, 2020, 7:35pm

I solved this problem by disabling the GUI, rebooting, logging in and installing the driver, then enabling the GUI and rebooting again.

Please make sure you know your username and password!!!

Open a terminal and write

sudo systemctl set-default multi-user.target
sudo reboot 0

Now log in and you’ll get to a terminal directly. Install the driver. Note that I am installing 440.44 here, so you need to modify the command for your driver version.

sudo ./NVIDIA-Linux-x86_64-440.44.run

After installing the driver enable the GUI and Reboot:

sudo systemctl set-default graphical.target
sudo reboot 0

You should be done

In my case, nvidia-smi reported the new version 440.44, while the Additional Drivers tab in the Ubuntu 18.04 Software & Updates utility shows 435!! Another NVIDIA mystery, but heck, my new docker works!!!
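
One way to see which version is actually in use is to compare what the loaded kernel module, the user-space tools, and the module file on disk each report. This is only a sketch: the helper name first_version is hypothetical, and it assumes the version is the first dotted number in the text it is given.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print the first dotted version number (e.g. 440.44)
# found in the text on stdin.
first_version() {
  grep -oE '[0-9]+\.[0-9]+(\.[0-9]+)?' | head -n 1
}

# Version of the kernel module that is currently loaded:
#   first_version < /proc/driver/nvidia/version
# Version reported through the user-space tools:
#   nvidia-smi --query-gpu=driver_version --format=csv,noheader
# Version of the module file on disk, which can differ until the module is reloaded:
#   modinfo -F version nvidia
```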

tonykutunio · June 18, 2021, 2:26pm

Hi, could you take a look at a bit of my logs as well, please:

The NVIDIA probe routine failed for 1 device(s).
Jun 13 15:18:12 maxx kernel: [ 2318.716005] NVRM: None of the NVIDIA devices were initialized.
Jun 13 15:18:12 maxx kernel: [ 2318.716203] nvidia-nvlink: Unregistered the Nvlink Core, major device number 234
Jun 13 15:18:13 maxx kernel: [ 2319.082988] nvidia-nvlink: Nvlink Core is being initialized, major device number 234
Jun 13 15:18:13 maxx kernel: [ 2319.083639] NVRM: This is a 64-bit BAR mapped above 4GB by the system
Jun 13 15:18:13 maxx kernel: [ 2319.083639] NVRM: BIOS or the Linux kernel, but the PCI bridge
Jun 13 15:18:13 maxx kernel: [ 2319.083639] NVRM: immediately upstream of this GPU does not define
Jun 13 15:18:13 maxx kernel: [ 2319.083639] NVRM: a matching prefetchable memory window.
Jun 13 15:18:13 maxx kernel: [ 2319.083640] NVRM: This may be due to a known Linux kernel bug. Please
Jun 13 15:18:13 maxx kernel: [ 2319.083640] NVRM: see the README section on 64-bit BARs for additional
Jun 13 15:18:13 maxx kernel: [ 2319.083640] NVRM: information.
