Hello guys, I have a problem installing the driver for the Quadro P600 in Debian Trixie, intended just for CUDA purposes.
I first installed the ‘keyring’ version (590) following this post Reddit - The heart of the internet
and
- nouveau is blacklisted
- secure boot is disabled in the BIOS
- running initramfs, etc
when rebooting the system, a nice message appeared telling me that “Quadro P600” has been detected and the nvidia drivers have been successfully loaded.
Then, nvidia-smi detects properly the Quadro, but it also says that as such architecture is not supported in the running version of the driver (590) it will be ignored. And also says than an older version should be installed instead (580xx, for instance).
Then I repeat all the previous, reinstalling everything (even the Trixie) but this time ignoring the keyring part, so the ‘non-free’ official repo drivers are installed instead (version 550).
But this time, when booting, the message only says that the nvidia driver 550 has been loaded
and when running nvidia-smi, I get just “No devices were found”
so doing dmesg I found:
[ 92.485237] NVRM: GPU 0000:01:00.0: RmInitAdapter failed! (0x31:0xffff:2640)
[ 92.485415] NVRM: GPU 0000:01:00.0: rm_init_adapter failed, device minor number 0
Searching in many forums I have found this sort of error is very common, and having discarded nouveau interference, and BIOS issues, I also checked the persistence problem, but when I do
systemctl list-units --type service --all | grep nvidia
it seems the persistence service is running:
nvidia-hibernate.service loaded inactive dead NVIDIA system hibernate actions
nvidia-persistenced.service loaded active running NVIDIA Persistence Daemon
nvidia-resume.service loaded inactive dead NVIDIA system resume actions
nvidia-suspend.service loaded inactive dead NVIDIA system suspend actions
Also tried to deal with the firmware issue by means of “nvidia NVreg_EnableGpuFirmware=0”, but the problem persists.
So I ran nvidia-bug-report.sh
but I am unable to find in the report any other issue related with this “rm_init_adapter failed” that could give me a clue about what is going wrong.
nvidia-bug-report.log.gz (109.9 KB)
I have attached the report. Any Ideas/suggestions are welcome.