570 release feedback & discussion

Seems there’s no topic for 570 release yet, hope it’s ok for me to start it.

I use Debian trixie, kernel 6.12.10 on an Intel iGPU machine with RTX3090 eGPU connected to it. Here are the issues I found in 570.86.15 installed from the apt repo at Index of /compute/cuda/repos/debian12/x86_64 :

Driver crashes on eGPU unplug:
Similar as with 565 as I described here. Before unplugging I did make sure that no processes were running on the eGPU with nvidia-smi. I also double-checked that on 560.35.05 from the same repo and on 535.216.03 from Debian’s repo, this problem is not present and I can hot plug and unplug the eGPU several times without any issues when using those versions. Attached is the relevant part of the system log and nvidia-bug-report.log.gz.

X11 session running on my iGPU “attaches” to the eGPU:
Upon connecting the eGPU, my X11 session running on my iGPU somehow “attaches” to the eGPU (nvidia-smi reports it in the list of running processes) and the only way to detach is to completely close the X11 session. This defeats the purpose of eGPU where in case I need to move, I should be only required to terminate processes actually using the eGPU and be able to unplug without the need of closing ALL programs unrelated to the eGPU running in my X11 session. This problem was not present in 565 nor any other version I’ve ever tried. Below are the relevant diagnostics from the terminal, the eGPU was connected between lspci invocations:

morgwai@morgwai-xps13:~$ ps ax |grep '[X]org'
   1322 tty7     Ssl+   0:09 /usr/lib/xorg/Xorg :0 -seat seat0 -auth /var/run/lightdm/root/:0 -nolisten tcp vt7 -novtswitch
morgwai@morgwai-xps13:~$ lspci |grep VGA
00:02.0 VGA compatible controller: Intel Corporation Iris Plus Graphics 640 (rev 06)
morgwai@morgwai-xps13:~$ lspci |grep VGA
00:02.0 VGA compatible controller: Intel Corporation Iris Plus Graphics 640 (rev 06)
0b:00.0 VGA compatible controller: NVIDIA Corporation GA102 [GeForce RTX 3090] (rev a1)
morgwai@morgwai-xps13:~$ nvidia-smi 
Tue Jan 28 14:20:04 2025       
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 570.86.15              Driver Version: 570.86.15      CUDA Version: 12.8     |
|-----------------------------------------+------------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id          Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |           Memory-Usage | GPU-Util  Compute M. |
|                                         |                        |               MIG M. |
|=========================================+========================+======================|
|   0  NVIDIA GeForce RTX 3090        Off |   00000000:0B:00.0 Off |                  N/A |
|  0%   41C    P8             21W /  370W |      47MiB /  24576MiB |      0%      Default |
|                                         |                        |                  N/A |
+-----------------------------------------+------------------------+----------------------+
                                                                                         
+-----------------------------------------------------------------------------------------+
| Processes:                                                                              |
|  GPU   GI   CI              PID   Type   Process name                        GPU Memory |
|        ID   ID                                                               Usage      |
|=========================================================================================|
|    0   N/A  N/A            1322      G   /usr/lib/xorg/Xorg                        4MiB |
+-----------------------------------------------------------------------------------------+

Package nvidia-settings does not create necessary links:
The package contains nvidia-settings binary, its man page and the freedesktop menu item, but they are not usable, as all 3 files reside in /usr/lib/nvidia/current/ folder and no links are created from the appropriate folders. Fortunately this one is easy to work-around by creating the below links manually:

$ cd /usr/bin && sudo ln -s /usr/lib/nvidia/current/nvidia-settings
$ cd /usr/share/applications && sudo ln -s /usr/lib/nvidia/current/nvidia-settings.desktop
$ cd /usr/share/man/man1 && sudo ln -s /usr/lib/nvidia/current/nvidia-settings.1.gz

Hope someone is able to make use of all this info, please let me know if some more data is needed.

journal.txt (37.2 KB)
nvidia-bug-report.log.gz (358.6 KB)

We’re back…

570.86.16 got released as “beta” driver

https://uk.download.nvidia.com/XFree86/Linux-x86_64/570.86.16/NVIDIA-Linux-x86_64-570.86.16.run

My settings:
EndeavourOS
3070 ti
12600k

Kernel:
6.13.0-zen1-1-zen
Driver:
570.86.16
(Currently on the prop. nvidia kernel)

The things I found:

nvidia-smi -q -d “VOLTAGE”
Doesn’t get any information about the card voltage usage. On 565 I could see here if my “undervolting” settings works, now I have to hear the coil whine if its working or not. (okok, the UI also let me see the offset I set, but no value here is a bit of a boomer, but not a showstopper! The VRR thing is too nice to miss!)

Also CS2 getting spammed full of messages into the dev console:

“[RenderSystem] vkQueuePresentKHR failed with error 1000001003”
While the game itself works, but its getting kinda unusable to read any other messages from the console - 565 was fine here.

and + the open kernel module seems to be bit laggy on games, 1% low is much more likely on the open instead of the proprietary one. The proprietary FPS is fine, thx for the update!

Neither the GDM flip timeout bug or the high VRAM usage was fixed. Nice.

1 Like

Still having crashes from VK_KHR_present_wait being bugged as seen in this gamescope issue.

Also something is iffy with this VRR support. On Windows, my OLED will flicker, as OLED is wont to do, but under KDE my refreshrate fluctuates MUCH more

Quick Cell-phone video of me running BG3 in gamescope that shows the problem.

2 Likes

Driver is amazing i have zero issues so far and VRR on all 3 monitors is working great thanks for all the hard work and time!

1 Like

change your user name lol it no longer applies

1 Like

black screen over hdmi 2.1 with vrr enabled in kde and a fullscreen game is opened, setting vrr to nerver is a workaround to the problem.
verified that vrr does work with display port
lowering resolution and refresh rate doesn’t change anything
compiled the 570 drivers from the rpmfusion sources
nvidia-bug-report.log.gz (578.1 KB)

1 Like

System doesn’t boot properly on battery. This is really weird, and also happened with driver 565. I’ll try to summarize below:

  • If I try to boot the system on battery, with no external display plugged to the HDMI port, internal display remains black and never recovers; system won’t boot and seems to be unresponsive (keyboard backlight does work, though).
  • If I plug an external display to the HDMI port, I can see console messages on the external display only, internal display remains black. Still, system won’t finish the boot process, as above.
  • If I boot with the charger plugged, boot proceeds normally (both displays are activated as expected)

nvidia-bug-report.log.gz (472.1 KB)

EDIT: solved by setting nvidia-drm fbdev=0 as suggested here.

Those issues i don’t have @andre.ocosta. Tested on 2 notebooks.

MSI Crosshair 15 B12UGSZ and Acer Predator G9-793-71SJ

2 Likes

this bug is also not fixed:

1 Like

Same black screen in Hyprland with VRR and X11 with Option “metamodes” “5120x1440_240 +0+0 {AllowGSYNCCompatible=On}”. Worked without issue on 565.77
RTX 3080
nvidia-bug-report.log.gz (332.1 KB)

1 Like

The nvidia driver now by default wants to drive the fb console
add options nvidia-drm fbdev=0 in /etc/modprobe.d/nvidia.conf
(regenerate your initramfs and reboot to apply the setting)

2 Likes

Oh boy, suspend still not working 🙄 System log shows additional entries now, though:

jan 30 19:52:11 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: pEntries != NULL @ gmmu_walk.c:881
jan 30 19:52:11 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: progress == indexHi_tmp - indexLo_tmp + 1 @ mmu_walk.c:1092
jan 30 19:52:11 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == status @ mmu_walk.c:488
jan 30 19:52:11 fedoracosta kernel: NVRM: mmuWalkSparsify: Failed to sparsify VA Range 0xa80000 to 0xabffff. Status = 0x00000040
jan 30 19:52:11 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: 0 @ mmu_walk_sparse.c:74
jan 30 19:52:11 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: pEntries != NULL @ gmmu_walk.c:881
jan 30 19:52:11 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: progress == indexHi_tmp - indexLo_tmp + 1 @ mmu_walk.c:1092
jan 30 19:52:11 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == status @ mmu_walk.c:488
jan 30 19:52:11 fedoracosta kernel: NVRM: mmuWalkUnmap: Failed to unmap VA Range 0xa80000 to 0xabffff. Status = 0x00000040
jan 30 19:52:11 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: 0 @ mmu_walk_unmap.c:65
jan 30 19:52:11 fedoracosta kernel: NVRM: mmuWalkSparsify: Unmap failed with status = 0x00000040
jan 30 19:52:11 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == unmapStatus @ mmu_walk_sparse.c:85
jan 30 19:52:11 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: pEntries != NULL @ gmmu_walk.c:881
jan 30 19:52:11 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: progress == indexHi_tmp - indexLo_tmp + 1 @ mmu_walk.c:1092
jan 30 19:52:11 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == status @ mmu_walk.c:488
jan 30 19:52:11 fedoracosta kernel: NVRM: mmuWalkUnmap: Failed to unmap VA Range 0xa80000 to 0xabffff. Status = 0x00000040
jan 30 19:52:11 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: 0 @ mmu_walk_unmap.c:65
jan 30 19:52:11 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: (pKernelBus->pReadToFlush != NULL || pKernelBus->virtualBar2[GPU_GFID_PF].pCpuMapping != NULL) @ kern_bus_gv100.c:388
jan 30 19:52:11 fedoracosta systemd[1]: nvidia-suspend.service: Deactivated successfully.
jan 30 19:52:11 fedoracosta systemd[1]: Finished nvidia-suspend.service - NVIDIA system suspend actions.
...
jan 30 19:52:11 fedoracosta systemd[1]: Starting systemd-suspend.service - System Suspend...
jan 30 19:52:11 fedoracosta systemd-sleep[16424]: User sessions remain unfrozen on explicit request ($SYSTEMD_SLEEP_FREEZE_USER_SESSIONS=0).
jan 30 19:52:11 fedoracosta systemd-sleep[16424]: This is not recommended, and might result in unexpected behavior, particularly
jan 30 19:52:11 fedoracosta systemd-sleep[16424]: in suspend-then-hibernate operations or setups with encrypted home directories.
jan 30 19:52:11 fedoracosta systemd-sleep[16424]: Performing sleep operation 'suspend'...
jan 30 19:52:11 fedoracosta kernel: PM: suspend entry (deep)
jan 30 19:52:11 fedoracosta kernel: Filesystems sync: 0.046 seconds
jan 30 19:52:31 fedoracosta kernel: Freezing user space processes
jan 30 19:52:31 fedoracosta kernel: NVRM: dmaAllocMapping_GM107: can't alloc VA space for mapping.
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: pEntries != NULL @ gmmu_walk.c:881
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: progress == indexHi_tmp - indexLo_tmp + 1 @ mmu_walk.c:1092
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == status @ mmu_walk.c:488
jan 30 19:52:31 fedoracosta kernel: NVRM: mmuWalkSparsify: Failed to sparsify VA Range 0xac0000 to 0xadffff. Status = 0x00000040
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: 0 @ mmu_walk_sparse.c:74
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: pEntries != NULL @ gmmu_walk.c:881
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: progress == indexHi_tmp - indexLo_tmp + 1 @ mmu_walk.c:1092
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == status @ mmu_walk.c:488
jan 30 19:52:31 fedoracosta kernel: NVRM: mmuWalkUnmap: Failed to unmap VA Range 0xac0000 to 0xadffff. Status = 0x00000040
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: 0 @ mmu_walk_unmap.c:65
jan 30 19:52:31 fedoracosta kernel: NVRM: mmuWalkSparsify: Unmap failed with status = 0x00000040
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == unmapStatus @ mmu_walk_sparse.c:85
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: pEntries != NULL @ gmmu_walk.c:881
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: progress == indexHi_tmp - indexLo_tmp + 1 @ mmu_walk.c:1092
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == status @ mmu_walk.c:488
jan 30 19:52:31 fedoracosta kernel: NVRM: mmuWalkUnmap: Failed to unmap VA Range 0xac0000 to 0xadffff. Status = 0x00000040
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: 0 @ mmu_walk_unmap.c:65
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: (pKernelBus->pReadToFlush != NULL || pKernelBus->virtualBar2[GPU_GFID_PF].pCpuMapping != NULL) @ kern_bus_gv100.c:388
jan 30 19:52:31 fedoracosta kernel: NVRM: dmaAllocMapping_GM107: can't alloc VA space for mapping.
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: pEntries != NULL @ gmmu_walk.c:881
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: progress == indexHi_tmp - indexLo_tmp + 1 @ mmu_walk.c:1092
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == status @ mmu_walk.c:488
jan 30 19:52:31 fedoracosta kernel: NVRM: mmuWalkSparsify: Failed to sparsify VA Range 0xae0000 to 0xaeffff. Status = 0x00000040
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: 0 @ mmu_walk_sparse.c:74
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: pEntries != NULL @ gmmu_walk.c:881
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: progress == indexHi_tmp - indexLo_tmp + 1 @ mmu_walk.c:1092
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == status @ mmu_walk.c:488
jan 30 19:52:31 fedoracosta kernel: NVRM: mmuWalkUnmap: Failed to unmap VA Range 0xae0000 to 0xaeffff. Status = 0x00000040
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: 0 @ mmu_walk_unmap.c:65
jan 30 19:52:31 fedoracosta kernel: NVRM: mmuWalkSparsify: Unmap failed with status = 0x00000040
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == unmapStatus @ mmu_walk_sparse.c:85
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: pEntries != NULL @ gmmu_walk.c:881
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: progress == indexHi_tmp - indexLo_tmp + 1 @ mmu_walk.c:1092
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == status @ mmu_walk.c:488
jan 30 19:52:31 fedoracosta kernel: NVRM: mmuWalkUnmap: Failed to unmap VA Range 0xae0000 to 0xaeffff. Status = 0x00000040
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: 0 @ mmu_walk_unmap.c:65
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: (pKernelBus->pReadToFlush != NULL || pKernelBus->virtualBar2[GPU_GFID_PF].pCpuMapping != NULL) @ kern_bus_gv100.c:388
jan 30 19:52:31 fedoracosta kernel: NVRM: dmaAllocMapping_GM107: can't alloc VA space for mapping.
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: pEntries != NULL @ gmmu_walk.c:881
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: progress == indexHi_tmp - indexLo_tmp + 1 @ mmu_walk.c:1092
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == status @ mmu_walk.c:488
jan 30 19:52:31 fedoracosta kernel: NVRM: mmuWalkSparsify: Failed to sparsify VA Range 0xaf0000 to 0xafffff. Status = 0x00000040
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: 0 @ mmu_walk_sparse.c:74
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: pEntries != NULL @ gmmu_walk.c:881
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: progress == indexHi_tmp - indexLo_tmp + 1 @ mmu_walk.c:1092
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == status @ mmu_walk.c:488
jan 30 19:52:31 fedoracosta kernel: NVRM: mmuWalkUnmap: Failed to unmap VA Range 0xaf0000 to 0xafffff. Status = 0x00000040
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: 0 @ mmu_walk_unmap.c:65
jan 30 19:52:31 fedoracosta kernel: NVRM: mmuWalkSparsify: Unmap failed with status = 0x00000040
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == unmapStatus @ mmu_walk_sparse.c:85
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: pEntries != NULL @ gmmu_walk.c:881
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: progress == indexHi_tmp - indexLo_tmp + 1 @ mmu_walk.c:1092
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == status @ mmu_walk.c:488
jan 30 19:52:31 fedoracosta kernel: NVRM: mmuWalkUnmap: Failed to unmap VA Range 0xaf0000 to 0xafffff. Status = 0x00000040
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: 0 @ mmu_walk_unmap.c:65
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: (pKernelBus->pReadToFlush != NULL || pKernelBus->virtualBar2[GPU_GFID_PF].pCpuMapping != NULL) @ kern_bus_gv100.c:388
jan 30 19:52:31 fedoracosta kernel: NVRM: dmaAllocMapping_GM107: can't alloc VA space for mapping.
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: pEntries != NULL @ gmmu_walk.c:881
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: progress == indexHi_tmp - indexLo_tmp + 1 @ mmu_walk.c:1092
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == status @ mmu_walk.c:488
jan 30 19:52:31 fedoracosta kernel: NVRM: mmuWalkSparsify: Failed to sparsify VA Range 0xb00000 to 0xb0ffff. Status = 0x00000040
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: 0 @ mmu_walk_sparse.c:74
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: pEntries != NULL @ gmmu_walk.c:881
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: progress == indexHi_tmp - indexLo_tmp + 1 @ mmu_walk.c:1092
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == status @ mmu_walk.c:488
jan 30 19:52:31 fedoracosta kernel: NVRM: mmuWalkUnmap: Failed to unmap VA Range 0xb00000 to 0xb0ffff. Status = 0x00000040
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: 0 @ mmu_walk_unmap.c:65
jan 30 19:52:31 fedoracosta kernel: NVRM: mmuWalkSparsify: Unmap failed with status = 0x00000040
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == unmapStatus @ mmu_walk_sparse.c:85
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: pEntries != NULL @ gmmu_walk.c:881
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: progress == indexHi_tmp - indexLo_tmp + 1 @ mmu_walk.c:1092
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == status @ mmu_walk.c:488
jan 30 19:52:31 fedoracosta kernel: NVRM: mmuWalkUnmap: Failed to unmap VA Range 0xb00000 to 0xb0ffff. Status = 0x00000040
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: 0 @ mmu_walk_unmap.c:65
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: (pKernelBus->pReadToFlush != NULL || pKernelBus->virtualBar2[GPU_GFID_PF].pCpuMapping != NULL) @ kern_bus_gv100.c:388
jan 30 19:52:31 fedoracosta kernel: NVRM: dmaAllocMapping_GM107: can't alloc VA space for mapping.
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: pEntries != NULL @ gmmu_walk.c:881
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: progress == indexHi_tmp - indexLo_tmp + 1 @ mmu_walk.c:1092
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == status @ mmu_walk.c:488
jan 30 19:52:31 fedoracosta kernel: NVRM: mmuWalkSparsify: Failed to sparsify VA Range 0xb10000 to 0xb1ffff. Status = 0x00000040
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: 0 @ mmu_walk_sparse.c:74
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: pEntries != NULL @ gmmu_walk.c:881
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: progress == indexHi_tmp - indexLo_tmp + 1 @ mmu_walk.c:1092
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == status @ mmu_walk.c:488
jan 30 19:52:31 fedoracosta kernel: NVRM: mmuWalkUnmap: Failed to unmap VA Range 0xb10000 to 0xb1ffff. Status = 0x00000040
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: 0 @ mmu_walk_unmap.c:65
jan 30 19:52:31 fedoracosta kernel: NVRM: mmuWalkSparsify: Unmap failed with status = 0x00000040
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == unmapStatus @ mmu_walk_sparse.c:85
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: pEntries != NULL @ gmmu_walk.c:881
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: progress == indexHi_tmp - indexLo_tmp + 1 @ mmu_walk.c:1092
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == status @ mmu_walk.c:488
jan 30 19:52:31 fedoracosta kernel: NVRM: mmuWalkUnmap: Failed to unmap VA Range 0xb10000 to 0xb1ffff. Status = 0x00000040
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: 0 @ mmu_walk_unmap.c:65
jan 30 19:52:31 fedoracosta kernel: NVRM: nvAssertFailedNoLog: Assertion failed: (pKernelBus->pReadToFlush != NULL || pKernelBus->virtualBar2[GPU_GFID_PF].pCpuMapping != NULL) @ kern_bus_gv100.c:388
jan 30 19:52:31 fedoracosta kernel: Freezing user space processes failed after 20.007 seconds (1 tasks refusing to freeze, wq_busy=0):
jan 30 19:52:31 fedoracosta kernel: task:gnome-shell     state:R  running task     stack:0     pid:2382  tgid:2382  ppid:2221   flags:0x0000400e
jan 30 19:52:31 fedoracosta kernel: Call Trace:
jan 30 19:52:31 fedoracosta kernel:  <TASK>
jan 30 19:52:31 fedoracosta kernel:  ? gvaspaceAlloc_IMPL+0x861/0x19b0 [nvidia]
jan 30 19:52:31 fedoracosta kernel:  ? srso_return_thunk+0x5/0x5f
jan 30 19:52:31 fedoracosta kernel:  ? memmgrFillComprInfoUncompressed_IMPL+0x40/0xa0 [nvidia]
jan 30 19:52:31 fedoracosta kernel:  ? memmgrGetKindComprForGpu_KERNEL+0x8e/0x130 [nvidia]
jan 30 19:52:31 fedoracosta kernel:  ? dmaAllocMapping_GM107+0xdbd/0x1cf0 [nvidia]
jan 30 19:52:31 fedoracosta kernel:  ? _kbusMapAperture_GM107+0x161/0x2c0 [nvidia]
jan 30 19:52:31 fedoracosta kernel:  ? srso_return_thunk+0x5/0x5f
jan 30 19:52:31 fedoracosta kernel:  ? _portMemAllocNonPagedUntracked+0x25/0x50 [nvidia]
jan 30 19:52:31 fedoracosta kernel:  ? srso_return_thunk+0x5/0x5f
jan 30 19:52:31 fedoracosta kernel:  ? kbusMapFbAperture_GM107+0x1da/0x520 [nvidia]
jan 30 19:52:31 fedoracosta kernel:  ? kbusMapFbAperture_GM107+0x182/0x520 [nvidia]
jan 30 19:52:31 fedoracosta kernel:  ? memMap_IMPL+0x6f6/0xe70 [nvidia]
jan 30 19:52:31 fedoracosta kernel:  ? os_alloc_mem+0xc0/0xd0 [nvidia]
jan 30 19:52:31 fedoracosta kernel:  ? srso_return_thunk+0x5/0x5f
jan 30 19:52:31 fedoracosta kernel:  ? serverMap+0x30b/0x480 [nvidia]
jan 30 19:52:31 fedoracosta kernel:  ? srso_return_thunk+0x5/0x5f
jan 30 19:52:31 fedoracosta kernel:  ? rmapiMapToCpuWithSecInfoV2+0x1a6/0x2b0 [nvidia]
jan 30 19:52:31 fedoracosta kernel:  ? srso_return_thunk+0x5/0x5f
jan 30 19:52:31 fedoracosta kernel:  ? rmapiMapToCpuWithSecInfoTls+0x66/0x90 [nvidia]
jan 30 19:52:31 fedoracosta kernel:  ? rmapiMapToCpuWithSecInfoTls+0x3c/0x90 [nvidia]
jan 30 19:52:31 fedoracosta kernel:  ? _nv04MapMemory.constprop.0+0x72/0xb0 [nvidia]
jan 30 19:52:31 fedoracosta kernel:  ? _nv04MapMemory.constprop.0+0x32/0xb0 [nvidia]
jan 30 19:52:31 fedoracosta kernel:  ? rm_kernel_rmapi_op+0x8c/0x1dc [nvidia]
jan 30 19:52:31 fedoracosta kernel:  ? nvkms_call_rm+0x4d/0x80 [nvidia_modeset]
jan 30 19:52:31 fedoracosta kernel:  ? nvRmApiMapMemory+0x4d/0x60 [nvidia_modeset]
jan 30 19:52:31 fedoracosta kernel:  ? MapMemory+0xac/0x110 [nvidia_modeset]
jan 30 19:52:31 fedoracosta kernel:  ? MapMemory+0x85/0x110 [nvidia_modeset]
jan 30 19:52:31 fedoracosta kernel:  ? __nv_drm_gem_nvkms_map+0x6f/0xd0 [nvidia_drm]
jan 30 19:52:31 fedoracosta kernel:  ? __nv_drm_gem_nvkms_mmap+0x16/0x40 [nvidia_drm]
jan 30 19:52:31 fedoracosta kernel:  ? nv_drm_mmap+0xdd/0x160 [nvidia_drm]
jan 30 19:52:31 fedoracosta kernel:  ? __mmap_region+0x748/0xb10
jan 30 19:52:31 fedoracosta kernel:  ? mmap_region+0x78/0xa0
jan 30 19:52:31 fedoracosta kernel:  ? do_mmap+0x499/0x690
jan 30 19:52:31 fedoracosta kernel:  ? srso_return_thunk+0x5/0x5f
jan 30 19:52:31 fedoracosta kernel:  ? vm_mmap_pgoff+0xec/0x1c0
jan 30 19:52:31 fedoracosta kernel:  ? __pfx_nv_drm_gem_alloc_nvkms_memory_ioctl+0x10/0x10 [nvidia_drm]
jan 30 19:52:31 fedoracosta kernel:  ? ksys_mmap_pgoff+0x14b/0x220
jan 30 19:52:31 fedoracosta kernel:  ? do_syscall_64+0x82/0x160
jan 30 19:52:31 fedoracosta kernel:  ? srso_return_thunk+0x5/0x5f
jan 30 19:52:31 fedoracosta kernel:  ? objects_lookup+0xa1/0xd0
jan 30 19:52:31 fedoracosta kernel:  ? srso_return_thunk+0x5/0x5f
jan 30 19:52:31 fedoracosta kernel:  ? drm_vma_offset_add+0x33/0x70
jan 30 19:52:31 fedoracosta kernel:  ? srso_return_thunk+0x5/0x5f
jan 30 19:52:31 fedoracosta kernel:  ? __nv_drm_gem_map_nvkms_memory_offset+0x1d/0x70 [nvidia_drm]
jan 30 19:52:31 fedoracosta kernel:  ? srso_return_thunk+0x5/0x5f
jan 30 19:52:31 fedoracosta kernel:  ? nv_drm_gem_map_offset_ioctl+0x4c/0xd0 [nvidia_drm]
jan 30 19:52:31 fedoracosta kernel:  ? srso_return_thunk+0x5/0x5f
jan 30 19:52:31 fedoracosta kernel:  ? srso_return_thunk+0x5/0x5f
jan 30 19:52:31 fedoracosta kernel:  ? __check_object_size+0x58/0x230
jan 30 19:52:31 fedoracosta kernel:  ? srso_return_thunk+0x5/0x5f
jan 30 19:52:31 fedoracosta kernel:  ? srso_return_thunk+0x5/0x5f
jan 30 19:52:31 fedoracosta kernel:  ? drm_ioctl+0x2b7/0x540
jan 30 19:52:31 fedoracosta kernel:  ? __pfx_nv_drm_gem_map_offset_ioctl+0x10/0x10 [nvidia_drm]
jan 30 19:52:31 fedoracosta kernel:  ? srso_return_thunk+0x5/0x5f
jan 30 19:52:31 fedoracosta kernel:  ? syscall_exit_to_user_mode+0x10/0x210
jan 30 19:52:31 fedoracosta kernel:  ? srso_return_thunk+0x5/0x5f
jan 30 19:52:31 fedoracosta kernel:  ? do_syscall_64+0x8e/0x160
jan 30 19:52:31 fedoracosta kernel:  ? srso_return_thunk+0x5/0x5f
jan 30 19:52:31 fedoracosta kernel:  ? srso_return_thunk+0x5/0x5f
jan 30 19:52:31 fedoracosta kernel:  ? drm_ioctl+0x2b7/0x540
jan 30 19:52:31 fedoracosta kernel:  ? __pfx_nv_drm_get_drm_file_unique_id_ioctl+0x10/0x10 [nvidia_drm]
jan 30 19:52:31 fedoracosta kernel:  ? srso_return_thunk+0x5/0x5f
jan 30 19:52:31 fedoracosta kernel:  ? syscall_exit_to_user_mode+0x10/0x210
jan 30 19:52:31 fedoracosta kernel:  ? srso_return_thunk+0x5/0x5f
jan 30 19:52:31 fedoracosta kernel:  ? do_syscall_64+0x8e/0x160
jan 30 19:52:31 fedoracosta kernel:  ? srso_return_thunk+0x5/0x5f
jan 30 19:52:31 fedoracosta kernel:  ? exc_page_fault+0x7e/0x180
jan 30 19:52:31 fedoracosta kernel:  ? entry_SYSCALL_64_after_hwframe+0x76/0x7e
jan 30 19:52:31 fedoracosta kernel:  </TASK>
jan 30 19:52:31 fedoracosta kernel: OOM killer enabled.
jan 30 19:52:31 fedoracosta kernel: Restarting tasks ... done.
jan 30 19:52:31 fedoracosta kernel: random: crng reseeded on system resumption
jan 30 19:52:31 fedoracosta kernel: PM: suspend exit
jan 30 19:52:31 fedoracosta kernel: PM: suspend entry (s2idle)
jan 30 19:52:31 fedoracosta rtkit-daemon[1126]: The canary thread is apparently starving. Taking action.
...

nvidia-bug-report.log.gz (472.1 KB)

add the nvidia-bug-report file too

There is on VRR on LG C4 screen which is HDMI2.1

No Support for VRR

Thanks @mercier.nicolas ! That indeed fixed the problem. It does not (see latest update here).

1 Like

GPU: GeForce RTX 5090 D
OS: Ubuntu 24.04.1 LTS (GNU/Linux 6.12.3-061203-generic x86_64)
Driver: 570.86.16

lspci -k | grep -EA3 ‘VGA|3D|Display’

01:00.0 VGA compatible controller: NVIDIA Corporation Device 2b87 (rev a1)
Subsystem: Shenzhen Colorful Yugong Technology and Development Co. Device 1400
Kernel driver in use: nvidia
Kernel modules: nvidiafb, nouveau, nvidia_drm, nvidia

nvidia-smi
No devices were found

cat /proc/driver/nvidia/version

NVRM version: NVIDIA UNIX x86_64 Kernel Module 570.86.16 Fri Jan 24 21:25:51 UTC 2025
GCC version: gcc version 14.2.0 (GCC)

uname -a
Linux u24 6.12.3-061203-generic #202412060638 SMP PREEMPT_DYNAMIC Fri Dec 6 07:08:51 UTC 2024 x86_64 x86_64 x86_64 GNU/Linux

So weird! My acer notebook has zero issues with suspend. It is working as it should.

1 Like