550.78 release feedback & discussion thread

Building the open driver, I’ve been seeing a lot of warnings of the form:

…/open-gpu-kernel-modules/kernel-open/nvidia.o: warning: objtool: __nvoc_ctor_AccessCounterBuffer+0x3c: 'naked' return found in MITIGATION_RETHUNK build

$ fgrep "'naked' return found in MITIGATION_RETHUNK build" nohup.out | wc -l
22768
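
In case a per-function breakdown is more useful than the raw count, here is a rough sketch that tallies the warnings by symbol. It assumes the build log is the nohup.out file from the command above and that every warning has the same shape as the sample line; the names `tally_naked_returns` and `report` are made up for illustration.

```python
import re
from collections import Counter

# Matches objtool lines of the form:
#   ...nvidia.o: warning: objtool: SYMBOL+0x3c: 'naked' return found in MITIGATION_RETHUNK build
# The "." on each side of "naked" accepts straight or typographic quotes.
NAKED_RETURN = re.compile(
    r"objtool: (?P<symbol>\S+?)\+0x[0-9a-fA-F]+: "
    r".naked. return found in MITIGATION_RETHUNK build"
)


def tally_naked_returns(lines):
    """Count 'naked return' warnings per function symbol."""
    counts = Counter()
    for line in lines:
        match = NAKED_RETURN.search(line)
        if match:
            counts[match.group("symbol")] += 1
    return counts


def report(path="nohup.out", top=10):
    """Print the total and the functions with the most warnings."""
    with open(path, errors="replace") as log:
        counts = tally_naked_returns(log)
    print(sum(counts.values()), "warnings in", len(counts), "functions")
    for symbol, n in counts.most_common(top):
        print(f"{n:6d}  {symbol}")
```

Calling `report()` in the directory that holds the log prints the total followed by the ten functions with the most warnings.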

Thought the report might be appreciated. Driver is running great with Linux 6.9.3.

[ 198.856] (II) NVIDIA(0): NVIDIA GPU NVIDIA GeForce RTX 3090 Ti (GA102-A)

…on a System76 Thelio Mega 1.1

-Scott

The fan on my GPU was running at maximum speed. The problem also occurred with an older driver version. “sensors” shows only “0” values. The problem went away after I removed the GPU and plugged it in again. Is anything visible in the error report?
sensors

nvidia-bug-report.log.gz (486.2 KB)

I was just monitoring the systemd journal to debug a network issue and found this:

jun 26 20:18:03 fedoracosta kernel: ==================================================================
jun 26 20:18:03 fedoracosta kernel: BUG: KFENCE: use-after-free read in nv_dma_release_sgt+0x29/0x70 [nvidia]
jun 26 20:18:03 fedoracosta kernel: Use-after-free read at 0x0000000086b3798c (in kfence-#213):
jun 26 20:18:03 fedoracosta kernel:  nv_dma_release_sgt+0x29/0x70 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  _nv041508rm+0x67/0xd0 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  _nv036719rm+0xca/0x430 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  _nv036684rm+0x6b/0x130 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  _nv002833rm+0xd/0x20 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  _nv004576rm+0x1e/0xb0 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  _nv017132rm+0x59c/0x680 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  _nv045497rm+0xab/0xe0 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  _nv047214rm+0xb3/0x180 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  _nv047213rm+0x3e5/0x690 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  _nv045395rm+0xdd/0x180 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  _nv045396rm+0x41/0x70 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  _nv000571rm+0x4d/0x60 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  _nv000731rm+0x1b7/0xeb0 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  rm_ioctl+0x58/0xb0 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  nvidia_unlocked_ioctl+0x53b/0x8d0 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  __x64_sys_ioctl+0x97/0xd0
jun 26 20:18:03 fedoracosta kernel:  do_syscall_64+0x82/0x160
jun 26 20:18:03 fedoracosta kernel:  entry_SYSCALL_64_after_hwframe+0x76/0x7e
jun 26 20:18:03 fedoracosta kernel: 
jun 26 20:18:03 fedoracosta kernel: kfence-#213: 0x0000000079832168-0x000000002048ad56, size=384, cache=kmalloc-rnd-13-512
jun 26 20:18:03 fedoracosta kernel: allocated by task 3044 on cpu 15 at 10913.895451s:
jun 26 20:18:03 fedoracosta kernel:  nv_drm_gem_prime_import_sg_table+0x2d/0xb0 [nvidia_drm]
jun 26 20:18:03 fedoracosta kernel:  drm_gem_prime_import_dev+0x87/0x140
jun 26 20:18:03 fedoracosta kernel:  drm_gem_prime_fd_to_handle+0xea/0x1f0
jun 26 20:18:03 fedoracosta kernel:  drm_ioctl_kernel+0xb3/0x100
jun 26 20:18:03 fedoracosta kernel:  drm_ioctl+0x28b/0x540
jun 26 20:18:03 fedoracosta kernel:  __x64_sys_ioctl+0x97/0xd0
jun 26 20:18:03 fedoracosta kernel:  do_syscall_64+0x82/0x160
jun 26 20:18:03 fedoracosta kernel:  entry_SYSCALL_64_after_hwframe+0x76/0x7e
jun 26 20:18:03 fedoracosta kernel: 
jun 26 20:18:03 fedoracosta kernel: freed by task 3044 on cpu 6 at 10923.550179s:
jun 26 20:18:03 fedoracosta kernel:  nv_dma_release_sgt+0x49/0x70 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  _nv041508rm+0x67/0xd0 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  _nv036719rm+0xca/0x430 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  _nv036684rm+0x6b/0x130 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  _nv002833rm+0xd/0x20 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  _nv004576rm+0x1e/0xb0 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  _nv017132rm+0x59c/0x680 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  _nv045497rm+0xab/0xe0 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  _nv047214rm+0xb3/0x180 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  _nv047213rm+0x3e5/0x690 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  _nv045395rm+0xdd/0x180 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  _nv045396rm+0x41/0x70 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  _nv000571rm+0x4d/0x60 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  _nv000731rm+0x1b7/0xeb0 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  rm_ioctl+0x58/0xb0 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  nvidia_unlocked_ioctl+0x53b/0x8d0 [nvidia]
jun 26 20:18:03 fedoracosta kernel:  __x64_sys_ioctl+0x97/0xd0
jun 26 20:18:03 fedoracosta kernel:  do_syscall_64+0x82/0x160
jun 26 20:18:03 fedoracosta kernel:  entry_SYSCALL_64_after_hwframe+0x76/0x7e
jun 26 20:18:03 fedoracosta kernel: 
jun 26 20:18:03 fedoracosta kernel: CPU: 6 PID: 3044 Comm: gnome-shell Tainted: P    B      OE      6.9.5-200.fc40.x86_64 #1
jun 26 20:18:03 fedoracosta kernel: Hardware name: Acer Nitro AN515-44/Stonic_RNS, BIOS V1.04 02/04/2021
jun 26 20:18:03 fedoracosta kernel: ==================================================================

The system is an Acer Nitro AN515-44 running Fedora 40, GNOME 46, driver 550.90.07. I have an external monitor plugged into the HDMI port, which is wired to the GTX 1650 dGPU.

This has happened 12 times so far today (the computer was in sleep mode from around 5pm until around 8pm):

❯ journalctl -r | grep "jun 26.*BUG: KFENCE: use-after-free read in"
jun 26 20:57:03 fedoracosta kernel: BUG: KFENCE: use-after-free read in nv_dma_release_sgt+0x29/0x70 [nvidia]
jun 26 20:43:03 fedoracosta kernel: BUG: KFENCE: use-after-free read in nv_dma_release_sgt+0x29/0x70 [nvidia]
jun 26 20:38:09 fedoracosta kernel: BUG: KFENCE: use-after-free read in nv_dma_release_sgt+0x29/0x70 [nvidia]
jun 26 20:35:11 fedoracosta kernel: BUG: KFENCE: use-after-free read in nv_dma_release_sgt+0x29/0x70 [nvidia]
jun 26 20:27:32 fedoracosta kernel: BUG: KFENCE: use-after-free read in nv_dma_release_sgt+0x29/0x70 [nvidia]
jun 26 20:18:03 fedoracosta kernel: BUG: KFENCE: use-after-free read in nv_dma_release_sgt+0x29/0x70 [nvidia]
jun 26 16:29:45 fedoracosta kernel: BUG: KFENCE: use-after-free read in nv_dma_release_sgt+0x29/0x70 [nvidia]
jun 26 16:25:21 fedoracosta kernel: BUG: KFENCE: use-after-free read in nv_dma_release_sgt+0x29/0x70 [nvidia]
jun 26 16:23:08 fedoracosta kernel: BUG: KFENCE: use-after-free read in nv_dma_release_sgt+0x29/0x70 [nvidia]
jun 26 16:22:39 fedoracosta kernel: BUG: KFENCE: use-after-free read in nv_dma_release_sgt+0x29/0x70 [nvidia]
jun 26 16:12:34 fedoracosta kernel: BUG: KFENCE: use-after-free read in nv_dma_release_sgt+0x29/0x70 [nvidia]
jun 26 16:11:24 fedoracosta kernel: BUG: KFENCE: use-after-free read in nv_dma_release_sgt+0x29/0x70 [nvidia]
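
To make the gap around the sleep period easier to see, here is a small sketch that groups the KFENCE reports by hour. It assumes journalctl's default short output format as shown above (month, day, time, host, then "kernel:"); the function name `kfence_reports_by_hour` is made up for illustration.

```python
import re
from collections import Counter

# journalctl short format: "<month> <day> HH:MM:SS <host> kernel: <message>"
# The month is matched loosely because it is locale-dependent ("jun" here).
KFENCE_LINE = re.compile(
    r"^\S+ +\d+ (?P<hour>\d{2}):\d{2}:\d{2} \S+ kernel: BUG: KFENCE: "
)


def kfence_reports_by_hour(lines):
    """Count 'BUG: KFENCE' header lines per hour of day."""
    by_hour = Counter()
    for line in lines:
        match = KFENCE_LINE.match(line.strip())
        if match:
            by_hour[int(match.group("hour"))] += 1
    return by_hour
```

Fed the 12 lines listed above, this gives 6 reports in the 16:00 hour and 6 in the 20:00 hour, with none in between while the machine was asleep.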