Nvidia 550 Linux driver ACPI error (kernels 6.5, 6.7 and 6.8)

Hi,

I’m using Ubuntu 22.04 with the Nvidia 550 drivers from the Nvidia apt repository.

I get the following error a few minutes after boot:

[  989.890229] BUG: unable to handle page fault for address: ffff89bac4855fe8
[  989.890234] #PF: supervisor write access in kernel mode
[  989.890236] #PF: error_code(0x0003) - permissions violation
[  989.890237] PGD 664e01067 P4D 664e01067 PUD 101f44063 PMD 177ba6063 PTE 8000000104855121
[  989.890240] Oops: 0003 [#1] PREEMPT SMP NOPTI
[  989.890242] CPU: 6 PID: 54 Comm: kworker/6:0 Tainted: P           OE      6.8.4-i7 #1
[  989.890244] Hardware name: LENOVO 20Y4S3Y10Q/20Y4S3Y10Q, BIOS N40ET44W (1.26 ) 11/08/2023
[  989.890245] Workqueue: kacpi_notify acpi_os_execute_deferred
[  989.890249] RIP: 0010:_nv044009rm+0x10/0x30 [nvidia]
[  989.890620] Code: 00 00 00 00 00 0f 1f 44 00 00 f3 0f 1e fa c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa 48 83 ec 08 48 83 ed 10 48 8d 7d 08 <48> c7 45 08 00 00 00 00 e8 e3 4f 6f ff 48 8b 45 08 48 83 c4 08 48
[  989.890622] RSP: 0018:ffffa263c02e7d20 EFLAGS: 00010282
[  989.890624] RAX: 0000000000000000 RBX: ffffa263c7ba68e8 RCX: 0000000000000000
[  989.890625] RDX: ffff89bac4f86c88 RSI: 00000000000000c0 RDI: ffff89bac4855fe8
[  989.890626] RBP: ffff89bac4855fe0 R08: 0000000000000000 R09: 0000000000000000
[  989.890627] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000004
[  989.890627] R13: 0000000000000000 R14: ffffa263c7b68008 R15: ffff89baf7568008
[  989.890628] FS:  0000000000000000(0000) GS:ffff89c1ff580000(0000) knlGS:0000000000000000
[  989.890629] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  989.890630] CR2: ffff89bac4855fe8 CR3: 0000000663e2e006 CR4: 0000000000770ef0
[  989.890632] PKRU: 55555554
[  989.890632] Call Trace:
[  989.890634]  <TASK>
[  989.890636]  ? __die+0x1b/0x70
[  989.890640]  ? page_fault_oops+0x14c/0x4c0
[  989.890642]  ? fixup_exception+0x1d/0x370
[  989.890645]  ? exc_page_fault+0xb3/0xc0
[  989.890648]  ? asm_exc_page_fault+0x22/0x30
[  989.890652]  ? _nv044009rm+0x10/0x30 [nvidia]
[  989.890912]  ? _raw_spin_unlock_irqrestore+0x22/0x60
[  989.890916]  ? _nv014559rm+0x4d/0x90 [nvidia]
[  989.891173]  ? _nv049696rm+0x18/0x60 [nvidia]
[  989.891424]  ? _nv026805rm+0x61/0x90 [nvidia]
[  989.891598]  ? rm_acpi_nvpcf_notify+0x1c/0xe0 [nvidia]
[  989.891823]  ? acpi_ut_release_mutex+0xea/0x180
[  989.891827]  ? acpi_get_handle+0x64/0xd0
[  989.891830]  ? process_one_work+0x15e/0x280
[  989.891833]  ? acpi_ev_notify_dispatch+0x42/0x60
[  989.891835]  ? acpi_os_execute_deferred+0xe/0x20
[  989.891837]  ? process_one_work+0x15e/0x280
[  989.891839]  ? worker_thread+0x2fb/0x410
[  989.891840]  ? __pfx_worker_thread+0x10/0x10
[  989.891842]  ? kthread+0xe9/0x120
[  989.891844]  ? __pfx_kthread+0x10/0x10
[  989.891845]  ? ret_from_fork+0x28/0x40
[  989.891848]  ? __pfx_kthread+0x10/0x10
[  989.891849]  ? ret_from_fork_asm+0x1b/0x30
[  989.891851]  </TASK>
[  989.891852] Modules linked in: tls snd_seq_dummy snd_hrtimer veth xt_nat nf_conntrack_netlink xfrm_user xfrm_algo xt_addrtype br_netfilter xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp nft_compat nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 bridge stp llc l2tp_ppp l2tp_netlink l2tp_core ip6_udp_tunnel udp_tunnel pppox nvme_fabrics ccm vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) nf_tables nfnetlink rfcomm cmac algif_hash algif_skcipher af_alg bnep overlay sunrpc binfmt_misc snd_usb_audio snd_usbmidi_lib snd_ump btusb btrtl uvcvideo btintel videobuf2_vmalloc btbcm uvc btmtk videobuf2_memops videobuf2_v4l2 videodev bluetooth videobuf2_common mc ecdh_generic ecc snd_ctl_led snd_soc_skl_hda_dsp snd_soc_hdac_hdmi snd_soc_intel_hda_dsp_common snd_sof_probes joydev snd_soc_dmic mei_pxp mei_hdcp snd_hda_codec_realtek snd_hda_codec_generic intel_uncore_frequency intel_uncore_frequency_common intel_rapl_msr intel_tcc_cooling nls_iso8859_1 snd_sof_pci_intel_tgl x86_pkg_temp_thermal
[  989.891892]  intel_powerclamp snd_sof_intel_hda_common soundwire_intel snd_sof_intel_hda_mlink soundwire_cadence coretemp snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp snd_sof kvm_intel snd_sof_utils snd_soc_hdac_hda snd_hda_ext_core snd_soc_acpi_intel_match kvm snd_soc_acpi iwlmvm soundwire_generic_allocation soundwire_bus mac80211 snd_soc_core snd_hda_codec_hdmi snd_compress ac97_bus snd_pcm_dmaengine rapl intel_cstate libarc4 input_leds snd_hda_intel ofpart thinkpad_acpi think_lmi snd_intel_dspcfg snd_seq_midi processor_thermal_device_pci_legacy serio_raw firmware_attributes_class iwlwifi cmdlinepart snd_intel_sdw_acpi snd_seq_midi_event mei_me processor_thermal_device spi_nor nvram processor_thermal_wt_hint snd_hda_codec ledtrig_audio processor_thermal_rfim cfg80211 wmi_bmof mtd mei ee1004 hid_multitouch platform_profile snd_rawmidi snd_hda_core snd_hwdep processor_thermal_rapl snd_seq snd_pcm intel_rapl_common snd_seq_device processor_thermal_wt_req processor_thermal_power_floor processor_thermal_mbox
[  989.891923]  intel_soc_dts_iosf snd_timer snd int3403_thermal soundcore int340x_thermal_zone intel_pmc_core intel_vsec pmt_telemetry int3400_thermal acpi_thermal_rel pmt_class acpi_tad acpi_pad nvidia_uvm(POE) sch_fq_codel msr parport_pc ppdev lp parport efi_pstore ip_tables x_tables autofs4 btrfs blake2b_generic dm_crypt r8153_ecm cdc_ether usbnet raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor r8152 mii raid6_pq libcrc32c raid1 raid0 hid_logitech_hidpp hid_logitech_dj usbhid xe drm_gpuvm drm_exec gpu_sched drm_suballoc_helper drm_ttm_helper nvidia_drm(POE) nvidia_modeset(POE) i915 drm_buddy i2c_algo_bit ttm crct10dif_pclmul crc32_pclmul drm_display_helper hid_generic polyval_clmulni polyval_generic ghash_clmulni_intel nvidia(POE) i2c_hid_acpi cec rtsx_pci_sdmmc rc_core i2c_i801 i2c_hid sha256_ssse3 spi_intel_pci sha1_ssse3 psmouse spi_intel i2c_smbus drm_kms_helper thunderbolt hid intel_lpss_pci nvme xhci_pci intel_lpss rtsx_pci idma64 xhci_pci_renesas ucsi_acpi nvme_core typec_ucsi drm
[  989.891965]  typec video wmi pinctrl_tigerlake aesni_intel crypto_simd cryptd
[  989.891969] CR2: ffff89bac4855fe8
[  989.891971] ---[ end trace 0000000000000000 ]---
[  990.204792] RIP: 0010:_nv044009rm+0x10/0x30 [nvidia]
[  990.205238] Code: 00 00 00 00 00 0f 1f 44 00 00 f3 0f 1e fa c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa 48 83 ec 08 48 83 ed 10 48 8d 7d 08 <48> c7 45 08 00 00 00 00 e8 e3 4f 6f ff 48 8b 45 08 48 83 c4 08 48
[  990.205240] RSP: 0018:ffffa263c02e7d20 EFLAGS: 00010282
[  990.205244] RAX: 0000000000000000 RBX: ffffa263c7ba68e8 RCX: 0000000000000000
[  990.205245] RDX: ffff89bac4f86c88 RSI: 00000000000000c0 RDI: ffff89bac4855fe8
[  990.205246] RBP: ffff89bac4855fe0 R08: 0000000000000000 R09: 0000000000000000
[  990.205247] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000004
[  990.205247] R13: 0000000000000000 R14: ffffa263c7b68008 R15: ffff89baf7568008
[  990.205248] FS:  0000000000000000(0000) GS:ffff89c1ff580000(0000) knlGS:0000000000000000
[  990.205250] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  990.205251] CR2: ffff89bac4855fe8 CR3: 0000000123e38002 CR4: 0000000000770ef0
[  990.205252] PKRU: 55555554
[  990.205253] note: kworker/6:0[54] exited with irqs disabled

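The faulting call comes from rm_acpi_nvpcf_notify on the kacpi_notify workqueue, so this seems to be a problem between the Nvidia driver and power management. I’m using the default Nvidia settings (auto power).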
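
In case it helps anyone compare their own dmesg output against the trace above, here is a rough Python sketch that pulls the faulting RIP symbol and the call-trace frames (with their module, if any) out of an oops. This is not an Nvidia or kernel tool: the function name `summarize_oops` and both regexes are my own assumptions, written only against the line shapes seen in this log, and they may miss other oops formats.

```python
import re

# Hypothetical helper patterns, written against the line shapes in this log only.
# Matches e.g. "[  989.891598]  ? rm_acpi_nvpcf_notify+0x1c/0xe0 [nvidia]"
FRAME_RE = re.compile(
    r"^\[\s*\d+\.\d+\]\s+(?:\?\s+)?(?P<sym>[\w.]+)"
    r"\+0x[0-9a-f]+/0x[0-9a-f]+(?:\s+\[(?P<mod>\w+)\])?\s*$"
)
# Matches e.g. "[  989.890249] RIP: 0010:_nv044009rm+0x10/0x30 [nvidia]"
RIP_RE = re.compile(
    r"RIP:\s+[0-9a-f]{4}:(?P<sym>[\w.]+)"
    r"\+0x[0-9a-f]+/0x[0-9a-f]+(?:\s+\[(?P<mod>\w+)\])?"
)


def summarize_oops(log_text):
    """Return ((rip_symbol, rip_module), [(symbol, module), ...]) for an oops log."""
    rip = None
    frames = []
    for line in log_text.splitlines():
        m = RIP_RE.search(line)
        if m:
            # Keep only the first RIP line; the kernel repeats it after the trace.
            if rip is None:
                rip = (m.group("sym"), m.group("mod"))
            continue
        m = FRAME_RE.match(line)
        if m:
            frames.append((m.group("sym"), m.group("mod")))
    return rip, frames


# A few lines copied from the log above.
SAMPLE = """\
[  989.890249] RIP: 0010:_nv044009rm+0x10/0x30 [nvidia]
[  989.891598]  ? rm_acpi_nvpcf_notify+0x1c/0xe0 [nvidia]
[  989.891833]  ? acpi_ev_notify_dispatch+0x42/0x60
[  989.891835]  ? acpi_os_execute_deferred+0xe/0x20
"""

if __name__ == "__main__":
    rip, frames = summarize_oops(SAMPLE)
    print("faulting symbol:", rip)
    for sym, mod in frames:
        print(f"  {sym} [{mod or 'kernel'}]")
```

Frames with a module of `None` are treated as core-kernel symbols; on this log that separates the ACPI notify path from the frames inside the nvidia module.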
