Black screen when resuming from systemctl suspend, using nvidia-driver-470.57.02 with kernel 5.8.0-63-generic on GTX 970, Xubuntu 20.04 LTS

A couple of days ago I had the following setup:

  • kernel 5.8.0-59-generic
  • driver 460.80-0ubuntu0.20.04.2

Everything was working swimmingly (suspend/resume multiple times a day without any issue). Then an unattended upgrade moved the kernel to 5.8.0-63-generic and the driver to 460.91.03-0ubuntu0.20.04.1. With this configuration the computer would not go to sleep: the screen went black and the machine did nothing until I hard-restarted it.

I then upgraded the driver to the 470 series (nvidia-driver-470.57.02). This time the PC went to sleep but would not wake up: a black screen appears and I have to restart the PC. I tried different combinations of kernels (5.8.0-59-generic, 5.8.0-63-generic) and older drivers (440, 450, 455 (which both install to 460), 460, 470), and for now this seems to be the best configuration (at least it goes to sleep).

I tried to revert the driver to 460.80-0ubuntu0.20.04.2, but it was deleted and superseded by 460.91.03 (see "Publishing history : nvidia-graphics-drivers-460 package : Ubuntu"), which works worse.

I tried the instructions in "Chapter 21. Configuring Power Management Support", but the files it refers to, such as /usr/share/doc/NVIDIA_GLX-1.0/samples/systemd/nvidia-suspend.service, do not exist.
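
Distribution packages may install these units somewhere other than the sample path, so it can be worth searching the usual systemd directories before concluding they are missing. The following is only a minimal sketch: `find_nvidia_units` is a hypothetical helper name, the three unit names are the ones NVIDIA's power management chapter describes, and the default directories are an assumption about common systemd layouts.

```shell
# Sketch: look for the NVIDIA suspend/resume units in common systemd
# directories. find_nvidia_units is a hypothetical helper, not part of
# any driver package.
find_nvidia_units() {
    # Arguments: directories to search; the defaults below are assumptions.
    if [ "$#" -eq 0 ]; then
        set -- /lib/systemd/system /usr/lib/systemd/system /etc/systemd/system
    fi
    for dir in "$@"; do
        for unit in nvidia-suspend.service nvidia-resume.service nvidia-hibernate.service; do
            if [ -e "$dir/$unit" ]; then
                # Print the full path of every unit file that exists.
                printf '%s\n' "$dir/$unit"
            fi
        done
    done
}
```

Calling `find_nvidia_units` with no arguments prints any unit files it finds. If some are present, `systemctl is-enabled nvidia-suspend.service` should show whether they are actually enabled.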

I attach the nvidia-bug-report.log.gz.
nvidia-bug-report.log.gz (401.5 KB)


I am having the same black screen issue when resuming/waking up from suspend to RAM.
The problem first appeared with the 470 beta driver and still occurs with 470.57.02 (installed from the Ubuntu graphics-drivers PPA).
Reverting to 460 drivers works fine.

KDE Neon (Ubuntu 20.04)
GTX 960 4GB
Kernel: 5.8.0-63-generic
Monitor connected via HDMI cable

After I purged all the nvidia packages with:
sudo apt-get purge "nvidia*"

and reinstalled the 460 drivers with:
sudo apt-get install nvidia-driver-460

the resume works again.
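
After a purge and reinstall like this, it may be worth confirming which driver version the kernel module actually loaded. This is a rough sketch, assuming the NVIDIA kernel module exposes /proc/driver/nvidia/version and that the first dotted number in that file is the driver version. `nvidia_loaded_version` is a hypothetical helper name.

```shell
# Sketch: print the first dotted version number found in the driver's
# version file (or in a file passed as $1, which is handy for testing).
nvidia_loaded_version() {
    version_file="${1:-/proc/driver/nvidia/version}"
    # grep -o prints every match on its own line; keep only the first.
    grep -oE '[0-9]+\.[0-9]+(\.[0-9]+)?' "$version_file" | head -n 1
}
```

Since an unattended upgrade replaced the working driver earlier in this thread, `sudo apt-mark hold nvidia-driver-460` may help keep the reinstalled package from being upgraded again.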

I’m having similar issues with the latest 470 driver on Debian kernel 5.10.46-2.

First I see multiple lines like this in /var/log/messages:

Jul 26 07:04:09 /usr/libexec/gdm-x-session[1563]: (EE) NVIDIA(GPU-0): WAIT (1, 8, 0x8000, 0x000064dc, 0x00006698)
Jul 26 07:04:12 /usr/libexec/gdm-x-session[1563]: (EE) NVIDIA(GPU-0): WAIT (2, 8, 0x8000, 0x000064dc, 0x00006764)
Jul 26 07:04:19 /usr/libexec/gdm-x-session[1563]: (EE) NVIDIA(GPU-0): WAIT (1, 8, 0x8000, 0x000064dc, 0x00006764)
Jul 26 07:04:23 /usr/libexec/gdm-x-session[1563]: (EE) NVIDIA(GPU-0): WAIT (2, 8, 0x8000, 0x000064dc, 0x00006830)

So it seems that the nvidia module is stuck in some loop:

Jul 27 07:13:04  kernel: [ 6472.812228] ------------[ cut here ]------------
Jul 27 07:13:04  kernel: [ 6472.812375] WARNING: CPU: 2 PID: 6896 at /var/lib/dkms/nvidia-current/470.57.02/build/nvidia/nv.c:3980 nv_restore_user_channels+0xc9/0xe0 [nvidia]
Jul 27 07:13:04  kernel: [ 6472.812376] Modules linked in: rfkill intel_rapl_msr intel_rapl_common sb_edac x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm irqbypass ghash_clmulni_intel aesni_intel libaes crypto_simd cryptd glue_helper rapl intel_cstate rtsx_usb_ms memstick snd_hda_codec_realtek snd_hda_codec_generic snd_hda_codec_hdmi ledtrig_audio snd_hda_intel snd_intel_dspcfg soundwire_intel soundwire_generic_allocation snd_soc_core mei_wdt snd_compress soundwire_cadence snd_hda_codec nvidia_drm(POE) intel_uncore intel_wmi_thunderbolt wmi_bmof pcspkr snd_hda_core snd_hwdep joydev soundwire_bus tpm_tis tpm_tis_core snd_pcm iTCO_wdt mei_me snd_timer intel_pmc_bxt iTCO_vendor_support tpm snd soundcore watchdog sg mei rng_core drm_kms_helper cec nvidia_modeset(POE) evdev nvidia(POE) msr parport_pc ppdev drm lp parport fuse configfs ip_tables x_tables autofs4 ext4 crc16 mbcache jbd2 crc32c_generic hid_generic usbhid hid rtsx_usb_sdmmc mmc_core rtsx_usb sd_mod t10_pi crc_t10dif crct10dif_generic xhci_pci
Jul 27 07:13:04  kernel: [ 6472.812432]  xhci_hcd crct10dif_pclmul ahci crct10dif_common ata_generic libahci ehci_pci ehci_hcd crc32_pclmul libata e1000e mxm_wmi crc32c_intel lpc_ich i2c_i801 i2c_smbus usbcore scsi_mod ptp pps_core usb_common wmi button
Jul 27 07:13:04  kernel: [ 6472.812450] CPU: 2 PID: 6896 Comm: nvidia-sleep.sh Tainted: P           OE     5.10.0-8-amd64 #1 Debian 5.10.46-2
Jul 27 07:13:04  kernel: [ 6472.812451] Hardware name: LENOVO 30B4S13500/102F, BIOS S00KT67A 04/21/2020
Jul 27 07:13:04  kernel: [ 6472.812559] RIP: 0010:nv_restore_user_channels+0xc9/0xe0 [nvidia]
Jul 27 07:13:04  kernel: [ 6472.812561] Code: 0b dd e4 be 01 00 00 00 4c 89 e7 e8 51 a0 00 00 4c 89 ff e8 39 0b dd e4 ba 02 00 00 00 4c 89 e6 48 89 ef e8 79 66 9c 00 eb 94 <0f> 0b eb c6 41 bd 51 00 00 00 eb 9f 66 66 2e 0f 1f 84 00 00 00 00
Jul 27 07:13:04  kernel: [ 6472.812562] RSP: 0018:ffffb57744dffe28 EFLAGS: 00010206
Jul 27 07:13:04  kernel: [ 6472.812564] RAX: 0000000000000003 RBX: 0000000000000002 RCX: ffffb57744dffdc0
Jul 27 07:13:04  kernel: [ 6472.812565] RDX: 0000000000000087 RSI: 0000000000000246 RDI: 0000000000000246
Jul 27 07:13:04  kernel: [ 6472.812566] RBP: ffff921886238000 R08: 0000000000000000 R09: 0000000000000000
Jul 27 07:13:04  kernel: [ 6472.812567] R10: 0000000000000001 R11: 0000000000000000 R12: ffff9218d104d800
Jul 27 07:13:04  kernel: [ 6472.812568] R13: 0000000000000003 R14: ffff9218d104dd20 R15: ffff9218d104d800
Jul 27 07:13:04  kernel: [ 6472.812570] FS:  00007f2383829740(0000) GS:ffff921befc80000(0000) knlGS:0000000000000000
Jul 27 07:13:04  kernel: [ 6472.812571] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 27 07:13:04  kernel: [ 6472.812572] CR2: 00003e9e0d0f8004 CR3: 0000000267f8e005 CR4: 00000000003706e0
Jul 27 07:13:04  kernel: [ 6472.812573] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Jul 27 07:13:04  kernel: [ 6472.812574] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Jul 27 07:13:04  kernel: [ 6472.812575] Call Trace:
Jul 27 07:13:04  kernel: [ 6472.812683]  nv_set_system_power_state+0x222/0x3c0 [nvidia]
Jul 27 07:13:04  kernel: [ 6472.812790]  nv_procfs_write_suspend+0xec/0x140 [nvidia]
Jul 27 07:13:04  kernel: [ 6472.812795]  proc_reg_write+0x51/0x90
Jul 27 07:13:04  kernel: [ 6472.812797]  vfs_write+0xc0/0x260
Jul 27 07:13:04  kernel: [ 6472.812799]  ksys_write+0x5f/0xe0
Jul 27 07:13:04  kernel: [ 6472.812803]  do_syscall_64+0x33/0x80
Jul 27 07:13:04  kernel: [ 6472.812806]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
Jul 27 07:13:04  kernel: [ 6472.812808] RIP: 0033:0x7f2384031f33
Jul 27 07:13:04  kernel: [ 6472.812810] Code: 8b 15 61 ef 0c 00 f7 d8 64 89 02 48 c7 c0 ff ff ff ff eb b7 0f 1f 00 64 8b 04 25 18 00 00 00 85 c0 75 14 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 55 c3 0f 1f 40 00 48 83 ec 28 48 89 54 24 18
Jul 27 07:13:04  kernel: [ 6472.812811] RSP: 002b:00007ffc261e1d88 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
Jul 27 07:13:04  kernel: [ 6472.812813] RAX: ffffffffffffffda RBX: 0000000000000007 RCX: 00007f2384031f33
Jul 27 07:13:04  kernel: [ 6472.812814] RDX: 0000000000000007 RSI: 0000564ec0a410f0 RDI: 0000000000000001
Jul 27 07:13:04  kernel: [ 6472.812815] RBP: 0000564ec0a410f0 R08: 000000000000000a R09: 0000000000000006
Jul 27 07:13:04  kernel: [ 6472.812816] R10: fffffffffffff286 R11: 0000000000000246 R12: 0000000000000007
Jul 27 07:13:04  kernel: [ 6472.812817] R13: 00007f23841026a0 R14: 0000000000000007 R15: 00007f23841028a0
Jul 27 07:13:04  kernel: [ 6472.812820] ---[ end trace 93d3b039764befce ]---
Jul 27 07:13:04  kernel: [ 6472.812831] ------------[ cut here ]------------

See attached for full stack trace of the “kernel crash”:
crash (16.0 KB)
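
For anyone who wants to compare their own logs with the excerpts above, here is a minimal sketch that counts the Xorg WAIT errors and lists where the nvidia module raised kernel WARNINGs. `nvidia_log_summary` is a hypothetical helper name, and the patterns are taken only from the lines quoted in this post, so other driver versions may log differently.

```shell
# Sketch: summarize NVIDIA-related trouble in a syslog-style file.
# nvidia_log_summary is a hypothetical helper; the patterns come from
# the log excerpts quoted in this thread.
nvidia_log_summary() {
    log_file="${1:-/var/log/messages}"
    # Count the "WAIT" error lines from the NVIDIA X driver.
    # grep -c exits non-zero when the count is 0, hence "|| true".
    wait_count=$(grep -c 'NVIDIA(GPU-0): WAIT' "$log_file" || true)
    printf 'WAIT lines: %s\n' "$wait_count"
    # List source location and function of each kernel WARNING, with counts.
    grep -oE 'WARNING: CPU: [0-9]+ PID: [0-9]+ at [^ ]+ [^ ]+' "$log_file" \
        | awk '{print $7, $8}' | sort | uniq -c
}
```

Running `nvidia_log_summary /var/log/messages` (or pointing it at a saved copy of the log) prints one line with the WAIT count, followed by one line per distinct WARNING location.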

I actually have the same issue with the 460.91 driver as well.

I had the same problem with driver versions 460 through 470, but I solved it with these instructions:

  • delete /etc/X11/xorg.conf
  • make sure nvidia-prime is installed (sudo apt install --reinstall nvidia-prime)
  • switch to nvidia (sudo prime-select nvidia)
  • remove stray blacklist files (sudo rm /lib/modprobe.d/blacklist-nvidia.conf /etc/modprobe.d/blacklist-nvidia.conf)
  • update the initrd (sudo update-initramfs -u)
  • reboot
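
The steps above can be sketched as a single script. This is only an illustration under some assumptions: `run_step` and `fix_nvidia_prime` are hypothetical helper names, `rm -f` is used instead of plain `rm` so that missing files are not treated as errors, and by default the script only prints the commands instead of running them.

```shell
# Sketch: the steps listed above as one script. By default it only prints
# the commands; set DRY_RUN=0 to execute them (this deletes files).
# run_step and fix_nvidia_prime are hypothetical helper names.
run_step() {
    if [ "${DRY_RUN:-1}" = "0" ]; then
        "$@"
    else
        printf '+ %s\n' "$*"
    fi
}

fix_nvidia_prime() {
    run_step sudo rm -f /etc/X11/xorg.conf
    run_step sudo apt install --reinstall nvidia-prime
    run_step sudo prime-select nvidia
    run_step sudo rm -f /lib/modprobe.d/blacklist-nvidia.conf /etc/modprobe.d/blacklist-nvidia.conf
    run_step sudo update-initramfs -u
    printf 'Reboot to apply the changes.\n'
}
```

Calling `fix_nvidia_prime` as-is shows what would be run. `DRY_RUN=0 fix_nvidia_prime` actually runs the commands, so review the printed list first.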

OK, nvidia-prime doesn’t exist in Debian, but I added this module option:

options nvidia-drm modeset=1

which seems to be the only thing the Ubuntu script does (besides some blacklist file that I don’t have). I will see if it works.
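
To double-check that the option is really set in a modprobe configuration file, something like the sketch below may help. `has_drm_modeset` is a hypothetical helper name, and the default file path is only an assumption; the option can live in any file under /etc/modprobe.d.

```shell
# Sketch: succeed if a modprobe.d-style file sets nvidia-drm modeset=1
# on an uncommented "options" line. has_drm_modeset is a hypothetical
# helper; the default path is an assumption.
has_drm_modeset() {
    conf_file="${1:-/etc/modprobe.d/nvidia-drm.conf}"
    grep -Eq '^[[:space:]]*options[[:space:]]+nvidia[-_]drm[[:space:]]+(.*[[:space:]])?modeset=1([[:space:]]|$)' \
        "$conf_file" 2>/dev/null
}
```

For example, `has_drm_modeset /etc/modprobe.d/nvidia-drm.conf && echo "option present"`. After a reboot, `cat /sys/module/nvidia_drm/parameters/modeset` should print Y if the module picked the option up. If the module is loaded from the initramfs, `sudo update-initramfs -u` may be needed first.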

The problem came back a day after it was solved! I don’t know what happened.
And setting options nvidia-drm modeset=1 does not solve the issue.

It seems installing nvidia-driver-460-server solved the problem for me. I’ve tested it with several reboots and by suspending and waking my laptop again.
Now it’s up to the NVIDIA team to find out what difference makes the server version work, and patch it in a way it does not work too.

Yeah, same here: it worked for me for one day, but it didn’t work today. The stack trace is a bit different:

Jul 31 08:03:02 kernel: [49660.322416] Call Trace:
Jul 31 08:03:02 kernel: [49660.322418]  <IRQ>
Jul 31 08:03:02 kernel: [49660.322423]  dump_stack+0x6b/0x83
Jul 31 08:03:02 kernel: [49660.322425]  nmi_cpu_backtrace.cold+0x32/0x69
Jul 31 08:03:02 kernel: [49660.322428]  ? lapic_can_unplug_cpu+0x80/0x80
Jul 31 08:03:02 kernel: [49660.322430]  nmi_trigger_cpumask_backtrace+0xd7/0xe0
Jul 31 08:03:02 kernel: [49660.322433]  rcu_dump_cpu_stacks+0xa2/0xd0
Jul 31 08:03:02 kernel: [49660.322434]  rcu_sched_clock_irq.cold+0x1ff/0x3d6
Jul 31 08:03:02 kernel: [49660.322436]  update_process_times+0x8c/0xc0
Jul 31 08:03:02 kernel: [49660.322439]  tick_sched_handle+0x22/0x60
Jul 31 08:03:02 kernel: [49660.322440]  tick_sched_timer+0x7c/0xb0
Jul 31 08:03:02 kernel: [49660.322442]  ? tick_do_update_jiffies64.part.0+0xc0/0xc0
Jul 31 08:03:02 kernel: [49660.322443]  __hrtimer_run_queues+0x12a/0x270
Jul 31 08:03:02 kernel: [49660.322444]  hrtimer_interrupt+0x110/0x2c0
Jul 31 08:03:02 kernel: [49660.322445]  __sysvec_apic_timer_interrupt+0x5f/0xd0
Jul 31 08:03:02 kernel: [49660.322447]  asm_call_irq_on_stack+0x12/0x20
Jul 31 08:03:02 kernel: [49660.322448]  </IRQ>
Jul 31 08:03:02 kernel: [49660.322451]  sysvec_apic_timer_interrupt+0x72/0x80
Jul 31 08:03:02 kernel: [49660.322453]  asm_sysvec_apic_timer_interrupt+0x12/0x20
Jul 31 08:03:02 kernel: [49660.322603] RIP: 0010:_nv042006rm+0x86/0xe0 [nvidia]
Jul 31 08:03:02 kernel: [49660.322604] Code: 0f b7 35 f1 6a 62 01 eb 2d 0f 1f 00 8b 15 ca 6a 62 01 e8 4d 97 ff ff 0f b7 35 da 6a 62 01 83 eb 01 8b 05 e9 6a 62 01 44 01 e6 <83> fb ff 66 89 35 c4 6a 62 01 74 1e f6 c4 02 0f b7 f6 0f b7 3d c7
Jul 31 08:03:02 kernel: [49660.322605] RSP: 0018:ffffaf08455c3af8 EFLAGS: 00000206
Jul 31 08:03:02 kernel: [49660.322606] RAX: 0000000000000200 RBX: 0000000000003ca1 RCX: 00000000000a0000
Jul 31 08:03:02 kernel: [49660.322607] RDX: ffffffffc2638a70 RSI: 0000000000000d78 RDI: 0000000000000d74
Jul 31 08:03:02 kernel: [49660.322607] RBP: ffff9ffe83ee5aa0 R08: ffffffffc2638ac0 R09: 0000000000000202
Jul 31 08:03:02 kernel: [49660.322607] R10: 0000000000000246 R11: 0000000000000246 R12: 0000000000000004
Jul 31 08:03:02 kernel: [49660.322608] R13: ffff9ffe83ee5ae8 R14: ffff9ffe83ee5ae4 R15: 0000000000000000
Jul 31 08:03:02 kernel: [49660.322716]  ? _nv042006rm+0x73/0xe0 [nvidia]
Jul 31 08:03:02 kernel: [49660.322822]  ? _nv000413rm+0x20/0x20 [nvidia]
Jul 31 08:03:02 kernel: [49660.322927]  ? _nv000828rm+0x4f/0x130 [nvidia]
Jul 31 08:03:02 kernel: [49660.323033]  ? _nv031073rm+0x18b/0x250 [nvidia]
Jul 31 08:03:02 kernel: [49660.323142]  ? _nv032859rm+0x2c/0x170 [nvidia]
Jul 31 08:03:02 kernel: [49660.323250]  ? _nv000730rm+0x31f/0x460 [nvidia]
Jul 31 08:03:02 kernel: [49660.323358]  ? _nv000626rm+0x1db/0x3f0 [nvidia]
Jul 31 08:03:02 kernel: [49660.323466]  ? _nv014938rm+0xdf/0x2e0 [nvidia]
Jul 31 08:03:02 kernel: [49660.323592]  ? _nv036185rm+0x184/0x190 [nvidia]
Jul 31 08:03:02 kernel: [49660.323717]  ? _nv038004rm+0x274/0x2d0 [nvidia]
Jul 31 08:03:02 kernel: [49660.323821]  ? _nv009283rm+0x34c/0x420 [nvidia]
Jul 31 08:03:02 kernel: [49660.323925]  ? _nv036300rm+0x57/0x100 [nvidia]
Jul 31 08:03:02 kernel: [49660.324029]  ? _nv008276rm+0x55/0xa0 [nvidia]
Jul 31 08:03:02 kernel: [49660.324133]  ? _nv008276rm+0x34/0xa0 [nvidia]
Jul 31 08:03:02 kernel: [49660.324240]  ? rm_kernel_rmapi_op+0x159/0x1b0 [nvidia]
Jul 31 08:03:02 kernel: [49660.324254]  ? nvkms_call_rm+0x4b/0x80 [nvidia_modeset]
Jul 31 08:03:02 kernel: [49660.324263]  ? _nv002680kms+0x51/0x60 [nvidia_modeset]
Jul 31 08:03:02 kernel: [49660.324271]  ? _nv002722kms+0x3e/0x90 [nvidia_modeset]
Jul 31 08:03:02 kernel: [49660.324277]  ? _nv000352kms+0x1cf/0x200 [nvidia_modeset]
Jul 31 08:03:02 kernel: [49660.324284]  ? _nv002305kms+0x1ff/0x640 [nvidia_modeset]
Jul 31 08:03:02 kernel: [49660.324289]  ? nvKmsResume+0x43/0x80 [nvidia_modeset]
Jul 31 08:03:02 kernel: [49660.324295]  ? nvkms_resume+0x1b/0x40 [nvidia_modeset]
Jul 31 08:03:02 kernel: [49660.324357]  ? nv_set_system_power_state+0x249/0x3c0 [nvidia]
Jul 31 08:03:02 kernel: [49660.324422]  ? nv_procfs_write_suspend+0xec/0x140 [nvidia]
Jul 31 08:03:02 kernel: [49660.324425]  ? proc_reg_write+0x51/0x90
Jul 31 08:03:02 kernel: [49660.324426]  ? vfs_write+0xc0/0x260
Jul 31 08:03:02 kernel: [49660.324427]  ? ksys_write+0x5f/0xe0
Jul 31 08:03:02 kernel: [49660.324428]  ? do_syscall_64+0x33/0x80
Jul 31 08:03:02 kernel: [49660.324430]  ? entry_SYSCALL_64_after_hwframe+0x44/0xa9
Jul 31 08:03:03 kernel: [49661.674156] task:nv_queue        state:D stack:    0 pid:  335 ppid:     2 flags:0x00004000
Jul 31 08:03:03 kernel: [49661.674158] Call Trace:
Jul 31 08:03:03 kernel: [49661.674163]  __schedule+0x282/0x870
Jul 31 08:03:03 kernel: [49661.674165]  schedule+0x46/0xb0
Jul 31 08:03:03 kernel: [49661.674167]  rwsem_down_read_slowpath+0x18e/0x500
Jul 31 08:03:03 kernel: [49661.674269]  nvidia_close_deferred+0x15/0x30 [nvidia]
Jul 31 08:03:03 kernel: [49661.674347]  _main_loop+0x9e/0x150 [nvidia]
Jul 31 08:03:03 kernel: [49661.674425]  ? nvidia_modeset_resume+0x20/0x20 [nvidia]
Jul 31 08:03:03 kernel: [49661.674428]  kthread+0x11b/0x140
Jul 31 08:03:03 kernel: [49661.674429]  ? __kthread_bind_mask+0x60/0x60
Jul 31 08:03:03 kernel: [49661.674431]  ret_from_fork+0x22/0x30
Jul 31 08:03:03 kernel: [49661.674434] task:nvidia-modeset/ state:D stack:    0 pid:  375 ppid:     2 flags:0x00004000
Jul 31 08:03:03 kernel: [49661.674435] Call Trace:
Jul 31 08:03:03 kernel: [49661.674437]  __schedule+0x282/0x870
Jul 31 08:03:03 kernel: [49661.674439]  schedule+0x46/0xb0
Jul 31 08:03:03 kernel: [49661.674441]  rwsem_down_read_slowpath+0x18e/0x500
Jul 31 08:03:03 kernel: [49661.674454]  nvkms_kthread_q_callback+0x71/0x100 [nvidia_modeset]
Jul 31 08:03:03 kernel: [49661.674462]  _main_loop+0x9e/0x150 [nvidia_modeset]
Jul 31 08:03:03 kernel: [49661.674468]  ? nvkms_sema_up+0x10/0x10 [nvidia_modeset]
Jul 31 08:03:03 kernel: [49661.674470]  kthread+0x11b/0x140
Jul 31 08:03:03 kernel: [49661.674471]  ? __kthread_bind_mask+0x60/0x60
Jul 31 08:03:03 kernel: [49661.674472]  ret_from_fork+0x22/0x30
Jul 31 08:03:13 kernel: [49671.658445] ------------[ cut here ]------------

This is with 460.91.03. I will try 470.57.02.
nvidia_crash (44.6 KB)

Try installing the server version; it works perfectly for me.

I don’t see it in Debian. I would need to look into Ubuntu’s repo and see how it’s different.

Can you try the steps in this guide? "How to install Nvidia 470.57.02 Drivers for Ubuntu 20.04 using a run file" (GitHub)

I had very similar problems with my laptop which has an NVIDIA GeForce GTX 1050 Ti graphics card and these steps solved all my problems.

I’m also experiencing this issue on Kubuntu 21.04 with a GeForce 710. Can we get someone to take a look at this, please?

Hi - even downgrading to an older driver version doesn’t solve the issue. On resume both nvidia-sleep.sh and Xorg are using 100% CPU, but I can’t figure out where they’re stuck…
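
One way to get a hint about where a process is stuck is to look at its state and kernel stack under /proc. This is a minimal sketch, assuming a Linux /proc filesystem; `stuck_snapshot` is a hypothetical helper name, and reading /proc/PID/stack usually requires root.

```shell
# Sketch: print the scheduler state and, where readable, the kernel
# stack of each PID given as an argument. stuck_snapshot is a
# hypothetical helper name.
stuck_snapshot() {
    for pid in "$@"; do
        printf 'PID %s: ' "$pid"
        # "State:" line from /proc, e.g. R (running) or D (disk sleep).
        grep '^State:' "/proc/$pid/status" 2>/dev/null || printf 'no such process\n'
        # Kernel stack; usually readable only by root, so failures are ignored.
        cat "/proc/$pid/stack" 2>/dev/null || true
    done
}
```

Over SSH, something like `sudo sh -c '. ./snapshot.sh; stuck_snapshot $(pgrep -x Xorg) $(pgrep -x nvidia-sleep.sh)'` could be run while the screen is black, where snapshot.sh is a hypothetical file holding the function above.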

We have filed bug 200762400 internally for tracking purposes; you can use it for further follow-up.
I will first try to reproduce the issue locally and will get back to you if I need any more information.

No repro on the configuration below:

Dell Precision T7600 + Ubuntu 20.04.2 LTS + kernel 5.8.0-050800-generic + NVIDIA TITAN Xp + Driver 470.57.02

I will connect the NVIDIA Corporation GM204 [GeForce GTX 970] that user "toomanysecrets69" is using and retry the repro.
I also request other users to share their bug reports.

nvidia-bug-report.log.gz (302.7 KB)

In my case the problem occurs with 470 driver. I have no problem with 460 driver.
In my tests with the 470 driver, resuming from suspend has worked correctly a couple of times, though I can’t tell what was different. But 90% of the time resume/wake-up does not work: the PC turns on but I get a black screen, and I cannot switch to a TTY or do a safe reboot with the key combination. It looks like a kernel panic.

Using KDE Neon here (Kubuntu 20.04)

Thank you, purging nvidia* and installing 460 version works like a charm!

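With the NVIDIA 470 drivers and kernel 5.11.XX, suspend mode crashes the system.
The screen is black after exiting suspend mode. sshd is still working, so I can log in and run nvidia-smi, and I get the following errors in dmesg (newest entries first; "авг." in the timestamps is the Russian abbreviation for August):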

4 авг., 19:19:27 kernel: igc 0000:07:00.0 enp7s0: NIC Link is Up 1000 Mbps Full Duplex, Flow Control: RX/TX
 4 авг., 19:19:15 kernel: nvidia-modeset: ERROR: GPU:0: Failed to allocate display engine core DMA push buffer
 4 авг., 19:19:02 kernel: NVRM: Xid (PCI:0000:08:00): 56, pid=7532, CMDre 00000000 00000080 00000000 00000005 00001005
 4 авг., 19:19:02 kernel: nvidia-modeset: ERROR: GPU:0: Idling display engine timed out: 0x0000927d:0:0:325
 4 авг., 19:18:55 kernel: NVRM: Xid (PCI:0000:08:00): 56, pid=7532, CMDre 00000000 0000008c 00000000 00000005 0000102b
 4 авг., 19:18:55 kernel: nvidia-modeset: WARNING: GPU:0: Lost display notification (0:0x00000000); continuing.
 4 авг., 19:18:52 kernel: ---[ end trace 54f5c42cd6b22fdc ]---
 4 авг., 19:18:52 kernel: R13: 00007ffa19bb56a0 R14: 00007ffa19bb64a0 R15: 00007ffa19bb58a0
 4 авг., 19:18:52 kernel: R10: 00005595043fb017 R11: 0000000000000246 R12: 0000000000000007
 4 авг., 19:18:52 kernel: RBP: 0000559505cec570 R08: 000000000000000a R09: 0000000000000006
 4 авг., 19:18:52 kernel: RDX: 0000000000000007 RSI: 0000559505cec570 RDI: 0000000000000001
 4 авг., 19:18:52 kernel: RAX: ffffffffffffffda RBX: 0000000000000007 RCX: 00007ffa19ada1e7
 4 авг., 19:18:52 kernel: RSP: 002b:00007ffecf70c4b8 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
 4 авг., 19:18:52 kernel: Code: 64 89 02 48 c7 c0 ff ff ff ff eb bb 0f 1f 80 00 00 00 00 f3 0f 1e fa 64 8b 04 25 18 00 00 00 85 c0 75 10 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 51 c3 48 83 ec 28 48 89 54 24 18 48 89 74 24
 4 авг., 19:18:52 kernel: RIP: 0033:0x7ffa19ada1e7
 4 авг., 19:18:52 kernel:  entry_SYSCALL_64_after_hwframe+0x44/0xa9
 4 авг., 19:18:52 kernel: Call Trace:
 4 авг., 19:18:52 kernel: CR2: 00007fc4b0b6eef0 CR3: 0000000112596000 CR4: 0000000000350ee0
 4 авг., 19:18:52 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
 4 авг., 19:18:52 kernel: FS:  00007ffa199c6740(0000) GS:ffff8da1ae940000(0000) knlGS:0000000000000000
 4 авг., 19:18:52 kernel: R13: 0000000000000000 R14: 0000559505cec570 R15: ffffb13084ff3ef0
 4 авг., 19:18:52 kernel: R10: ffff8d934e4c3000 R11: 0000000000000001 R12: ffff8d92c0c12000
 4 авг., 19:18:52 kernel: RBP: ffffb13084ff3e50 R08: 0000000000000001 R09: ffffffffc183fa01
 4 авг., 19:18:52 kernel: RDX: 0000000080020002 RSI: ffffffffc183fa78 RDI: ffff8d92cd6aac00
 4 авг., 19:18:52 kernel: RAX: 0000000000000003 RBX: 0000000000000002 RCX: 0000000080020001
 4 авг., 19:18:52 kernel: RSP: 0018:ffffb13084ff3e20 EFLAGS: 00010206
 4 авг., 19:18:52 kernel: Code: 00 4d 85 e4 0f 84 4a ff ff ff 41 83 fd 02 74 e9 49 8b bc 24 88 02 00 00 be 02 00 00 00 e8 37 d0 ff ff 85 c0 74 d3 0f 0b eb cf <0f> 0b e9 64 ff ff ff 48 c7 c7 d0 3a 7f c3 e8 2c 67 2c c8 e8 37 1b
 4 авг., 19:18:52 kernel: RIP: 0010:nv_set_system_power_state+0x2c1/0x3c0 [nvidia]
 4 авг., 19:18:52 kernel: Hardware name: ASUS System Product Name/ROG STRIX B550-F GAMING (WI-FI), BIOS 2006 03/19/2021
 4 авг., 19:18:52 kernel: CPU: 5 PID: 7532 Comm: nvidia-sleep.sh Tainted: P        W  OE     5.11.0-25-generic #27~20.04.1-Ubuntu
 4 авг., 19:18:52 kernel:  btrtl btbcm snd_timer btintel fb_sys_fops bluetooth input_leds snd nls_iso8859_1 wmi_bmof syscopyarea eeepc_wmi ecdh_generic sysfillrect rapl efi_pstore ecc cfg80211 sysimgblt ccp soundcore joydev zenpower(OE) mac_hid sch_fq_codel nct6775 hwmon_vid msr parport_pc ppdev lp drm parport sunrpc ip_tables x_tables autofs4 btrfs blake2b_generic raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear dm_mirror dm_region_hash dm_log hid_generic usbhid hid asus_wmi sparse_keymap video mfd_aaeon crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel crypto_simd cryptd igc glue_helper ahci xhci_pci nvme libahci xhci_pci_renesas i2c_piix4 nvme_core wmi gpio_amdpt gpio_generic
 4 авг., 19:18:52 kernel: Modules linked in: vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) rfcomm xt_CHECKSUM ipt_REJECT nf_reject_ipv4 xt_tcpudp ip6table_mangle ip6table_nat xt_conntrack xt_MASQUERADE nf_conntrack_netlink xfrm_user xfrm_algo iptable_mangle nf_tables nfnetlink xt_addrtype ip6table_filter ip6_tables iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 iptable_filter bpfilter br_netfilter bridge stp llc aufs cmac algif_hash algif_skcipher af_alg overlay bnep nvidia_uvm(POE) nvidia_drm(POE) binfmt_misc nvidia_modeset(POE) intel_rapl_msr intel_rapl_common nvidia(POE) snd_hda_codec_realtek snd_hda_codec_generic ledtrig_audio snd_hda_intel snd_intel_dspcfg soundwire_intel soundwire_generic_allocation soundwire_cadence snd_hda_codec snd_hda_core iwlmvm edac_mce_amd snd_hwdep soundwire_bus drm_kms_helper mac80211 snd_soc_core cec rc_core snd_compress ac97_bus snd_pcm_dmaengine kvm_amd libarc4 snd_pcm snd_seq_midi snd_seq_midi_event snd_rawmidi kvm snd_seq iwlwifi btusb snd_seq_device
 4 авг., 19:18:52 kernel: WARNING: CPU: 5 PID: 7532 at /var/lib/dkms/nvidia/470.57.02/build/nvidia/nv.c:4175 nv_set_system_power_state+0x2c1/0x3c0 [nvidia]
 4 авг., 19:18:52 kernel: ------------[ cut here ]------------
 4 авг., 19:18:52 kernel: ---[ end trace 54f5c42cd6b22fdb ]---
 4 авг., 19:18:52 kernel: R13: 00007ffa19bb56a0 R14: 00007ffa19bb64a0 R15: 00007ffa19bb58a0
 4 авг., 19:18:52 kernel: R10: 00005595043fb017 R11: 0000000000000246 R12: 0000000000000007
 4 авг., 19:18:52 kernel: RBP: 0000559505cec570 R08: 000000000000000a R09: 0000000000000006
 4 авг., 19:18:52 kernel: RDX: 0000000000000007 RSI: 0000559505cec570 RDI: 0000000000000001
 4 авг., 19:18:52 kernel: RAX: ffffffffffffffda RBX: 0000000000000007 RCX: 00007ffa19ada1e7
 4 авг., 19:18:52 kernel: RSP: 002b:00007ffecf70c4b8 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
 4 авг., 19:18:52 kernel: Code: 64 89 02 48 c7 c0 ff ff ff ff eb bb 0f 1f 80 00 00 00 00 f3 0f 1e fa 64 8b 04 25 18 00 00 00 85 c0 75 10 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 51 c3 48 83 ec 28 48 89 54 24 18 48 89 74 24
 4 авг., 19:18:52 kernel: RIP: 0033:0x7ffa19ada1e7
 4 авг., 19:18:52 kernel:  entry_SYSCALL_64_after_hwframe+0x44/0xa9
 4 авг., 19:18:52 kernel: Call Trace:
 4 авг., 19:18:52 kernel: CR2: 00007fc4b0b6eef0 CR3: 0000000112596000 CR4: 0000000000350ee0
 4 авг., 19:18:52 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
 4 авг., 19:18:52 kernel: FS:  00007ffa199c6740(0000) GS:ffff8da1ae940000(0000) knlGS:0000000000000000
 4 авг., 19:18:52 kernel: R13: ffff8d92c0c12000 R14: 0000000000000003 R15: ffff8d92c0c12520
 4 авг., 19:18:52 kernel: R10: ffff8d934f1838a0 R11: 0000000000000001 R12: ffff8d934e4c3000
 4 авг., 19:18:52 kernel: RBP: ffffb13084ff3e10 R08: 0000000000000000 R09: ffffffffc184dc00
 4 авг., 19:18:52 kernel: RDX: 0000000000000087 RSI: 0000000000000246 RDI: 0000000000000246
 4 авг., 19:18:52 kernel: RAX: 0000000000000003 RBX: ffff8d92c0c12000 RCX: ffffb13084ff3d80
 4 авг., 19:18:52 kernel: RSP: 0018:ffffb13084ff3de8 EFLAGS: 00010206
 4 авг., 19:18:52 kernel: Code: 98 2c c8 be 01 00 00 00 4c 89 ef e8 dc a0 00 00 48 89 df e8 34 99 2c c8 ba 02 00 00 00 4c 89 ee 4c 89 e7 e8 54 6b 9c 00 eb 93 <0f> 0b eb c6 41 be 51 00 00 00 eb 9e 66 0f 1f 44 00 00 0f 1f 44 00
 4 авг., 19:18:52 kernel: RIP: 0010:nv_restore_user_channels+0xce/0xe0 [nvidia]
 4 авг., 19:18:52 kernel: Hardware name: ASUS System Product Name/ROG STRIX B550-F GAMING (WI-FI), BIOS 2006 03/19/2021
 4 авг., 19:18:52 kernel: CPU: 5 PID: 7532 Comm: nvidia-sleep.sh Tainted: P           OE     5.11.0-25-generic #27~20.04.1-Ubuntu
 4 авг., 19:18:52 kernel:  btrtl btbcm snd_timer btintel fb_sys_fops bluetooth input_leds snd nls_iso8859_1 wmi_bmof syscopyarea eeepc_wmi ecdh_generic sysfillrect rapl efi_pstore ecc cfg80211 sysimgblt ccp soundcore joydev zenpower(OE) mac_hid sch_fq_codel nct6775 hwmon_vid msr parport_pc ppdev lp drm parport sunrpc ip_tables x_tables autofs4 btrfs blake2b_generic raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear dm_mirror dm_region_hash dm_log hid_generic usbhid hid asus_wmi sparse_keymap video mfd_aaeon crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel crypto_simd cryptd igc glue_helper ahci xhci_pci nvme libahci xhci_pci_renesas i2c_piix4 nvme_core wmi gpio_amdpt gpio_generic
 4 авг., 19:18:52 kernel: Modules linked in: vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) rfcomm xt_CHECKSUM ipt_REJECT nf_reject_ipv4 xt_tcpudp ip6table_mangle ip6table_nat xt_conntrack xt_MASQUERADE nf_conntrack_netlink xfrm_user xfrm_algo iptable_mangle nf_tables nfnetlink xt_addrtype ip6table_filter ip6_tables iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 iptable_filter bpfilter br_netfilter bridge stp llc aufs cmac algif_hash algif_skcipher af_alg overlay bnep nvidia_uvm(POE) nvidia_drm(POE) binfmt_misc nvidia_modeset(POE) intel_rapl_msr intel_rapl_common nvidia(POE) snd_hda_codec_realtek snd_hda_codec_generic ledtrig_audio snd_hda_intel snd_intel_dspcfg soundwire_intel soundwire_generic_allocation soundwire_cadence snd_hda_codec snd_hda_core iwlmvm edac_mce_amd snd_hwdep soundwire_bus drm_kms_helper mac80211 snd_soc_core cec rc_core snd_compress ac97_bus snd_pcm_dmaengine kvm_amd libarc4 snd_pcm snd_seq_midi snd_seq_midi_event snd_rawmidi kvm snd_seq iwlwifi btusb snd_seq_device
 4 авг., 19:18:52 kernel: WARNING: CPU: 5 PID: 7532 at /var/lib/dkms/nvidia/470.57.02/build/nvidia/nv.c:3980 nv_restore_user_channels+0xce/0xe0 [nvidia]
 4 авг., 19:18:52 kernel: ------------[ cut here ]------------
 4 авг., 19:18:40 kernel: igc 0000:07:00.0 enp7s0: NIC Link is Up 1000 Mbps Full Duplex, Flow Control: RX/TX
 4 авг., 19:18:40 kernel: nvidia 0000:08:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x0011 address=0xfffffff000 flags=0x0010]

I do not know the cause of the error. If I try to reboot remotely, the machine hangs.
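
The NVRM Xid lines in a log like the one above are usually the first thing worth extracting. Here is a minimal sketch that counts Xid codes in kernel log text read from stdin. `xid_counts` is a hypothetical helper name, and the pattern is based only on the Xid lines quoted in this post.

```shell
# Sketch: count NVRM Xid error codes in kernel log text read from stdin.
# xid_counts is a hypothetical helper name.
xid_counts() {
    # Pull the numeric code out of lines like
    # "NVRM: Xid (PCI:0000:08:00): 56, pid=7532, ..." and tally each code.
    sed -nE 's/.*NVRM: Xid \(PCI:[^)]*\): ([0-9]+),.*/\1/p' \
        | sort -n | uniq -c \
        | awk '{print "Xid " $2 ": " $1 " occurrence(s)"}'
}
```

It can be fed from `sudo dmesg | xid_counts` or `journalctl -k | xid_counts`. NVIDIA's Xid documentation lists what each code means.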

If I use other drivers (NVIDIA 460 or NVIDIA 390), suspend mode stops working: it just locks the screen. I tried different kernels: 5.4.0-80, 5.8.0-59, 5.8.0-63.
The NVIDIA 470 driver works correctly only with kernels 5.4.XX and 5.8.0-59. With kernel 5.8.0-63 I also sometimes get a black screen after suspend mode.

$ inxi -Fxxxrz
System:    Kernel: 5.8.0-59-generic x86_64 bits: 64 compiler: N/A Desktop: Cinnamon 5.0.5 wm: muffin 5.0.1 dm: LightDM 1.30.0 
           Distro: Linux Mint 20.2 Uma base: Ubuntu 20.04 focal 
Machine:   Type: Desktop System: ASUS product: N/A v: N/A serial: <filter> 
           Mobo: ASUSTeK model: ROG STRIX B550-F GAMING (WI-FI) v: Rev X.0x serial: <filter> UEFI: American Megatrends v: 2006 
           date: 03/19/2021 
CPU:       Topology: 12-Core model: AMD Ryzen 9 3900XT bits: 64 type: MT MCP arch: Zen L2 cache: 6144 KiB 
           flags: avx avx2 lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm bogomips: 182407 
           Speed: 2197 MHz min/max: 2200/3800 MHz boost: enabled Core speeds (MHz): 1: 2200 2: 2199 3: 2200 4: 2198 5: 2200 
           6: 2196 7: 2191 8: 2199 9: 2191 10: 2193 11: 2193 12: 2198 13: 2200 14: 2192 15: 2201 16: 2198 17: 2192 18: 2200 
           19: 2200 20: 2196 21: 2198 22: 2200 23: 2196 24: 2200 
Graphics:  Device-1: NVIDIA GK208B [GeForce GT 710] vendor: Gigabyte driver: nvidia v: 470.57.02 bus ID: 08:00.0 
           chip ID: 10de:128b 
           Display: x11 server: X.Org 1.20.11 driver: nvidia resolution: 1920x1080~60Hz 
           OpenGL: renderer: NVIDIA GeForce GT 710/PCIe/SSE2 v: 4.6.0 NVIDIA 470.57.02 direct render: Yes 
Audio:     Device-1: NVIDIA GK208 HDMI/DP Audio vendor: Gigabyte driver: N/A bus ID: 08:00.1 chip ID: 10de:0e0f 
           Device-2: Advanced Micro Devices [AMD] Starship/Matisse HD Audio vendor: ASUSTeK driver: snd_hda_intel v: kernel 
           bus ID: 0a:00.4 chip ID: 1022:1487 
           Sound Server: ALSA v: k5.8.0-59-generic 
Network:   Device-1: Intel Wi-Fi 6 AX200 driver: iwlwifi v: kernel bus ID: 06:00.0 chip ID: 8086:2723 
           IF: wlp6s0 state: down mac: <filter> 
           Device-2: Intel Ethernet I225-V vendor: ASUSTeK driver: igc v: 0.0.1-k port: N/A bus ID: 07:00.0 chip ID: 8086:15f3 
           IF: enp7s0 state: up speed: 1000 Mbps duplex: full mac: <filter> 
           IF-ID-1: docker0 state: down mac: <filter> 
           IF-ID-2: vboxnet0 state: down mac: <filter> 
           IF-ID-3: vboxnet1 state: down mac: <filter> 
           IF-ID-4: vboxnet2 state: down mac: <filter> 
           IF-ID-5: vboxnet3 state: down mac: <filter> 
           IF-ID-6: vboxnet4 state: down mac: <filter> 
           IF-ID-7: vboxnet5 state: down mac: <filter> 
           IF-ID-8: vboxnet6 state: down mac: <filter> 
           IF-ID-9: virbr0 state: down mac: <filter> 
           IF-ID-10: virbr0-nic state: down mac: <filter> 
Drives:    Local Storage: total: 2.73 TiB used: 827.99 GiB (29.6%) 
           ID-1: /dev/nvme0n1 vendor: Seagate model: XPG GAMMIX S50 size: 1.82 TiB speed: 63.2 Gb/s lanes: 4 serial: <filter> 
           rev: EGFM11.2 scheme: GPT 
           ID-2: /dev/nvme1n1 vendor: Samsung model: SSD 970 EVO 1TB size: 931.51 GiB speed: 31.6 Gb/s lanes: 4 
           serial: <filter> rev: 2B2QEXE7 scheme: GPT 
           ID-3: /dev/sda vendor: Seagate model: ST9500325AS size: 465.76 GiB speed: 3.0 Gb/s rotation: 5400 rpm 
           serial: <filter> rev: HPM1 scheme: GPT 
           ID-4: /dev/sdb vendor: Seagate model: ST500LM021-1KJ152 size: 465.76 GiB speed: 6.0 Gb/s rotation: 7200 rpm 
           serial: <filter> rev: SDM1 scheme: GPT 
Partition: ID-1: / size: 1.72 TiB used: 637.21 GiB (36.2%) fs: btrfs dev: /dev/nvme0n1p2 
           ID-2: /home size: 1.72 TiB used: 637.21 GiB (36.2%) fs: btrfs dev: /dev/nvme0n1p2 
           ID-3: swap-1 size: 100.68 GiB used: 0 KiB (0.0%) fs: swap dev: /dev/nvme0n1p3 
Sensors:   System Temperatures: cpu: 39.2 C mobo: 34.0 C gpu: nvidia temp: 57 C 
           Fan Speeds (RPM): fan-1: 0 fan-2: 1956 fan-3: 0 fan-4: 0 fan-5: 0 fan-6: 0 fan-7: 0 gpu: nvidia fan: 33% 
Repos:     No active apt repos in: /etc/apt/sources.list 
           Active apt repos in: /etc/apt/sources.list.d/additional-repositories.list 
           1: deb [arch=amd64] https://download.docker.com/linux/ubuntu focal stable
           Active apt repos in: /etc/apt/sources.list.d/ansible-ansible-focal.list 
           1: deb http://ppa.launchpad.net/ansible/ansible/ubuntu focal main
           Active apt repos in: /etc/apt/sources.list.d/cappelikan-ppa-focal.list 
           1: deb http://ppa.launchpad.net/cappelikan/ppa/ubuntu focal main
           Active apt repos in: /etc/apt/sources.list.d/google-chrome.list 
           1: deb [arch=amd64] http://dl.google.com/linux/chrome/deb/ stable main
           Active apt repos in: /etc/apt/sources.list.d/helm-stable-debian.list 
           1: deb https://baltocdn.com/helm/stable/debian/ all main
           Active apt repos in: /etc/apt/sources.list.d/kubernetes.list 
           1: deb https://apt.kubernetes.io/ kubernetes-xenial main
           Active apt repos in: /etc/apt/sources.list.d/linvinus-rhvoice-focal.list 
           1: deb http://ppa.launchpad.net/linvinus/rhvoice/ubuntu focal main
           Active apt repos in: /etc/apt/sources.list.d/longsleep-golang-backports-focal.list 
           1: deb http://ppa.launchpad.net/longsleep/golang-backports/ubuntu focal main
           Active apt repos in: /etc/apt/sources.list.d/microsoft-edge-dev.list 
           1: deb [arch=amd64] http://packages.microsoft.com/repos/edge/ stable main
           Active apt repos in: /etc/apt/sources.list.d/official-package-repositories.list 
           1: deb http://packages.linuxmint.com uma main upstream import backport #id:linuxmint_main
           2: deb http://archive.ubuntu.com/ubuntu focal main restricted universe multiverse
           3: deb http://archive.ubuntu.com/ubuntu focal-updates main restricted universe multiverse
           4: deb http://archive.ubuntu.com/ubuntu focal-backports main restricted universe multiverse
           5: deb http://security.ubuntu.com/ubuntu/ focal-security main restricted universe multiverse
           6: deb http://archive.canonical.com/ubuntu/ focal partner
           Active apt repos in: /etc/apt/sources.list.d/openvpn-aptrepo.list 
           1: deb http://build.openvpn.net/debian/openvpn/release/2.5 focal main
           Active apt repos in: /etc/apt/sources.list.d/opera-stable.list 
           1: deb https://deb.opera.com/opera-stable/ stable non-free #Opera Browser (final releases)
           Active apt repos in: /etc/apt/sources.list.d/pritunl.list 
           1: deb https://repo.pritunl.com/stable/apt focal main
           Active apt repos in: /etc/apt/sources.list.d/rednotebook-stable-focal.list 
           1: deb http://ppa.launchpad.net/rednotebook/stable/ubuntu focal main
           Active apt repos in: /etc/apt/sources.list.d/riot-im.list 
           1: deb [signed-by=/usr/share/keyrings/riot-im-archive-keyring.gpg] https://packages.riot.im/debian/ default main
           Active apt repos in: /etc/apt/sources.list.d/signal-xenial.list 
           1: deb [arch=amd64] https://updates.signal.org/desktop/apt xenial main
           Active apt repos in: /etc/apt/sources.list.d/steam.list 
           1: deb [arch=amd64,i386] https://repo.steampowered.com/steam/ stable steam
           2: deb-src [arch=amd64,i386] https://repo.steampowered.com/steam/ stable steam
           Active apt repos in: /etc/apt/sources.list.d/sublime-text.list 
           1: deb https://download.sublimetext.com/ apt/stable/
           Active apt repos in: /etc/apt/sources.list.d/virtualbox.list 
           1: deb [arch=amd64] http://download.virtualbox.org/virtualbox/debian focal contrib
           Active apt repos in: /etc/apt/sources.list.d/vivaldi-snapshot.list 
           1: deb http://repo.vivaldi.com/snapshot/deb/ stable main
           Active apt repos in: /etc/apt/sources.list.d/vscodium.list 
           1: deb https://paulcarroty.gitlab.io/vscodium-deb-rpm-repo/debs/ vscodium main
           Active apt repos in: /etc/apt/sources.list.d/yandex-browser-beta.list 
           1: deb [arch=amd64] http://repo.yandex.ru/yandex-browser/deb beta main
Info:      Processes: 501 Uptime: 4h 47m Memory: 62.79 GiB used: 4.64 GiB (7.4%) Init: systemd v: 245 runlevel: 5 Compilers: 
           gcc: 9.3.0 alt: 8/9 clang: 10.0.0-4ubuntu1 Shell: bash v: 5.0.17 running in: gnome-terminal inxi: 3.0.38

nvidia-bug-report.log.gz (515.0 KB)
