After upgrading from 455.45.01 to 460.32.03, I get a black screen after loading of nvidia-modeset: First, the screen goes black, then the backlight is toggled several times (about 10 times), finally it stays blank and dark.
Kernel log contains:
Jan 10 19:05:05 localhost kernel: [ 5.773942] [drm] [nvidia-drm] [GPU ID 0x00000100] Loading driver
Jan 10 19:05:05 localhost kernel: [ 10.275739] [drm] Initialized nvidia-drm 0.0.0 20160202 for 0000:01:00.0 on minor 0
Jan 10 19:05:20 localhost kernel: [ 35.249151] nvidia-modeset: ERROR: GPU:0: Display engine push buffer channel allocation failed: 0x65 (Call timed out [NV_ERR_TIMEOUT])
Jan 10 19:05:20 localhost kernel: [ 35.249945] nvidia-modeset: ERROR: GPU:0: Failed to allocate display engine core DMA push buffer
Downgrading back to 455.45.01 fixes this. I observe the same with 460.27.04.
Since 460.32.03 is now released as stable and contains critical security fixes, Iām raising this as a new issue, though it is likely the same as already reported in:
Hereās the bug report file: nvidia-bug-report.log.gz (1.1 MB)
As usual, I had to kill vulkaninfo which did hang in terminal mode otherwise.
That would also mean to disable secure boot, in turn causing other issues (security, functionality loss in Windows). I can for sure try that for a test, but could you explain / link to some explanation on why you expect this to change behaviour?
setting nvidia-drm.modeset=1 kernel parameter
Thatās already the case, as you can see from the report:
The error youāre getting is a recurring bug on older hardware, seems to be related to early vbios init. Turning on CSM often worked around it. Since this bug is very harware specific, itās rarely ever getting fixed.
Thanks, I will give CSM a try later then (disabling SecureBoot means unregistering my signing keys, so I am reluctant there).
For me, the issue is new with the 460 drivers and to my memory never showed up before even though I have been following new releases (also beta) for years, so in my case, it is not recurring.
I am seeing a similar error, machine boots ok but after suspend/resume shows a black screen then after ~120 seconds an error message. This is with nvidia-460 on a Razr Blade 15" (2018) with external monitors. Enabling CSM did not help. [also nvidia-drm.modeset=1 does not help].
Are you sure this isnāt just a regression? Iāve not seen this error previously in 18 months or so of using this laptop, and the laptop is newer hardware.
Actually, turns out I canāt: Activating CSM on that laptop happens implicitly only after the following steps:
Disable SecureBoot (which leads to loss of functionality and safety, but fine for a test). Doing this alone does not change anything, I still see the issue.
Enable āLoad legacy option ROMsā. This also means the legacy graphics will be loaded, and while my Linux EFI bootloader refind is still seen by the UEFI, it does not load it anymore.
So it seems enabling CSM and booting via UEFI is not possible with this UEFI.
Other ideas welcome. Also, please let me know if this issue should be reported to the nvidia bug report mail or whether reporting in these forums is sufficient to raise awareness. While my hardware is old(ish), Iām still reluctant to accept this is not a regression, given that this is new behaviour with the R460 series on my hardware, and seeing the reports by others.
Iāve run in the same issue, see my duplicate help request (Iām sorry about making a duplicate). Today Iāve tried various bioses for my HP zbook studio G3 (Nvidia Quadro M1000M) but it didnāt help. I played around with secure boot and legacy bios modus. Nothing worked.
Added to the issue of crashing on waking from sleep I experience problems with audio over HDMI. Where my tv was able to provide audio with the 450 drivers Iām unable to have it work under the 460 drivers. Independent whether I use the On-Demand or Nvidia (Performance Mode). Neither was I able to make it work on my Intel (Power Saving Mode) although that wasnāt possible with the 450 drivers either.
As a side note my HDMI connector is linked to the Nvidia card and I needed to wait on the reverse prime to be able to use Nvidia On-Demand when using my HDMI connection instead of my internal screen. Iām not sure if this detail matters for our current issue.
Note that this still happens for me with 460.56 (but there was also no related change in the release notes). If a new bug report might be helpful for further investigation, just let me know.
The model numbers this update broke is quite high and even recent ones like the lenovo legion 7 are affected. The bios update lenovo provided for that only fixed the situation running windows. I really donāt know where this is leading.
Iām pretty sure this is a regression as I was working fine with Ubuntu 18.04 with a 410.x driver . Iām now running 460.39 under Ubuntu 20.10. The hardware is HP Zbook Studio G3 with Quadro M1000M GPU.
Exactly the same happens on my Razor 15" (2018) without any external monitor. NVIDIA GeForce GTX 1060 Max-Q. It happened on Ubuntu 20.04 and still happens on Ubuntu 20.10. Branches 450 and 460 both does not work properly. Only the legacy version 390 works correctly.
This started happening for me (black screen when resuming after suspend) after an automatic update to 460.32.03 from 455.38 (see history.log file). Iām on an Acer Aspire 7 with NVIDIA GeForce GTX 1050, with Ubuntu 20.04.2 LTS.
Iāve been circumventing the issue by keeping my PRIME Profile on āIntel (Power Saving Mode)ā unless Iām using some more graphics intensive software.
Happens to me as well on a Lenovo Legion Y720 with Ubuntu 20.04 , driver 460 . If I have a second monitor attached via the HDMI port screen doesnāt stay blank. In fact if I remove the second monitor suspend still works until the laptop is rebooted without the second monitor. So if I want suspend to work I just need to connect a second monitor, then remove it. If I switch to my Intel card using prime-select this also works.
Iāve rolled back to 450 drivers. But came back here to see whether a fix was found or developed by nvidia.
Now the new driver 465.24.02 notes discusses various regressions that are fixed. It is unclear however whether our problem is in that list. I donāt think it is, but there is the mention of a regression related to suspend behaviour. Is it worth testing these drivers? Or is it better to wait longer for a new production branch driver?