565 release feedback & discussion

I don’t appear to have the x11 one, but I do have egl-gbm:

paru -Qs egl
local/egl-gbm 1.1.2-1
    The GBM EGL external platform library
local/eglexternalplatform 1.2-2
    EGL External Platform interface
local/freeglut 3.6.0-1
    Free OpenGL Utility Toolkit
local/lib32-libglvnd 1.7.0-1
    The GL Vendor-Neutral Dispatch library
local/lib32-wayland 1.23.1-1
    A computer display server protocol
local/libglvnd 1.7.0-1
    The GL Vendor-Neutral Dispatch library
local/nvidia-egl-wayland-tkg 565.57.01-260
    NVIDIA EGL Wayland library (libnvidia-egl-wayland.so.1.1.17) for 'nvidia-utils-tkg'
local/wayland 1.23.1-1
    A computer display server protocol

As for egl-wayland: no, I was using the non-git version from the official Arch repo, which is still pretty bleeding edge, to be fair.
But when it broke, I switched to the one provided by nvidia-all, nvidia-egl-wayland-tkg 565.57.01-260.
I should mention that I was on Hyprland both when it broke and when it got fixed. That might play a role, since Hyprland has its own protocols afaik (aquamarine).

Since driver 555 I’ve been experiencing wine-vulkan errors after random intervals under Wayland and I still am as of 565.57.01.

Edit: I can reproduce this by playing KINGDOM HEARTS II FINAL MIX for a while and eventually it will happen. Sometimes though the game will instead just deadlock the rendering thread without the error popup.

Edit 2: Here’s the nvidia-bug-report.log.gz (324.1 KB) taken after this error occurred but before closing the application.

OS: NixOS Unstable 24.11 (Vicuna)
KERNEL: Linux 6.11.7
GNOME: 47
CPU: 12th Gen Intel® Core™ i9-12900KS × 24
GPU: NVIDIA GeForce RTX™ 3090 Ti
NVIDIA Driver: 565.57.01
DISPLAY: Wayland
RAM: 32 GB

That’s what I thought, too. I wonder if anyone else with an NVIDIA dGPU is facing the same problem (an external monitor plugged into HDMI is not recognized). If 6.12 goes stable like this, it will be a no-go for me, as I depend heavily on an external monitor for daily work ={

The weird thing is that inxi -Gxx recognizes both the GTX 1650 dGPU and the external display, but GNOME display selection only shows the built-in display:

❯ uname -r
6.12.0-0.rc7.58.fc42.x86_64

❯ inxi -Gxx
Graphics:
  Device-1: NVIDIA TU117M [GeForce GTX 1650 Mobile / Max-Q]
    vendor: Acer Incorporated ALI driver: nvidia v: 565.57.01 arch: Turing pcie:
    speed: 2.5 GT/s lanes: 8 ports: active: none off: HDMI-A-1 empty: none
    bus-ID: 01:00.0 chip-ID: 10de:1f99
  Device-2: Advanced Micro Devices [AMD/ATI] Renoir [Radeon Vega Series /
    Radeon Mobile Series] vendor: Acer Incorporated ALI driver: amdgpu
    v: kernel arch: GCN-5 pcie: speed: 16 GT/s lanes: 16 ports: active: eDP-1
    empty: none bus-ID: 05:00.0 chip-ID: 1002:1636 temp: 60.0 C
  Device-3: Chicony HD User Facing driver: uvcvideo type: USB rev: 2.0
    speed: 480 Mb/s lanes: 1 bus-ID: 3-3:3 chip-ID: 04f2:b64f
  Display: wayland server: X.org v: 1.21.1.14 with: Xwayland v: 24.1.4
    compositor: gnome-shell v: 47.1 driver: X: loaded: amdgpu,nvidia
    unloaded: modesetting,nouveau,radeon alternate: fbdev,nv,vesa
    dri: radeonsi gpu: amdgpu,nvidia,nvidia-nvswitch display-ID: 0
  Monitor-1: HDMI-A-1 model: AOC 24B1W1G5 res: 1920x1080 dpi: 93
    diag: 604mm (23.8")
  Monitor-2: eDP-1 model: AU Optronics 0xaf90 res: 1920x1080 dpi: 142
    diag: 394mm (15.5")
  API: EGL v: 1.5 platforms: device: 0 drv: nvidia device: 1 drv: radeonsi
    device: 3 drv: swrast gbm: drv: kms_swrast surfaceless: drv: nvidia wayland:
    drv: radeonsi x11: drv: radeonsi inactive: device-2
  API: OpenGL v: 4.6.0 compat-v: 4.5 vendor: amd mesa v: 24.2.6 glx-v: 1.4
    direct-render: yes renderer: AMD Radeon Graphics (radeonsi renoir LLVM
    19.1.0 DRM 3.59 6.12.0-0.rc7.58.fc42.x86_64) device-ID: 1002:1636
    display-ID: :0.0
  API: Vulkan v: 1.3.296 surfaces: xcb,xlib,wayland device: 0
    type: integrated-gpu driver: N/A device-ID: 1002:1636 device: 1
    type: discrete-gpu driver: N/A device-ID: 10de:1f99 device: 2 type: cpu
    driver: N/A device-ID: 10005:0000

NOTE: I’m using GSK_RENDERER=ngl (otherwise GNOME apps don’t run)
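
For anyone comparing notes on the missing external display: a minimal Python sketch for checking what the kernel itself reports per connector, independent of inxi or GNOME. It assumes the standard DRM sysfs layout, where each connector directory such as `card1-HDMI-A-1` contains a `status` file. The function name and the `drm_root` parameter are made up for illustration.

```python
from pathlib import Path


def connector_status(drm_root="/sys/class/drm"):
    """Return {connector_name: status} for every DRM connector under drm_root."""
    result = {}
    # Connector directories are named <card>-<connector>, e.g. card1-HDMI-A-1.
    # Plain card0 / renderD128 entries have no dash-suffixed name and are skipped.
    for status_file in sorted(Path(drm_root).glob("card*-*/status")):
        # The file holds "connected", "disconnected" or "unknown".
        result[status_file.parent.name] = status_file.read_text().strip()
    return result
```

Calling `connector_status()` with no argument reads the live system. If HDMI-A-1 shows up as `disconnected` there while the cable is plugged in, the problem would seem to sit below the compositor rather than in GNOME's display settings.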

Anyone from Nvidia managed to reproduce this? @scaronni , @agoins , @abchauhan , @esullivan ?

For the record, I filed a bug on RPMFusion Bugzilla about this.

I’m using GSK_RENDERER=vulkan right now, and it works for me after a recent GNOME package update (previously I had the same problem as you, with GNOME apps not opening). But maybe that’s because I have newer packages from Arch.

For me it’s OK. RTX 4090 with 3 monitors (DP). I am getting P8 with no trouble.

It seems to happen only with certain combinations of resolution/refresh rate and GPU model. You are probably lucky; I am not.

Cool! But is suspend/resume working just fine for you? Would you mind sharing your setup (desktop? notebook? iGPU? dGPU? external monitor?) so that I could try to compare with mine?

BTW I just saw there is a new 6.12rc7 on Rawhide, I’ll try it later on to see if I can finally use an external monitor again.

EDIT: just tested with kernel 6.12.0-0.rc7.59.fc42 (latest from Rawhide), external display is not available.

❯ uname -r
6.12.0-0.rc7.59.fc42.x86_64

❯ modinfo -F version nvidia
565.57.01

nvidia-bug-report.log.gz (470.7 KB)

Ok.

So I am at a point where these 565 Drivers are working without any noticeable errors or faults on boot.

However, the FPS under Wayland is below 10. If I log into GNOME on Xorg (X11), it’s as expected.

inxi
CPU: quad core AMD Ryzen 3 5300U with Radeon Graphics (-MT MCP-)
speed/min/max: 1549/1400/3900 MHz Kernel: 6.1.0-18-amd64 x86_64 Up: 24m
Mem: 4692.0/31977.6 MiB (14.7%) Storage: 2.96 TiB (62.6% used) Procs: 350
Shell: Bash inxi: 3.3.26

inxi -Ga
Graphics:
  Device-1: NVIDIA GA106 [Geforce RTX 3050] vendor: Micro-Star MSI
    driver: nvidia v: 565.57.01 alternate: nouveau,nvidia_drm non-free: 530.xx+
    status: current (as of 2023-03) arch: Ampere code: GAxxx
    process: TSMC n7 (7nm) built: 2020-22 pcie: gen: 1 speed: 2.5 GT/s
    lanes: 4 link-max: gen: 4 speed: 16 GT/s lanes: 16 ports: active: none
    off: DP-1 empty: DP-2,DP-3,HDMI-A-1 bus-ID: 04:00.0 chip-ID: 10de:2507
    class-ID: 0300
  Device-2: Logitech BRIO Ultra HD Webcam type: USB
    driver: hid-generic,snd-usb-audio,usbhid,uvcvideo bus-ID: 2-2.2:5
    chip-ID: 046d:085e class-ID: 0300 serial: E4C3B085
  Display: wayland server: X.org v: 1.21.1.7 with: Xwayland v: 22.1.9
    compositor: gnome-shell v: 43.9 driver: X: loaded: nvidia
    unloaded: modesetting gpu: nvidia display-ID: 0
  Monitor-1: DP-1 res: 5120x1440 size: N/A modes: N/A
  API: OpenGL v: 4.6.0 NVIDIA 565.57.01 renderer: NVIDIA GeForce RTX
    3050/PCIe/SSE2 direct-render: Yes

I notice that inxi reports the driver in use as nvidia and lists nvidia_drm only as an alternate (very confusing, by the way). Or is nvidia_drm loaded dynamically?

sudo journalctl -xb 0 -g "nvidia"
Nov 15 08:25:47 debian kernel: integrity: Loaded X.509 cert 'nvidia-installer generated signing key: 0beb68fd3ea984df8fabc8e82c8c0f4cbbe2a99a'
Nov 15 08:25:51 debian kernel: nvidia: loading out-of-tree module taints kernel.
Nov 15 08:25:51 debian kernel: nvidia-nvlink: Nvlink Core is being initialized, major device number 242
Nov 15 08:25:51 debian kernel: nvidia 0000:04:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=none:owns=none
Nov 15 08:25:51 debian kernel: NVRM: loading NVIDIA UNIX Open Kernel Module for x86_64  565.57.01  Release Build  (dvs-builder@U16-A24-9-2)  Thu Oct 10 12:15:00 UTC 2024
Nov 15 08:25:51 debian kernel: nvidia-modeset: Loading NVIDIA UNIX Open Kernel Mode Setting Driver for x86_64  565.57.01  Release Build  (dvs-builder@U16-A24-9-2)  Thu Oct 10 12:03:51 UTC 2024
Nov 15 08:25:52 debian kernel: [drm] [nvidia-drm] [GPU ID 0x00000400] Loading driver
Nov 15 08:25:52 debian kernel: input: HDA NVidia HDMI/DP,pcm=3 as /devices/pci0000:00/0000:00:02.3/0000:04:00.1/sound/card0/input25
Nov 15 08:25:52 debian kernel: input: HDA NVidia HDMI/DP,pcm=7 as /devices/pci0000:00/0000:00:02.3/0000:04:00.1/sound/card0/input26
Nov 15 08:25:52 debian kernel: input: HDA NVidia HDMI/DP,pcm=8 as /devices/pci0000:00/0000:00:02.3/0000:04:00.1/sound/card0/input27
Nov 15 08:25:52 debian kernel: input: HDA NVidia HDMI/DP,pcm=9 as /devices/pci0000:00/0000:00:02.3/0000:04:00.1/sound/card0/input28
Nov 15 08:25:54 debian kernel: nvidia 0000:04:00.0: firmware: direct-loading firmware nvidia/565.57.01/gsp_ga10x.bin
Nov 15 08:25:55 debian kernel: [drm] Initialized nvidia-drm 0.0.0 20160202 for 0000:04:00.0 on minor 0
Nov 15 08:25:55 debian kernel: fbcon: nvidia-drmdrmfb (fb0) is primary device
Nov 15 08:25:55 debian kernel: nvidia 0000:04:00.0: [drm] fb0: nvidia-drmdrmfb frame buffer device
Nov 15 08:26:08 debian kernel: audit: type=1400 audit(1731659160.005:7): apparmor="STATUS" operation="profile_load" profile="unconfined" name="nvidia_modprobe" pid=1024 comm="apparmor_parser"
Nov 15 08:26:08 debian kernel: audit: type=1400 audit(1731659160.005:8): apparmor="STATUS" operation="profile_load" profile="unconfined" name="nvidia_modprobe//kmod" pid=1024 comm="apparmor_parser"
Nov 15 08:26:00 debian audit[1024]: AVC apparmor="STATUS" operation="profile_load" profile="unconfined" name="nvidia_modprobe" pid=1024 comm="apparmor_parser"
Nov 15 08:26:00 debian audit[1024]: AVC apparmor="STATUS" operation="profile_load" profile="unconfined" name="nvidia_modprobe//kmod" pid=1024 comm="apparmor_parser"
Nov 15 08:26:09 debian systemd[1]: Starting nvidia-persistenced.service - NVIDIA Persistence Daemon...
░░ Subject: A start job for unit nvidia-persistenced.service has begun execution
░░ Defined-By: systemd
░░ Support: https://www.debian.org/support
░░ 
░░ A start job for unit nvidia-persistenced.service has begun execution.
░░ 
░░ The job identifier is 175.
Nov 15 08:26:09 debian systemd[1]: Started nvidia-powerd.service - nvidia-powerd service.
░░ Subject: A start job for unit nvidia-powerd.service has finished successfully
░░ Defined-By: systemd
░░ Support: https://www.debian.org/support
░░ 
░░ A start job for unit nvidia-powerd.service has finished successfully.
░░ 
░░ The job identifier is 185.
Nov 15 08:26:09 debian /usr/bin/nvidia-powerd[1168]: nvidia-powerd version:1.0(build 1)
Nov 15 08:26:10 debian systemd[1]: Started nvidia-persistenced.service - NVIDIA Persistence Daemon.
░░ Subject: A start job for unit nvidia-persistenced.service has finished successfully
░░ Defined-By: systemd
░░ Support: https://www.debian.org/support
░░ 
░░ A start job for unit nvidia-persistenced.service has finished successfully.
░░ 
░░ The job identifier is 175.
Nov 15 08:26:10 debian systemd[1]: nvidia-powerd.service: Deactivated successfully.
░░ Subject: Unit succeeded
░░ Defined-By: systemd
░░ Support: https://www.debian.org/support
░░ 
░░ The unit nvidia-powerd.service has successfully entered the 'dead' state.
Nov 15 08:27:22 debian gnome-shell[3089]: Added device '/dev/dri/card0' (nvidia-drm) using atomic mode setting.
Nov 15 08:27:50 debian kernel: nvidia-uvm: Loaded the UVM driver, major device number 239.
Nov 15 08:28:32 debian gnome-shell[4440]: Added device '/dev/dri/card0' (nvidia-drm) using atomic mode setting.
Nov 15 08:29:20 debian kernel: audit: type=1400 audit(1731659360.432:45): apparmor="DENIED" operation="open" profile="/usr/bin/akonadiserver" name="/usr/share/nvidia/nvidia-application-profiles-565.57.01-rc" pid=5688 comm="akonadiserver" requested_mask="r" denied_mask="r" fsuid=1000 ouid=0
Nov 15 08:29:20 debian audit[5688]: AVC apparmor="DENIED" operation="open" profile="/usr/bin/akonadiserver" name="/usr/share/nvidia/nvidia-application-profiles-565.57.01-rc" pid=5688 comm="akonadiserver" requested_mask="r" denied_mask="r" fsuid=1000 ouid=0
Nov 15 08:36:04 debian microsoft-edge.desktop[10029]: Warning: loader_scanned_icd_add: Could not get 'vkCreateInstance' via 'vk_icdGetInstanceProcAddr' for ICD libGLX_nvidia.so.0
Nov 15 08:39:33 debian sudo[12587]: xx-pc : TTY=pts/0 ; PWD=/etc/X11/xorg.conf.d ; USER=root ; COMMAND=/usr/bin/nano 20-nvidia.conf
Nov 15 08:52:10 debian sudo[18907]: xx-pc : TTY=pts/0 ; PWD=/boot/grub/x86_64-efi ; USER=root ; COMMAND=/usr/bin/journalctl -xb 0 -g nvidia
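
On the nvidia_drm question: the journal above already shows `[drm] Initialized nvidia-drm`, so the module does get loaded next to the main `nvidia` one. A minimal Python sketch to confirm that from the module list, assuming the usual `/proc/modules` format where the first field on each line is the module name (the function name is hypothetical):

```python
def loaded_nvidia_modules(proc_modules_text):
    """Return the sorted names of loaded kernel modules that start with 'nvidia'."""
    names = []
    for line in proc_modules_text.splitlines():
        fields = line.split()
        # First whitespace-separated field is the module name.
        if fields and fields[0].startswith("nvidia"):
            names.append(fields[0])
    return sorted(names)
```

Usage would be something like `loaded_nvidia_modules(open("/proc/modules").read())`. On a working Wayland setup I would expect `nvidia`, `nvidia_drm` and `nvidia_modeset` in the result.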

vainfo also fails under Wayland, yet it works fine under GNOME on X11 (Xorg).

vainfo
libva info: VA-API version 1.17.0
libva error: vaGetDriverNameByIndex() failed with invalid VADisplay, driver_name = (null)
vaInitialize failed with error code 3 (invalid VADisplay),exit

I’m seeing much worse FPS and higher frametimes in Vulkan applications under Linux than under Windows. This is not exclusive to programs running under Proton, although it’s especially prevalent there; it also affects native applications that use Vulkan.
I’ve run the same programs under Windows 11 (LTSC 24H2) and under CachyOS with both the bore scheduler and the standard eevdf scheduler, and performance is consistently better under Windows. I even installed Windows in a QEMU virtual machine and passed the NVIDIA GPU through to it, so that the virtual machine is subject to the Linux kernel and scheduler, just to rule that out. The Windows virtual machine was still consistently faster in Vulkan applications than the same applications running on the host Linux OS.

I’ve been seeing a lot of “flip event timeout on head 0” errors lately on Nobara 40. It usually happens when I have a YouTube video running. The screen locks up, but I can still hear the sound, and I can also hit the space bar to pause. The only workaround I have found so far is to switch to TTY3 and then, once that loads (without entering my login info), switch back. 9 times out of 10, the desktop environment is responsive again. The other 1 time out of 10, it isn’t fixed and requires a hard reboot.

This usually happens several times a day, often within tens of minutes of each other.
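
To put numbers on "several times a day", here is a rough Python sketch that pulls the timestamps of these errors out of the kernel log and computes the gaps between them. It assumes the log was exported with `journalctl -k -o short-iso`, so each line starts with an ISO 8601 timestamp, and that the message contains the phrase "flip event timeout". The function names are invented for this example.

```python
from datetime import datetime


def flip_timeouts(journal_lines):
    """Return the timestamps of lines that mention a flip event timeout."""
    stamps = []
    for line in journal_lines:
        if "flip event timeout" in line.lower():
            # First whitespace-separated field is the ISO 8601 timestamp.
            first_field = line.split(maxsplit=1)[0]
            stamps.append(datetime.strptime(first_field, "%Y-%m-%dT%H:%M:%S%z"))
    return stamps


def gaps_in_minutes(stamps):
    """Minutes between consecutive timestamps."""
    return [(b - a).total_seconds() / 60 for a, b in zip(stamps, stamps[1:])]
```

Feeding it the lines of the exported log gives a list of gaps that could be attached to a bug report alongside nvidia-bug-report.log.gz.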

I’d love to get to the bottom of the sleep/suspend issues… X11 works fine, but Wayland just cannot recover on my side, no matter what I try. The best I could get is a GDM screen with missing visual assets. That is far better than the unresponsive black screen I was getting before, but still broken.

6.6.59-200.fc41.x86_64 with 565.57.01
Desktop PC with GDM/Gnome Wayland on a RTX 4800

options nvidia-drm modeset=1 fbdev=1
options nvidia NVreg_DynamicPowerManagement=0x02
options nvidia NVreg_EnableS0ixPowerManagement=1
options nvidia NVreg_PreserveVideoMemoryAllocations=1

systemctl is-enabled nvidia-hibernate nvidia-resume nvidia-suspend nvidia-persistenced nvidia-powerd
enabled
enabled
enabled
enabled
disabled
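
In case it helps others compare configurations, a minimal Python sketch that turns `options` lines like the ones above into a dictionary, so two setups can be diffed key by key. It handles only `options` lines and `#` comments, nothing else from the modprobe.d syntax, and the function name is hypothetical.

```python
def parse_modprobe_options(text):
    """Collect `options <module> key=value ...` lines into {module: {key: value}}."""
    options = {}
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop trailing comments
        parts = line.split()
        if len(parts) < 3 or parts[0] != "options":
            continue
        # modprobe treats '-' and '_' in module names as equivalent.
        module = parts[1].replace("-", "_")
        for item in parts[2:]:
            key, _, value = item.partition("=")
            options.setdefault(module, {})[key] = value
    return options
```

A token without `=` ends up with an empty string as its value, which makes a stray word on an `options` line easy to spot.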

I feel your pain… same here :-( It’s pretty bad as it is, and to make matters worse, kernel 6.12 which was supposed to fix these issues brings another problem for me by losing the ability to detect the external monitor. I’ll have to choose the lesser of two evils, which will mean I’ll be stuck with kernel 6.11 and broken suspend/resume.

A couple of questions regarding your configuration:

  1. is the “mode” by itself on the first line correct?
  2. until now, I wasn’t using nvidia-persistenced and was getting expected suspend/resume; also, it is not mentioned on the HOWTO; are you sure it is needed?

My GPU (GTX 1650) doesn’t support S0ix power management, so I’m out of luck here. As for the PreserveVideoMemoryAllocations I was not using it either, do you know if this became mandatory with the 565 driver?

  1. The “mode” was a typo.
  2. From reading the documentation, my understanding is that we do not need the persistenced service running.

For PreserveVideoMemoryAllocations, I do think we need it. I’ve seen mentions of it while reading some discussions in the Arch forums. I’d love to get a proper explanation of what we need and why it fails…

Thanks for the info. As for the PreserveVideoMemoryAllocations, my understanding from the docs is that it allows proper/full suspend/resume at the cost of more disk space:

The GPU state saved by the NVIDIA kernel drivers includes allocations made in video memory. However, these allocations are collectively large, and typically cannot be evicted. Since the amount of system memory available to drivers at suspend time is often insufficient to accommodate large portions of video memory, the NVIDIA kernel drivers are designed to act conservatively, and normally only save essential video memory allocations.

The resulting loss of video memory contents is partially compensated for by the user-space NVIDIA drivers, and by some applications, but can lead to failures such as rendering corruption and application crashes upon exit from power management cycles.

However, so far it hasn’t seemed necessary for my day-to-day use. YMMV
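
To see which value the loaded driver is actually using, rather than what a config file asks for, the module exposes its parameters in `/proc/driver/nvidia/params`. Below is a minimal Python sketch for reading it, assuming the file consists of `Name: value` lines. The function name is made up for this example.

```python
def parse_nvidia_params(text):
    """Parse 'Name: value' lines as found in /proc/driver/nvidia/params."""
    params = {}
    for line in text.splitlines():
        name, sep, value = line.partition(":")
        if sep:  # skip lines without a colon
            params[name.strip()] = value.strip()
    return params
```

With that, `parse_nvidia_params(open("/proc/driver/nvidia/params").read()).get("PreserveVideoMemoryAllocations")` should tell you whether the option took effect after a reboot.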

I’ve tried to give the NVIDIA team as much information as possible in my comments on this post, including bug reports and full repro steps for quite a few bugs. However, after weeks on end (almost 6 months for some), I’ve seen no reply or acknowledgement from the NVIDIA team for many of my issues. To make it easier for you, I will link my current issues below for you to look through and create internal bug reports:

osu!lazer (https://github.com/ppy/osu/releases/latest/download/osu.AppImage) crashes on startup when using the native Vulkan backend instead of OpenGL:

Still exceedingly high VRAM usage in games (Spider-Man Remastered, Doom Eternal, Assetto Corsa) and Wayland desktops (Hyprland). Supposedly fixed, but it isn’t:

Cemu with the Vulkan backend crashes on game start with a segfault. Every possible log was provided here, but there is still no reply:

Satisfactory with the Vulkan backend crashes with Assertion failed, Expression: "!status && "vkWaitForFences"":

NVIDIA drivers not assigning the correct monitors, leading to different displays being used for the BIOS and for the TTY. Without fbdev, both the BIOS and the TTY display on my 1440p monitor; with fbdev, the BIOS is on the main monitor, while the TTY is rendered at 1080p and appears very small on the 1440p monitor:

Exceedingly high power draw and clocks when CUDA apps are running. This issue is getting out of hand and needs a fix soon. My GPU fans spin up just from watching a YouTube video. This should not be happening:

Higher power usage when VRR is enabled. At idle, my GPU goes from ~10W to ~34W just from having VRR enabled:

Same issue for me.

HDMI doesn’t work on 6.12 on an ASUS ROG Zephyrus G16 2024 (4070).
I use an external monitor more than 85% of the time, so I had to revert to 6.11, but at least suspend works nicely on 6.12 now.

Thanks for confirming, this shows the problem is somewhat widespread, and hopefully increases the chances of it catching some attention.

Also, good to know 6.12 indeed fixes suspend/resume.

I’m not sure if this is the right place to ask. But any news when version 570 is going to be released?

I feel you brother. Hopefully soon, but the team doesn’t seem to like giving timelines for obvious reasons.

The new 6.12 kernel has arrived, and it seems it fails to build the NVIDIA modules at all. In any case, 6.12 and the latest NVIDIA beta are not working together. Any solution, please?

make.log (5.5 MB)
nvidia-bug-report.log.gz (4.8 MB)
