Got very frequent and random crashes on ubuntu 21.10, sometimes it takes half an hour sometimes 3 seconds after boot, no specific trigger, can be any application or action.
Tried 4.60 and 4.50 also with no difference, at 3840x1080 system is stable, used monitor is AOC AG493UCX
Unfortunately no improvement, checked gpu-manager log that it was disabled, not sure if xorg.conf was loaded since I had to create it.
Created a fresh nvidia-bugreport but that was empty.
I also wish to withdraw my statement that on lower resolution the system is stable, got 2 lockups with 3840x1080 to.
Sorry I have no further information at this point, will try to create a decent bugreport with nogpumanager kernel switch.
This bug report is changing in another direction, to get some work done I used the Nouveau driver and that one also crashes, not a complete hang I can reboot with ssh connection and with the nvidia driver not but still do…
The last freeze with the nvidia driver with nogpumanager did not write anything in dmesg, tried a couple of freezes and no trace of error in the logs, also nothing in nvidia bugreport but irq/193-nvidia is trowing 100% cpu in top.
Attached the dmesg outputs from both the nvidia and the nouveau incidents.
Having some issues compiling gpu-burn due to glibc/cuda version mismatch, will post the outcome as soon as I have test results, thanks for helping out so far!
Running latest bios version, currently using intel_idle.max_cstate=1 and nogpumanager as kernel options, system is currently ~40 hours stable and still running with the 470 driver…