All drivers since 535.98 are not working on my machine, I got freeze, display glitch …
I’m playing Diablo 4, the game is working perfectly with 535.98, but when I try to update a lot of issues appears.
Please have a look to kernel.log we clearly see the stack error :
launch Diablo 4 using Proton Eggroll : here
I tried with 8.22 and 8.3 issues occurs. About 8.22 I’m not sure, but there might be also an issue with it.
Hi @poupouille
I am unfortunately not able to duplicate issue locally after trying steps in your earlier comments.
I will spend few more cycles on few other systems and update.
I have also filed a bug 4464466 internally for tracking purpose.
Hi @poupouille
Just wanted to know if you have any other steps which have reproduced the same issue.
Because I am still not able to duplicate issue with the earlier steps share by you.
I worked around the issue in Diablo 4 by playing with lower resolutions which ate less VRAM. In texture at maximum capacity the crash was occuring. I did not retest.
As I gave you the call stack, you should be able to find the issue ;)
Or maybe the issue is on my setup, but no other games crash the system like that.
Hi @poupouille
I spent multiple hours to repro issue with driver 550.54.14 and reported driver 535.129.03 on below setup but still no luck.
Dell Alienware Aurora R15 AMD + AMD Ryzen 9 7900X 12-Core Processor + Ubuntu 22.04.2 LTS + kernel 5.19.0-32-generic + NVIDIA GeForce RTX 3080 + Driver 535.129.03 + DELL G3223D Display 2560x1440 with refresh rate 60Hz
Steps Tried -
Logged in system and launched steam game Diablo IV game in “Ultra settings” for texture.
Kept it running for an hour or so and ran Unigine benchmark to increase VRAM usage and later closed all apps.
root@oemqa-Alienware-Aurora-R15-AMD:~# nvidia-smi
Mon May 6 12:20:28 2024
±----------------------------------------------------------------------------------------+
| NVIDIA-SMI 550.54.14 Driver Version: 550.54.14 CUDA Version: 12.4 |
|-----------------------------------------±-----------------------±---------------------+
| GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|=========================================+========================+======================|
| 0 NVIDIA GeForce RTX 3080 Off | 00000000:01:00.0 On | N/A |
| 48% 69C P0 211W / 320W | 9865MiB / 10240MiB | 100% Default |
| | | N/A |
±----------------------------------------±-----------------------±---------------------+
±----------------------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=========================================================================================|
| 0 N/A N/A 1053 G /usr/lib/xorg/Xorg 382MiB |
| 0 N/A N/A 1536 G /usr/bin/gnome-shell 76MiB |
| 0 N/A N/A 9034 G ./heaven_x64 433MiB |
| 0 N/A N/A 9091 G ./heaven_x64 423MiB |
| 0 N/A N/A 9154 G ./heaven_x64 429MiB |
| 0 N/A N/A 9232 G ./heaven_x64 372MiB |
| 0 N/A N/A 10673 G ./steamwebhelper 5MiB |
| 0 N/A N/A 12118 C+G …apps\common\Diablo IV\Diablo IV.exe 7685MiB |
±----------------------------------------------------------------------------------------+
3) Then I suspend the system for sometime.
4) Upon resume, I restarted the apps and kept is running for sometime again and then rebooted it.
5) Repeated above steps couple of times but did not observe system freeze.
@amrits
Hello, I have the same issue in Dell G16 with RTX4060(Driver version is 550.78), 16 GB memory. I reported it here, though no official care. I recently found a way to 100% reproduce such an issue.
Enable swap space with enough capacity.
Use memetester to eat nearly all of your possible memory.
Run multiple application that needs to use Nvidia to render.
Wait, and Nvidia will crash finally.
If you unplug your charger and keep the battery at a low level(below 30%), it will be very quick to reproduce.
Sorry for late answer.
I’m running now 560.28.03 drivers, I did not observe system crash any more with this driver.
I have better performances with Diablo V latest game patch.
Regarding your test, you did not fill the VRAM completly : " 9865MiB / 10240MiB"
As you mention about VRAM usage, I observed issue in Diablo V and other applications, that when VRAM is getting full, the following issues are observed :
KWin (kde window manager) might crash
Slowness in Diablo V : It become slow after a while, I have to switch from High => Low => High to get best performance again. I notice that after VRAM as around 1Gb free. When it happens again, the free VRAM is near 0.
Is it possible you answer on this point ? Around the web, a lot of people claimed that nvidia_uvm is not working.
It means that there is no shared memory between GPU and CPU to offload VRAM when it’s getting full.
Maybe the crash I observed initially what related to this ?