Hi all,
This was an issue with our nvidia-persistenced service on the latest Debian and Ubuntu test images. Engineering has identified the problem and submitted a fix. We have verified the fix on our systems.
Unfortunately, the fix is high-risk and requires a full QA test cycle at our end. This fix will be available in a future production branch.
Setting a lower limit for maximum number of open file descriptors can also be used as a potential workaround until the fix is available :-
# ulimit -Hn 16777216
or # ulimit -Hn 524288
instead of the current default value of 1073741816 (on Debian test, Ubuntu 24.10 nightly).
I will update this thread when the fix is available on a release driver.
Thank you