nvidia-persistenced - not running.

A few questions .

  1. What is the impact of errors like the following on cuda code (e.g. Tensorflow etc.) ? Running with driver 430 on Ubuntu.

nvidia-persistenced[38948]: The daemon no longer has permission to remove its runtime data directory /var/run/nvidia-persistenced

  1. Why the following uses a non-root user - which of course would not be able to access the /var/run directory.

[Unit]
Description=NVIDIA Persistence Daemon
Wants=syslog.target

[Service]
Type=forking
ExecStart=/usr/bin/nvidia-persistenced --user nvidia-persistenced --no-persistence-mode --verbose
ExecStopPost=/bin/rm -rf /var/run/nvidia-persistenced

[url]Driver Persistence :: GPU Deployment and Management Documentation

I see that the problem is a follows.

Failed to open libnvidia-cfg.so.1: libnvidia-cfg.so.1: cannot open shared object file: No such file or directory

Looks like libnvidia-cfg is a OpenGL/GLX library -see following.

libnvidia-cfg1-430/bionic 430.09-0ubuntu0~gpu18.04.1 amd64
NVIDIA binary OpenGL/GLX configuration library

Using NVIDIA GPUs exclusively for compute (e.g. headless driver) . There is a separate display driver in the system. Would installing libnvidia-cfg mess up that display driver ?

I found in order to fix the missing libnvidia-cfg.so.1 file issue, you need to install either libnvidia-cfg1 or libnvidia-legacy-390xx-cfg1
So the following command fix the issue.
sudo apt install libnvidia-legacy-390xx-cfg1