We have a custom carrier board that we use to capture from 4 IMX568 sensors via GStreamer. Occasionally, when bringing up the 4 camera streams, one of the cameras fails to come online with the following error:
nvbuf_utils: dmabuf_fd -1 mapped entry NOT found
Looking at the nvargus-daemon logs, we see an error regarding nvphsd. The relevant log output is as follows:
Jun 13 22:57:57 looq nvargus-daemon[1790]: ---- imager: Found override file [/var/nvidia/nvcam/settings/camera_overrides.isp]. ----
Jun 13 22:57:57 looq nvargus-daemon[1790]: CAM: serial no file already exists, skips storing againLSC: LSC surface is not based on full res!
Jun 13 22:57:57 looq nvargus-daemon[1790]: E/ libnvphs:socket: Error[2]: socket connection /var/lib/nvphs/nvphsd.ctl to PHS failed: No such file or directory
Jun 13 22:57:57 looq nvargus-daemon[1790]: D/ libnvphs:socket: Warning: connecting to Power Hinting Service failed. Is PHS running?
Jun 13 22:57:57 looq nvargus-daemon[1790]: === gst-launch-1.0[1828]: CameraProvider initialized (0xffff7c5913a0)
=== gst-launch-1.0[1827]: CameraProvider initialized (0xffff74000c20)
=== gst-launch-1.0[1829]: CameraProvider initialized (0xffff6c000c20)
=== gst-launch-1.0[1830]: CameraProvider initialized (0xffff64000c20)
SCF: Error BadValue: NvPHSSendThroughputHints (in src/common/CameraPowerHint.cpp, function sendCameraPowerHint(), line 56)
This forum post suggested that we disable the NVPHS service, which has reduced the frequency of our error but not eliminated it.
We are currently using release 35.1.0 and have seen this issue on previous versions as well.
The issue is intermittent, appears to occur randomly, and is not tied to one particular device. We have not encountered any issues when bringing up a camera stream through V4L2 directly.
Clock boosting is currently enabled in our capture stack. We are still working on reproducing the issue. It is very intermittent and mostly seems to occur when we try to start all 4 cameras simultaneously.
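To make the simultaneous-start case easier to hit, a loop along the following lines could be used. This is only a rough sketch, not our actual capture stack: the `nvarguscamerasrc ... ! fakesink` pipeline in `camera_command` is a hypothetical stand-in, the helper names (`launch_all`, `failed_streams`, `reproduce`) are made up for illustration, and it assumes the `dmabuf_fd -1` message shows up on the stdout/stderr of the failing pipeline process.

```python
import subprocess
import threading

# Failure signature taken from the error message quoted at the top of this post.
FAILURE_SIGNATURE = "dmabuf_fd -1 mapped entry NOT found"


def launch_all(commands, timeout=60.0):
    """Start every command at (nearly) the same instant.

    Returns one (returncode, combined_output) tuple per command, in order.
    returncode is None if the command timed out or could not be started.
    """
    barrier = threading.Barrier(len(commands))
    results = [None] * len(commands)

    def worker(index, command):
        barrier.wait()  # hold every worker here until all are ready
        try:
            proc = subprocess.run(
                command,
                stdout=subprocess.PIPE,
                stderr=subprocess.STDOUT,
                text=True,
                timeout=timeout,
            )
            results[index] = (proc.returncode, proc.stdout)
        except subprocess.TimeoutExpired as exc:
            output = exc.stdout or ""
            if isinstance(output, bytes):
                output = output.decode(errors="replace")
            results[index] = (None, output)
        except OSError as exc:  # e.g. the binary is not installed
            results[index] = (None, str(exc))

    threads = [
        threading.Thread(target=worker, args=(i, cmd))
        for i, cmd in enumerate(commands)
    ]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    return results


def failed_streams(results, signature=FAILURE_SIGNATURE):
    """Indices of streams that exited abnormally or printed the signature."""
    return [
        i for i, (code, output) in enumerate(results)
        if code != 0 or signature in output
    ]


def reproduce(commands, attempts=50):
    """Repeat the simultaneous start; return (attempt, failed_indices) or None."""
    for attempt in range(attempts):
        bad = failed_streams(launch_all(commands))
        if bad:
            return attempt, bad
    return None


def camera_command(sensor_id, num_buffers=100):
    # Hypothetical test pipeline (an assumption, not our production pipeline).
    return [
        "gst-launch-1.0", "nvarguscamerasrc",
        f"sensor-id={sensor_id}", f"num-buffers={num_buffers}",
        "!", "fakesink",
    ]
```

Calling `reproduce([camera_command(i) for i in range(4)])` would then start the 4 pipelines together up to 50 times and stop at the first attempt where a stream fails, which is the point at which the nvargus-daemon log would be worth saving.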
Is there anything we should use to capture diagnostic information when we encounter another failure?