DRIVE OS Version: 7.0.3
Issue Description:
After power on and reboot the system starts without issue and the mgbe3_0 interface acquires time-sync from an external gPTP master device connected to the DriveAGX Thor unit.
Monitoring ptp4l shows normal behavior and the port stays in SLAVE mode and synchronizes to the external master without issue.
Subsequently launching DW sample_camera with a simple rig file that enables a single camera the log output of ptp4l for the nv_ptp4l_slave_mgbe3_0.service shows a failure and the port transitions to FAULTY state and synchronization stops.
Mar 24 20:51:52 tegra-ubuntu ptp4l[2460]: [376.257] failed to step clock: No space left on device
Mar 24 20:51:52 tegra-ubuntu ptp4l[2460]: [376.507] failed to step clock: No space left on device
Mar 24 20:51:52 tegra-ubuntu ptp4l[2460]: [376.616] missing timestamp on transmitted peer delay request
Mar 24 20:51:52 tegra-ubuntu ptp4l[2460]: [376.616] port 1 (mgbe3_0): SLAVE to FAULTY on FAULT_DETECTED (FT_UNSPECIFIED)
Shortly aftewards recovery is attempted:
Mar 24 20:52:08 tegra-ubuntu ptp4l[2460]: [392.908] port 1 (mgbe3_0): FAULTY to SLAVE on INIT_COMPLETE
Mar 24 20:52:09 tegra-ubuntu ptp4l[2460]: [393.121] failed to step clock: No space left on device
Subsequently it fails again:
Mar 24 20:52:09 tegra-ubuntu ptp4l[2460]: [393.373] failed to step clock: No space left on device
Mar 24 20:52:09 tegra-ubuntu ptp4l[2460]: [393.626] failed to step clock: No space left on device
Mar 24 20:52:09 tegra-ubuntu ptp4l[2460]: [393.752] rms 2905807845 max 2905808269 freq +12335 +/- 69
Mar 24 20:52:09 tegra-ubuntu ptp4l[2460]: [393.879] failed to step clock: No space left on device
Mar 24 20:52:09 tegra-ubuntu ptp4l[2460]: [393.972] missing timestamp on transmitted peer delay request
Mar 24 20:52:09 tegra-ubuntu ptp4l[2460]: [393.972] port 1 (mgbe3_0): SLAVE to FAULTY on FAULT_DETECTED (FT_UNSPECIFIED)
This continues to repeat. It continues even if sample_camera is stopped. The only recovery is to shutdown, power-off, and restart the system.
Is this a known issue with 7.0.3?
I do not have an explanation for why the error message “No space left on device” occurs.
Inspecting the free-space on mounted drives does not show any full drives and thus I don’t really know what “device” that error might refer to. Why that error would be returned from an operation on a Ethernet device or its associated PHC is a mystery to me.
I suppose it could be related to this issue triggering ptp4l to try and slew the clock or step the clock some huge amount as implied by the rms stats reported:
Mar 24 21:05:16 tegra-ubuntu ptp4l[2460]: [1177.048] missing timestamp on transmitted peer delay request
Mar 24 21:05:16 tegra-ubuntu ptp4l[2460]: [1177.048] port 1 (mgbe3_0): SLAVE to FAULTY on FAULT_DETECTED (FT_UNSPECIFIED)
Mar 24 21:05:32 tegra-ubuntu ptp4l[2460]: [1193.252] port 1 (mgbe3_0): FAULTY to SLAVE on INIT_COMPLETE
Mar 24 21:05:32 tegra-ubuntu ptp4l[2460]: [1193.307] rms 2905813907 max 2905815525 freq +12169 +/- 45
Mar 24 21:05:32 tegra-ubuntu ptp4l[2460]: [1193.420] failed to step clock: No space left on device
Mar 24 21:05:32 tegra-ubuntu ptp4l[2460]: [1193.645] failed to step clock: No space left on device
Mar 24 21:05:33 tegra-ubuntu ptp4l[2460]: [1193.871] failed to step clock: No space left on device
Mar 24 21:05:33 tegra-ubuntu ptp4l[2460]: [1194.095] failed to step clock: No space left on device
Mar 24 21:05:33 tegra-ubuntu ptp4l[2460]: [1194.252] missing timestamp on transmitted peer delay request
Mar 24 21:05:33 tegra-ubuntu ptp4l[2460]: [1194.252] port 1 (mgbe3_0): SLAVE to FAULTY on FAULT_DETECTED (FT_UNSPECIFIED)
Mar 24 21:05:49 tegra-ubuntu ptp4l[2460]: [1210.540] port 1 (mgbe3_0): FAULTY to SLAVE on INIT_COMPLETE
Mar 24 21:05:49 tegra-ubuntu ptp4l[2460]: [1210.634] failed to step clock: No space left on device
Mar 24 21:05:50 tegra-ubuntu ptp4l[2460]: [1210.859] failed to step clock: No space left on device
Mar 24 21:05:50 tegra-ubuntu ptp4l[2460]: [1211.084] failed to step clock: No space left on device
Mar 24 21:05:50 tegra-ubuntu ptp4l[2460]: [1211.197] rms 2905815503 max 2905815525 freq +12208 +/- 27
Mar 24 21:05:50 tegra-ubuntu ptp4l[2460]: [1211.309] failed to step clock: No space left on device
Mar 24 21:05:50 tegra-ubuntu ptp4l[2460]: [1211.534] failed to step clock: No space left on device
Mar 24 21:05:50 tegra-ubuntu ptp4l[2460]: [1211.612] missing timestamp on transmitted peer delay request
Mar 24 21:05:50 tegra-ubuntu ptp4l[2460]: [1211.612] port 1 (mgbe3_0): SLAVE to FAULTY on FAULT_DETECTED (FT_UNSPECIFIED)
Please suggest how to resolve or troubleshoot this issue.