Sometimes the system works fine despite this error, but other times other errors arise as well. Is this a fatal error that could be the source of errors later? How can prevent this error from happening?
Software Version
DRIVE OS Linux 5.2.0
DRIVE OS Linux 5.2.0 and DriveWorks 3.5
NVIDIA DRIVE™ Software 10.0 (Linux)
NVIDIA DRIVE™ Software 9.0 (Linux)
other DRIVE OS version
other
I have seen rarer cases of this issue where this message gets spammed in the console continuously – all that printk output causes the machine to slow down considerably (to the point were it doesn’t function).
There have been various other glitches (i.e. camera startup failures) that occurred and this message was present, but I am unsure if that was just a coincidence.
I collected further statistics for the occurrence of this issue. Over 50 system reboots, the system lock up when running the nvsipl_camera occurred 9 times! Once the system is in this state, any time the program is run results in this issue.
During this time, it was not possible to interact with the system such as by logging in via the serial console.
Same issue.
When running sipl camera driver, kern.log and syslog are spammed with call trace until disk is full.
So yes, the warning cause serious functional issue.
Once we have any progress, I’ll update you.
BTW, without “–filedump-prefix out” I couldn’t reproduce the issue. Please justify why this is a blocker for you. Thanks.
I have reproduced the issue without “–filedump-prefix out” as well. Infact my test that showed a 9 out of 50 failure rate was without this flag as well.
This is a blocker for us because we cannot send units in to the field with a 20% failure rate – this is far above an acceptable failure rate for our operations.
The procedure I use to reproduce is to reboot, run the program, then check dmesg output for the flood of messages that comes along with this issue (or just a system lockup without the ability to check the log). This is how I came up with the 9/50 reproduction rate.
Please replace nvmap_cache_t19x.c and nvmap_init_t19x.c in ~/nvidia/nvidia_sdk/DRIVE_OS_5.2.0_SDK_Linux_OS_DDPX/DRIVEOS/drive-oss-src/nvidia/drivers/video/tegra/nvmap with them, and follow Compiling the Kernel (NVIDIA DRIVE Linux) to rebuild the kernel. Thanks.
I have tested this change and 50 out of 50 reboots came up without the previously observed error. Thank you.
Should I expect this fix to be part of a official release sometime soon? Or will I need to maintain and spread this change to all developers.