Recently we experienced sudden shutdowns of our Xavier when running a script doing Pytorch inferences.
We are seeing no unusual logs in dmesg, kern.log or syslog. We also tried to dmesg over the serial connection.
The Xavier is running with jetson_clocks and MAXN Mode.
When setting the Powermode to something other than MAXN, the scripts runs through.
We are running Jetpack 4.6 [L4T 32.6.1].
I logged a session with boot → starting the script → boot again after shutdown.
Starting the script was around 11:04 - the shutdown appeared a few seconds later. Around 11:06 I booted the Xavier again.
I attached serial console log, syslog and kern.log
Just to clarify: We run the script in a docker container - but the issue also appears when we setup the environment with venv.
so after starting the python script, the fan stops spinning and the led goes off. No further messages in the logs. The xavier stays off. The boot-messages in the logfile are because I pressed the Power-Button on the board after about 2 minutes.
Download dataset VOT2018 via drive-link: https://drive.google.com/file/d/1Nea1OVnkYoVQAPZ7t5RYWPIxMSmKDenN/view?usp=sharing
Run container: (set “pathtodataset” to the location where the VOT2018 folder is located) docker run --rm -v pathtodataset:/home/pysot/testing_dataset/VOT2018 --runtime nvidia allu1234/pysot-xavier-torch19:2.0 python3 tools/test.py --snapshot experiments/siamrpn_mobilev2_l234_dwxcorr/model.pth --config experiments/siamrpn_mobilev2_l234_dwxcorr/config.yaml --dataset VOT2018