Dear All,
Sending out a distress signal: I am totally at my wits’ end.
Installing the SDK packages on a Jetson AGX Xavier via SDK Manager fails every time toward the end.
I always (!) manage to flash L4T (Jetson OS), but I am never able to complete the subsequent package installation. Packages (CUDA, cuDNN, etc.) are either partially installed or not installed at all, because they depend on other packages, e.g. CUDA, which often fails to install.
The error messages in the console/log vary from attempt to attempt:
o .mtd wrong / corrupt
o ‘command terminated with error’ (but it doesn’t specify which one)
o ‘download’ timeout (everything IS already downloaded?)
o …
→ Example .log file attached. It includes a retry of the SDK install, so the file looks a bit large at first glance.
sdkm.log (1.0 MB)
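Since the messages differ between attempts, it may help to pull only the suspicious lines out of a large log before comparing runs. The following is a minimal sketch, not an SDK Manager feature: the keyword list is a guess based on the symptoms listed above, the function names are made up for illustration, and nothing is assumed about the actual layout of sdkm.log beyond it being a text file.

```python
# Substrings are guesses taken from the symptoms described above,
# not an official list of SDK Manager error markers.
SUSPECT_KEYWORDS = ("error", "corrupt", "timeout", "terminated")


def find_suspect_lines(lines, keywords=SUSPECT_KEYWORDS):
    """Return (line_number, text) pairs for lines containing any keyword.

    Matching is case-insensitive; line numbers start at 1.
    """
    hits = []
    for number, line in enumerate(lines, start=1):
        lowered = line.lower()
        if any(word in lowered for word in keywords):
            hits.append((number, line.rstrip("\n")))
    return hits


def scan_log(path, keywords=SUSPECT_KEYWORDS):
    """Scan a log file on disk; undecodable bytes are replaced, not fatal."""
    with open(path, encoding="utf-8", errors="replace") as handle:
        return find_suspect_lines(handle, keywords)
```

Calling scan_log("sdkm.log") on the attached file would return the matching lines together with their line numbers, which makes it easier to see which failure came first in a given attempt.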
Host system: i5, 12GB RAM, SS-USB Port.
Target system: 32GB Jetson Xavier AGX, P2888-0004
Things tried (my time wasted over the previous two days):
- Host OS changed {16.04, 18.04}, fresh, clean vanilla installs
- JetPacks changed {4.4, 4.5, 4.5.1 w/ and w/o that new addon}
- Installation content changed {target only, target and host}
- Host-side: sdk folders deleted {~/nvidia, ~/Downloads/nvidia, ~/.nvsdkm}
- Host-side: apt-get remove and purge and whatnot of everything NV {‘nvidia’, ‘nsight’, …}
- JetsonOS/L4T flash sequence start changed {auto (via 192.168.55.1), manual (via two-button press)}
- SDK component flashing via different NICs {USB-C/Eth bridge (192.168.55.1), WiFi stick plugged into Jetson (DHCP-assigned IP)}
- Target: User autologon feature {on, off}
- Host-side: Changed USB ports
- SDK Manager interface changed {GUI, CLI}
CLI-based SDK Manager launch command: “sdkmanager --cli install --license accept --logintype devzone --product Jetson --version 4.5.1 --targetos Linux --target P2888-0004 --flash all --staylogin true”
Host-side: To validate that the network connection is good, I kept the following running in separate terminals and monitored them, but to no avail; everything looked fine on those ends:
- dmesg -wH (to watch for suspicious events, e.g. loss of the USB-C/Eth connection)
- ping 192.168.55.1 (to see whether pings fail, but nada: pings were always good, connection never interrupted)
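Watching ping by eye can miss a short dropout, so a timestamped record of failed probes could be compared against the failure time in the SDK Manager log. The sketch below is one way to do that under stated assumptions: it relies on the Linux iputils ping flags -c (count) and -W (reply timeout in seconds), and the function names ping_once and watch are hypothetical helpers, not part of any NVIDIA tool.

```python
import subprocess
import time
from datetime import datetime


def ping_once(host, timeout_s=1, run=subprocess.run):
    """Return True if a single ICMP echo to `host` gets a reply.

    Assumes Linux iputils ping: -c 1 sends one probe, -W sets the
    reply timeout in seconds. `run` is injectable for testing.
    """
    result = run(
        ["ping", "-c", "1", "-W", str(timeout_s), host],
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode == 0


def watch(host, count, interval_s=1.0, ping=ping_once, sleep=time.sleep):
    """Probe `host` `count` times and return timestamps of failed probes."""
    failures = []
    for i in range(count):
        if not ping(host):
            stamp = datetime.now().isoformat(timespec="seconds")
            failures.append(stamp)
            print(f"{stamp} ping to {host} FAILED")
        if i < count - 1:
            sleep(interval_s)  # pause between probes, not after the last one
    return failures
```

For example, running watch("192.168.55.1", count=600) in a separate terminal during the package installation would probe the Jetson's USB network address for roughly ten minutes and print a timestamp only when a probe fails.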
I did NOT change/downgrade from the current SDK Manager revision (1.4.1.7402), since I assumed that was too unlikely a culprit.
In all fairness, I have to admit that the USB-C cable is NOT (!) the original NVIDIA one. However, it is a robust cable, works like a charm with my phone, and always (!) gets through a full L4T flash. Manual ssh also works fine and never aborts, so I guess we can rule out the cable.
Since, as mentioned in the previous paragraph, L4T always finishes flashing, I guess we can also safely rule out a malfunctioning target device. L4T/JetsonOS is always good: logging in, sudo apt update, etc. always work.
Guys, this quality of experience is far from what I’d expect of NV’s otherwise charming tools, so most likely the problem once again sits in front of the computer… I just don’t see what I might be doing wrong here?!
SOS.
THX !