Jetson Xavier AGX reboots when downloading docker image

Perform a “docker pull nvcr.io/nvidia/l4t-cuda:10.2.460-runtime” and the jetson reboots shortly after starting.

Here are the contents of /var/log/syslog at the time of reboot.

Nov 16 14:34:50 jetson-02 kernel: [ 1234.491218] ------------[ cut here ]------------
Nov 16 14:34:50 jetson-02 kernel: [ 1234.491419] WARNING: CPU: 0 PID: 3 at /dvs/git/dirty/git-master_linux/kernel/nvidia/drivers/net/ethernet/nvidia/eqos/desc.c:387 desc_alloc_skb.isra.6+0x13c/0x1c8
Nov 16 14:34:50 jetson-02 kernel: [ 1234.491674] Modules linked in: bnep xt_conntrack ipt_MASQUERADE nf_nat_masquerade_ipv4 nf_conntrack_netlink nfnetlink xt_addrtype iptable_filter iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack br_netfilter zram overlay spidev userspace_alert nvgpu bluedroid_pm ip_tables x_tables
Nov 16 14:34:50 jetson-02 kernel: [ 1234.491777] 
Nov 16 14:34:50 jetson-02 kernel: [ 1234.491820] CPU: 0 PID: 3 Comm: ksoftirqd/0 Not tainted 4.9.201-tegra #1
Nov 16 14:34:50 jetson-02 kernel: [ 1234.491825] Hardware name: Jetson-AGX (DT)
Nov 16 14:34:50 jetson-02 kernel: [ 1234.491845] task: ffffffc7dc779c00 task.stack: ffffffc7dbc18000
Nov 16 14:34:50 jetson-02 kernel: [ 1234.491858] PC is at desc_alloc_skb.isra.6+0x13c/0x1c8
Nov 16 14:34:50 jetson-02 kernel: [ 1234.491865] LR is at eqos_re_alloc_skb+0x68/0x108
Nov 16 14:34:50 jetson-02 kernel: [ 1234.491873] pc : [<ffffff8008952c9c>] lr : [<ffffff80089530f0>] pstate: 20c00045
Nov 16 14:34:50 jetson-02 kernel: [ 1234.491877] sp : ffffffc7dbc1bb60
Nov 16 14:34:50 jetson-02 kernel: [ 1234.491883] x29: ffffffc7dbc1bb60 x28: ffffffc7d7b30900 
Nov 16 14:34:50 jetson-02 kernel: [ 1234.491895] x27: ffffffc7d7b34000 x26: ffffffc7a21aa510 
Nov 16 14:34:50 jetson-02 kernel: [ 1234.491907] x25: 0000000002080020 x24: 0000000000000000 
Nov 16 14:34:50 jetson-02 kernel: [ 1234.491919] x23: 000000005dce4042 x22: ffffffc7a21aa410 
Nov 16 14:34:50 jetson-02 kernel: [ 1234.491931] x21: ffffffc7a21aa408 x20: ffffffc7d7b30900 
Nov 16 14:34:50 jetson-02 kernel: [ 1234.491945] x19: ffffffc7941a7d00 x18: 0000000000000000 
Nov 16 14:34:50 jetson-02 kernel: [ 1234.491956] x17: 0000000000000002 x16: 0000000000000003 
Nov 16 14:34:50 jetson-02 kernel: [ 1234.491968] x15: ffffffc7db38f028 x14: 000400000005d866 
Nov 16 14:34:50 jetson-02 kernel: [ 1234.491979] x13: 0004000000000000 x12: 00000000c3b95463 
Nov 16 14:34:50 jetson-02 kernel: [ 1234.491991] x11: 00000000000000f4 x10: 0000000098ffffd7 
Nov 16 14:34:50 jetson-02 kernel: [ 1234.492005] x9 : 000000005deea000 x8 : ffffffbf1e23c300 
Nov 16 14:34:50 jetson-02 kernel: [ 1234.492017] x7 : 0000000000808f0b x6 : 0000000000000000 
Nov 16 14:34:50 jetson-02 kernel: [ 1234.492030] x5 : 0000000000000000 x4 : 0000000000000000 
Nov 16 14:34:50 jetson-02 kernel: [ 1234.492041] x3 : 0000000002080020 x2 : ffffffc7a21aa410 
Nov 16 14:34:50 jetson-02 kernel: [ 1234.492053] x1 : ffffffc7a21aa408 x0 : 000000000000005e 
Nov 16 14:34:50 jetson-02 kernel: [ 1234.492065] 
Nov 16 14:34:50 jetson-02 kernel: [ 1234.492069] ---[ end trace 7de843c608d21e9b ]---
Nov 16 14:34:50 jetson-02 kernel: [ 1234.492158] Call trace:
Nov 16 14:34:50 jetson-02 kernel: [ 1234.492178] [<ffffff8008952c9c>] desc_alloc_skb.isra.6+0x13c/0x1c8
Nov 16 14:34:50 jetson-02 kernel: [ 1234.492194] [<ffffff80089530f0>] eqos_re_alloc_skb+0x68/0x108
Nov 16 14:34:50 jetson-02 kernel: [ 1234.492210] [<ffffff800894e194>] eqos_napi_poll_rx+0x2dc/0x4f8
Nov 16 14:34:50 jetson-02 kernel: [ 1234.492230] [<ffffff8008db09c4>] net_rx_action+0xf4/0x358
Nov 16 14:34:50 jetson-02 kernel: [ 1234.492248] [<ffffff8008081054>] __do_softirq+0x13c/0x3b0
Nov 16 14:34:50 jetson-02 kernel: [ 1234.492269] [<ffffff80080b9e88>] run_ksoftirqd+0x48/0x58
Nov 16 14:34:50 jetson-02 kernel: [ 1234.492292] [<ffffff80080dfa08>] smpboot_thread_fn+0x160/0x248
Nov 16 14:34:50 jetson-02 kernel: [ 1234.492308] [<ffffff80080db074>] kthread+0xec/0xf0
Nov 16 14:34:50 jetson-02 kernel: [ 1234.492323] [<ffffff80080838a0>] ret_from_fork+0x10/0x30
Nov 16 14:34:50 jetson-02 kernel: [ 1234.493567] ------------[ cut here ]------------

Also I’ve already tried the instructions from here. Reboots while downloading the jetson-voice demo docker. The error I’m seeing appears to be in the ethernet driver.

I’ve switched over to using a USB-C ethernet adapter and works fine there. Something may be wrong with the ethernet driver for the RJ-45 interface on the dev kit hardware.

Glad to know issue resolved.

Using the USB-C interface is a work around. The issue isn’t resolved. The dev kit RJ-45 interface still crashes.

bump

docker info:
Client:
Context: default
Debug Mode: false

Server:
Containers: 2
Running: 0
Paused: 0
Stopped: 2
Images: 2
Server Version: 20.10.7
Storage Driver: overlay2
Backing Filesystem: extfs
Supports d_type: true
Native Overlay Diff: true
userxattr: false
Logging Driver: json-file
Cgroup Driver: cgroupfs
Cgroup Version: 1
Plugins:
Volume: local
Network: bridge host ipvlan macvlan null overlay
Log: awslogs fluentd gcplogs gelf journald json-file local logentries splunk syslog
Swarm: inactive
Runtimes: io.containerd.runc.v2 io.containerd.runtime.v1.linux runc
Default Runtime: runc
Init Binary: docker-init
containerd version:
runc version:
init version:
Security Options:
seccomp
Profile: default
Kernel Version: 4.9.253-tegra
Operating System: Ubuntu 18.04.6 LTS
OSType: linux
Architecture: aarch64
CPUs: 4
Total Memory: 15.45GiB

Docker Root Dir: /var/lib/docker
Debug Mode: false
Registry: https://index.docker.io/v1/
Labels:
Experimental: false
Insecure Registries:
127.0.0.0/8
Live Restore Enabled: false

note:
I can successfully pull and run the hello world image.
when I try to pull from aws I observe reboot behavior as described above. Futher info, I cant docker build without it triggers a reboot aswell.