I got the error below from SDK Manager during a fresh install of JP6.2. The same thing happens with JP6.1. I never got errors from SDK Manager in the past on my laptop running Ubuntu 22.04.
I’m also running into this exact same error. I tried JP 6.0, 6.1, and 6.2, all with the same error. When I reinstall it, it says it’s successful, HOWEVER when you boot into the Jetson device to use it, Docker can’t start.
Hi,
Thanks for reporting this.
Would you mind sharing the log with us so we can check?
https://docs.nvidia.com/sdk-manager/install-with-sdkm-jetson/index.html#step-04-finalize-setup
Export Debug Logs
Thanks.
Hi, @glingk
JetPack 6.1/6.2 doesn’t preinstall Docker, just nvidia-container-toolkit.
Did you install it yourself and find it is not working?
If so, could you share the output with us?
The Docker installation steps can be found in the link below:
Thanks.
Here are the logs. Just for clarification, the errors shown in the SDK Manager screenshot did not appear before. I was able to fully flash my AGX Orin with JP6.2 error-free last week, but today I got the error reported earlier.
SDKM_logs_2025-02-20_23-31-41.zip (2.1 MB)
The workaround you suggested above did not work, but the link below did. The main issue is still there: why is SDK Manager spitting out such an error during flashing??
Anybody home from the NVIDIA SDK Manager support team?
As a side note, installing the packages below works as a workaround, BUT as soon as you run apt-get update and apt-get upgrade -y, it updates them to docker-buildx-plugin (0.21.0-1~ubuntu.22.04~jammy), docker-ce (5:28.0.0-1~ubuntu.22.04~jammy), and docker-compose-plugin (2.33.0-1~ubuntu.22.04~jammy), and Docker does not seem to like these updates. Furthermore, it looks like the docker-ce package version is what causes the issue: as soon as you reinstall docker-ce=5:27.1.2-1~ubuntu.22.04~jammy, everything goes back to normal. Can anybody test/verify this?
Again, I am not trying to fix the issue, just sharing with the NVIDIA support team what I have observed so far.
Workaround fix for my particular scenario (AGX Orin 32 GB running JP6.2):
Downgrade docker-ce to 5:27.5.1:
apt install docker-ce=5:27.5.1-1~ubuntu.22.04~jammy
The pain: you have to downgrade docker-ce to 5:27.5.1 again every time you run apt-get update and apt-get upgrade.
Been fighting this as well.
It’s because Docker was updated recently, and due to some configuration choices in the NVIDIA kernel, the new version fails to start. You can either recompile the kernel, or do this:
The solution is to run this:
sudo apt install docker-ce=5:27.5.1-1~ubuntu.22.04~jammy \
docker-ce-cli=5:27.5.1-1~ubuntu.22.04~jammy \
docker-compose-plugin=2.32.4-1~ubuntu.22.04~jammy \
docker-buildx-plugin=0.20.0-1~ubuntu.22.04~jammy \
docker-ce-rootless-extras=5:27.5.1-1~ubuntu.22.04~jammy
See:
You can then hit “retry”, and it should resume installing the packages in SDK manager. (Make sure to skip the ‘flash’ dialog or you get to start over… :D)
Thanks a lot for confirming the workaround. Now someone needs to look into SDK Manager so end users don’t get the error at flashing time. I am not sure if it’s the same issue for the Jetson Nano, Super, etc.; I am reporting this from an AGX Orin point of view.
Can Nvidia stop breaking its software every time it releases a new update to a single package?
What I did was hit the Retry button after downgrading. It should work. Just make sure to skip the first “flash” screen, otherwise you’ll nuke the install and start over, lol
My primary ask would be to date the documentation and have a clear “written for version XXX” note.
But yeah, it’s been an interesting week. I had a demo for my CEO, promptly managed to break my working demo, and then had this issue crop up and prevent me from getting it working again… :D
Hi, all
We are working on this issue with our internal team.
Will provide more info to you soon.
Thanks.
Hi, all
Update:
The new Docker 28.0.1 release has fixed this issue. SDK Manager can now work normally.
$ sudo apt-get install docker-ce docker-ce-cli
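For anyone unsure which docker-ce they ended up with after all the back-and-forth, here is a small shell sketch that checks the installed version against the versions reported in this thread (27.5.1 working, 28.0.0 broken on the stock r36.4.3 kernel, 28.0.1 fixed). The list reflects only what this thread reports, not an official compatibility matrix:

```shell
# Check the installed docker-ce against the versions discussed in this thread.
docker_ce_ok() {
    # $1 = upstream Docker version string, e.g. "28.0.0"
    case "$1" in
        28.0.0) return 1 ;;  # reported broken on the default JetPack 6.2 kernel
        *)      return 0 ;;  # 27.5.1 and 28.0.1+ were reported working
    esac
}

# Extract "X.Y.Z" from "Docker version X.Y.Z, build <hash>"
ver=$(docker --version 2>/dev/null | sed -n 's/^Docker version \([0-9.]*\).*/\1/p')
if [ -n "$ver" ]; then
    if docker_ce_ok "$ver"; then
        echo "docker $ver: not a version reported broken in this thread"
    else
        echo "docker $ver: affected -- downgrade to 27.5.1 or upgrade to 28.0.1"
    fi
else
    echo "docker not installed or not on PATH"
fi
```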
Thanks for your patience. According to this issue:
$ sudo docker run hello-world
docker: Cannot connect to the Docker daemon at unix:///var/run/docker.sock. Is the docker daemon running?
$ journalctl -xu docker.service
...
Feb 24 02:48:46 tegra-ubuntu dockerd[76713]: failed to start daemon: Error initializing network controller: error obtaining controller instance: failed to register "bridge" driver: invalid argument
Feb 24 02:48:46 tegra-ubuntu systemd[1]: docker.service: Main process exited, code=exited, status=1/FAILURE
The root cause is that the latest Docker (v28.0.0) requires additional kernel configs that are not enabled on r36.4.3 (JetPack 6.2) by default.
We are working with our internal team to fix this issue.
Currently, there are some possible workarounds for your reference:
(thanks to @Kangalow @hex4def6 @whitesscott for the contribution)
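Before picking a workaround, you can check whether your running kernel already enables the required options. This is a sketch: it assumes the kernel exports /proc/config.gz (CONFIG_IKCONFIG_PROC), and falls back to /boot/config-$(uname -r) otherwise:

```shell
# The three options docker v28 needs, per the root-cause analysis above.
REQUIRED="CONFIG_IP_SET CONFIG_IP_SET_HASH_NET CONFIG_NETFILTER_XT_SET"

kernel_config() {
    # Prefer the in-kernel copy if it is exported; fall back to /boot.
    if [ -r /proc/config.gz ]; then
        zcat /proc/config.gz
    elif [ -r "/boot/config-$(uname -r)" ]; then
        cat "/boot/config-$(uname -r)"
    fi
}

missing=""
for cfg in $REQUIRED; do
    if kernel_config | grep -q "^${cfg}=[ym]"; then
        echo "$cfg: enabled"
    else
        echo "$cfg: not enabled"
        missing="$missing $cfg"
    fi
done
[ -z "$missing" ] && echo "kernel looks OK for docker 28" \
                  || echo "missing:$missing -> one of the WARs is needed"
```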
WAR-1: Downgrade to Docker 27.5.1
- You will see the error message below when setting up the device with JetPack 6.2:
- Please ignore the failure, log in to the device, and run the following commands:
$ sudo apt-get install -y docker-ce=5:27.5.1-1~ubuntu.22.04~jammy --allow-downgrades
$ sudo apt-get install -y docker-ce-cli=5:27.5.1-1~ubuntu.22.04~jammy --allow-downgrades
$ sudo apt-mark hold docker-ce
$ sudo apt-mark hold docker-ce-cli
$ ./NV_L4T_DOCKER_TARGET_POST_INSTALL_COMP.sh
- Verify
$ sudo docker --version
Docker version 27.5.1, build 9f9e405
$ sudo systemctl status docker.service
● docker.service - Docker Application Container Engine
Loaded: loaded (/lib/systemd/system/docker.service; enabled; vendor preset: enabled)
Active: active (running) since Mon 2025-02-24 05:03:21 UTC; 16s ago
TriggeredBy: ● docker.socket
Docs: https://docs.docker.com
Main PID: 34864 (dockerd)
Tasks: 17
Memory: 29.1M
CPU: 261ms
CGroup: /system.slice/docker.service
└─34864 /usr/bin/dockerd -H fd:// --containerd=/run/containerd/containerd.sock
Feb 24 05:03:20 tegra-ubuntu dockerd[34864]: time="2025-02-24T05:03:20.970132435Z" level=info msg="detected 127.0.0.53 nameserver, assuming systemd-resolved, so usi>
Feb 24 05:03:21 tegra-ubuntu dockerd[34864]: time="2025-02-24T05:03:21.052802756Z" level=info msg="[graphdriver] using prior storage driver: overlay2"
Feb 24 05:03:21 tegra-ubuntu dockerd[34864]: time="2025-02-24T05:03:21.053766567Z" level=info msg="Loading containers: start."
Feb 24 05:03:21 tegra-ubuntu dockerd[34864]: time="2025-02-24T05:03:21.124370043Z" level=warning msg="Could not load necessary modules for IPSEC rules: protocol not>
Feb 24 05:03:21 tegra-ubuntu dockerd[34864]: time="2025-02-24T05:03:21.175636062Z" level=info msg="Default bridge (docker0) is assigned with an IP address 172.17.0.>
Feb 24 05:03:21 tegra-ubuntu dockerd[34864]: time="2025-02-24T05:03:21.224770440Z" level=info msg="Loading containers: done."
Feb 24 05:03:21 tegra-ubuntu dockerd[34864]: time="2025-02-24T05:03:21.240763535Z" level=info msg="Docker daemon" commit=4c9b3b0 containerd-snapshotter=false storag>
Feb 24 05:03:21 tegra-ubuntu dockerd[34864]: time="2025-02-24T05:03:21.240879431Z" level=info msg="Daemon has completed initialization"
Feb 24 05:03:21 tegra-ubuntu dockerd[34864]: time="2025-02-24T05:03:21.263237945Z" level=info msg="API listen on /run/docker.sock"
Feb 24 05:03:21 tegra-ubuntu systemd[1]: Started Docker Application Container Engine.
$ sudo docker run -it --rm --net=host --runtime nvidia -e DISPLAY=$DISPLAY -v /tmp/.X11-unix/:/tmp/.X11-unix nvcr.io/nvidia/l4t-tensorrt:r10.3.0-devel
root@tegra-ubuntu:/# /usr/src/tensorrt/bin/trtexec --onnx=/usr/src/tensorrt/data/mnist/mnist.onnx
&&&& RUNNING TensorRT.trtexec [TensorRT v100300] # /usr/src/tensorrt/bin/trtexec --onnx=/usr/src/tensorrt/data/mnist/mnist.onnx
...
[02/24/2025-05:29:02] [I] Average on 10 runs - GPU latency: 0.0347168 ms - Host latency: 0.0474365 ms (enqueue 0.0302002 ms)
[02/24/2025-05:29:02] [I] Average on 10 runs - GPU latency: 0.0346436 ms - Host latency: 0.047168 ms (enqueue 0.0301514 ms)
[02/24/2025-05:29:02] [I]
[02/24/2025-05:29:02] [I] === Performance summary ===
[02/24/2025-05:29:02] [I] Throughput: 15546 qps
[02/24/2025-05:29:02] [I] Latency: min = 0.0450439 ms, max = 0.140625 ms, mean = 0.0479416 ms, median = 0.0473633 ms, percentile(90%) = 0.0489502 ms, percentile(95%) = 0.050293 ms, percentile(99%) = 0.0616455 ms
[02/24/2025-05:29:02] [I] Enqueue Time: min = 0.0285339 ms, max = 0.0996094 ms, mean = 0.030676 ms, median = 0.0302734 ms, percentile(90%) = 0.0317383 ms, percentile(95%) = 0.032959 ms, percentile(99%) = 0.0415039 ms
[02/24/2025-05:29:02] [I] H2D Latency: min = 0.00524902 ms, max = 0.0258789 ms, mean = 0.00624994 ms, median = 0.00619507 ms, percentile(90%) = 0.00646973 ms, percentile(95%) = 0.0065918 ms, percentile(99%) = 0.00708008 ms
[02/24/2025-05:29:02] [I] GPU Compute Time: min = 0.0324707 ms, max = 0.10498 ms, mean = 0.035119 ms, median = 0.034668 ms, percentile(90%) = 0.0359497 ms, percentile(95%) = 0.0371094 ms, percentile(99%) = 0.0460205 ms
[02/24/2025-05:29:02] [I] D2H Latency: min = 0.0055542 ms, max = 0.0219727 ms, mean = 0.00657305 ms, median = 0.00653076 ms, percentile(90%) = 0.00683594 ms, percentile(95%) = 0.00695801 ms, percentile(99%) = 0.0078125 ms
[02/24/2025-05:29:02] [I] Total Host Walltime: 3.00013 s
[02/24/2025-05:29:02] [I] Total GPU Compute Time: 1.63795 s
[02/24/2025-05:29:02] [W] * Throughput may be bound by Enqueue Time rather than GPU Compute and the GPU may be under-utilized.
[02/24/2025-05:29:02] [W] If not already in use, --useCudaGraph (utilize CUDA graphs where possible) may increase the throughput.
[02/24/2025-05:29:02] [W] * GPU compute time is unstable, with coefficient of variance = 5.98857%.
[02/24/2025-05:29:02] [W] If not already in use, locking GPU clock frequency or adding --useSpinWait may improve the stability.
[02/24/2025-05:29:02] [I] Explanations of the performance metrics are printed in the verbose logs.
[02/24/2025-05:29:02] [I]
&&&& PASSED TensorRT.trtexec [TensorRT v100300] # /usr/src/tensorrt/bin/trtexec --onnx=/usr/src/tensorrt/data/mnist/mnist.onnx
WAR-2: Build and flash a custom kernel with the required configs
Please refer to Kernel Customization — NVIDIA Jetson Linux Developer Guide
- Download the r36.4.3 source package here and run the commands below:
$ tar xf public_sources.tbz2
$ cd Linux_for_Tegra/source
$ tar xf kernel_src.tbz2
$ tar xf kernel_oot_modules_src.tbz2
$ tar xf nvidia_kernel_display_driver_source.tbz2
- Enable configs
Add the following configs to Linux_for_Tegra/source/kernel/kernel-jammy-src/arch/arm64/configs/defconfig:
CONFIG_IP_SET=m
CONFIG_IP_SET_HASH_NET=m
CONFIG_NETFILTER_XT_SET=m
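If you prefer to script that defconfig edit, here is a minimal idempotent sketch; the path is the one from the step above, and `append_docker_cfgs` is just a hypothetical helper name:

```shell
# Append the three options to a defconfig, skipping any already present,
# so running it twice does not duplicate lines.
append_docker_cfgs() {
    # $1 = path to the defconfig to edit
    for opt in CONFIG_IP_SET=m CONFIG_IP_SET_HASH_NET=m CONFIG_NETFILTER_XT_SET=m; do
        grep -qxF "$opt" "$1" || echo "$opt" >> "$1"
    done
}

# Usage (path from the step above):
# append_docker_cfgs Linux_for_Tegra/source/kernel/kernel-jammy-src/arch/arm64/configs/defconfig
```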
- Build the custom kernel
$ export CROSS_COMPILE=$HOME/Desktop/Toolchain_gcc_11.3/aarch64--glibc--stable-2022.08-1/bin/aarch64-buildroot-linux-gnu-
$ make -C kernel
$ export INSTALL_MOD_PATH=<install-path>/Linux_for_Tegra/rootfs/
$ sudo -E make install -C kernel
$ cp kernel/kernel-jammy-src/arch/arm64/boot/Image <install-path>/Linux_for_Tegra/kernel/Image
- Flash the r36.4.3 image
- Run SDK Manager to install the SDK components. It should finish successfully like below:
- Verify
$ sudo docker --version
Docker version 28.0.0, build f9ced58
$ sudo systemctl status docker.service
● docker.service - Docker Application Container Engine
Loaded: loaded (/lib/systemd/system/docker.service; enabled; vendor preset: enabled)
Active: active (running) since Mon 2025-02-24 06:12:59 UTC; 43min ago
TriggeredBy: ● docker.socket
Docs: https://docs.docker.com
Main PID: 26327 (dockerd)
Tasks: 13
Memory: 7.8G
CPU: 4min 31.263s
CGroup: /system.slice/docker.service
└─26327 /usr/bin/dockerd -H fd:// --containerd=/run/containerd/containerd.sock
Feb 24 06:12:59 tegra-ubuntu dockerd[26327]: time="2025-02-24T06:12:59.171168564Z" level=info msg="Docker daemon" commit=af898ab containerd-snapshotter=false storag>
Feb 24 06:12:59 tegra-ubuntu dockerd[26327]: time="2025-02-24T06:12:59.171724546Z" level=info msg="Initializing buildkit"
Feb 24 06:12:59 tegra-ubuntu dockerd[26327]: time="2025-02-24T06:12:59.222523480Z" level=info msg="Completed buildkit initialization"
Feb 24 06:12:59 tegra-ubuntu dockerd[26327]: time="2025-02-24T06:12:59.237147987Z" level=info msg="Daemon has completed initialization"
Feb 24 06:12:59 tegra-ubuntu dockerd[26327]: time="2025-02-24T06:12:59.237343768Z" level=info msg="API listen on /run/docker.sock"
Feb 24 06:12:59 tegra-ubuntu systemd[1]: Started Docker Application Container Engine.
$ sudo docker run -it --rm --net=host --runtime nvidia -e DISPLAY=$DISPLAY -v /tmp/.X11-unix/:/tmp/.X11-unix nvcr.io/nvidia/l4t-tensorrt:r10.3.0-devel
root@tegra-ubuntu:/# /usr/src/tensorrt/bin/trtexec --onnx=/usr/src/tensorrt/data/mnist/mnist.onnx
&&&& RUNNING TensorRT.trtexec [TensorRT v100300] # /usr/src/tensorrt/bin/trtexec --onnx=/usr/src/tensorrt/data/mnist/mnist.onnx
...
[02/24/2025-06:56:59] [I] Average on 10 runs - GPU latency: 0.0666748 ms - Host latency: 0.0834717 ms (enqueue 0.0402832 ms)
[02/24/2025-06:56:59] [I] Average on 10 runs - GPU latency: 0.0663574 ms - Host latency: 0.0811768 ms (enqueue 0.0387207 ms)
[02/24/2025-06:56:59] [I]
[02/24/2025-06:56:59] [I] === Performance summary ===
[02/24/2025-06:56:59] [I] Throughput: 12287.4 qps
[02/24/2025-06:56:59] [I] Latency: min = 0.0768738 ms, max = 0.185547 ms, mean = 0.0828475 ms, median = 0.081543 ms, percentile(90%) = 0.0887756 ms, percentile(95%) = 0.0905762 ms, percentile(99%) = 0.101715 ms
[02/24/2025-06:56:59] [I] Enqueue Time: min = 0.0361633 ms, max = 0.0998535 ms, mean = 0.0392462 ms, median = 0.0385132 ms, percentile(90%) = 0.0402832 ms, percentile(95%) = 0.043457 ms, percentile(99%) = 0.0566406 ms
[02/24/2025-06:56:59] [I] H2D Latency: min = 0.0065918 ms, max = 0.0449219 ms, mean = 0.00811712 ms, median = 0.00799561 ms, percentile(90%) = 0.00860596 ms, percentile(95%) = 0.0090332 ms, percentile(99%) = 0.0108337 ms
[02/24/2025-06:56:59] [I] GPU Compute Time: min = 0.059967 ms, max = 0.105713 ms, mean = 0.0679207 ms, median = 0.0667725 ms, percentile(90%) = 0.0742188 ms, percentile(95%) = 0.0756226 ms, percentile(99%) = 0.0835876 ms
[02/24/2025-06:56:59] [I] D2H Latency: min = 0.00463867 ms, max = 0.112549 ms, mean = 0.00681027 ms, median = 0.0067749 ms, percentile(90%) = 0.00805664 ms, percentile(95%) = 0.00854492 ms, percentile(99%) = 0.00927734 ms
[02/24/2025-06:56:59] [I] Total Host Walltime: 3.00015 s
[02/24/2025-06:56:59] [I] Total GPU Compute Time: 2.50383 s
[02/24/2025-06:56:59] [W] * GPU compute time is unstable, with coefficient of variance = 5.43696%.
[02/24/2025-06:56:59] [W] If not already in use, locking GPU clock frequency or adding --useSpinWait may improve the stability.
[02/24/2025-06:56:59] [I] Explanations of the performance metrics are printed in the verbose logs.
[02/24/2025-06:56:59] [I]
&&&& PASSED TensorRT.trtexec [TensorRT v100300] # /usr/src/tensorrt/bin/trtexec --onnx=/usr/src/tensorrt/data/mnist/mnist.onnx
WAR-3: Update the kernel image with the required configs
- Download the r36.4.3 source package here and run the commands below:
$ tar xf public_sources.tbz2
$ cd Linux_for_Tegra/source
$ tar xf kernel_src.tbz2
$ tar xf kernel_oot_modules_src.tbz2
$ tar xf nvidia_kernel_display_driver_source.tbz2
- Enable configs
Add the following configs to Linux_for_Tegra/source/kernel/kernel-jammy-src/arch/arm64/configs/defconfig:
CONFIG_IP_SET=m
CONFIG_IP_SET_HASH_NET=m
CONFIG_NETFILTER_XT_SET=m
- Build the custom kernel
$ export CROSS_COMPILE=$HOME/Desktop/Toolchain_gcc_11.3/aarch64--glibc--stable-2022.08-1/bin/aarch64-buildroot-linux-gnu-
$ make -C kernel
- Update kernel image
# On Target
$ mkdir -p /usr/lib/modules/5.15.148-tegra/kernel/net/netfilter/ipset
# On Host
$ scp /Linux_for_Tegra/source/out/nvidia-linux-header/arch/arm64/boot/Image [Jetson]:/boot/Image
$ scp /Linux_for_Tegra/source/kernel/kernel-jammy-src/net/netfilter/xt_set.ko [Jetson]:/usr/lib/modules/5.15.148-tegra/kernel/net/netfilter/.
$ scp /Linux_for_Tegra/source/kernel/kernel-jammy-src/net/netfilter/ipset/ip_set_hash_net.ko [Jetson]:/usr/lib/modules/5.15.148-tegra/kernel/net/netfilter/ipset/.
$ scp /Linux_for_Tegra/source/kernel/kernel-jammy-src/net/netfilter/ipset/ip_set.ko [Jetson]:/usr/lib/modules/5.15.148-tegra/kernel/net/netfilter/ipset/.
# On Target
$ depmod -a 5.15.148-tegra
$ update-initramfs -c -k 5.15.148-tegra
$ reboot
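After the reboot, one way to sanity-check that the copied modules are visible to the new kernel before retrying Docker. `modprobe -n` (dry run) only resolves a module against the module tree without actually loading it:

```shell
# Module names corresponding to the three configs added above.
MODULES="ip_set ip_set_hash_net xt_set"

if command -v modprobe >/dev/null 2>&1; then
    for m in $MODULES; do
        # Dry run: resolves the module and its dependencies, loads nothing.
        if modprobe -n "$m" 2>/dev/null; then
            echo "$m: found for $(uname -r)"
        else
            echo "$m: NOT found -- re-check the scp/depmod steps above"
        fi
    done
else
    echo "modprobe not available on this system"
fi
```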
- Verify
$ sudo docker --version
Docker version 28.0.0, build f9ced58
$ sudo docker run -it --rm --net=host --runtime nvidia -e DISPLAY=$DISPLAY -v /tmp/.X11-unix/:/tmp/.X11-unix nvcr.io/nvidia/l4t-tensorrt:r10.3.0-devel
root@tegra-ubuntu:/# /usr/src/tensorrt/bin/trtexec --onnx=/usr/src/tensorrt/data/mnist/mnist.onnx
...
[02/26/2025-06:45:44] [I] Average on 10 runs - GPU latency: 0.0656006 ms - Host latency: 0.0818359 ms (enqueue 0.0415771 ms)
[02/26/2025-06:45:44] [I] Average on 10 runs - GPU latency: 0.0655762 ms - Host latency: 0.0809326 ms (enqueue 0.0402344 ms)
[02/26/2025-06:45:44] [I]
[02/26/2025-06:45:44] [I] === Performance summary ===
[02/26/2025-06:45:44] [I] Throughput: 12231.4 qps
[02/26/2025-06:45:44] [I] Latency: min = 0.0751953 ms, max = 0.129395 ms, mean = 0.0826511 ms, median = 0.0812988 ms, percentile(90%) = 0.0893555 ms, percentile(95%) = 0.0912476 ms, percentile(99%) = 0.101227 ms
[02/26/2025-06:45:44] [I] Enqueue Time: min = 0.0366211 ms, max = 0.0927734 ms, mean = 0.0399322 ms, median = 0.0391846 ms, percentile(90%) = 0.041748 ms, percentile(95%) = 0.0439453 ms, percentile(99%) = 0.0535889 ms
[02/26/2025-06:45:44] [I] H2D Latency: min = 0.00601196 ms, max = 0.0402832 ms, mean = 0.00832953 ms, median = 0.00805664 ms, percentile(90%) = 0.00939941 ms, percentile(95%) = 0.0100098 ms, percentile(99%) = 0.0115509 ms
[02/26/2025-06:45:44] [I] GPU Compute Time: min = 0.0568237 ms, max = 0.0999146 ms, mean = 0.0672211 ms, median = 0.06604 ms, percentile(90%) = 0.0744629 ms, percentile(95%) = 0.0760498 ms, percentile(99%) = 0.0833435 ms
[02/26/2025-06:45:44] [I] D2H Latency: min = 0.00463867 ms, max = 0.0119629 ms, mean = 0.00710001 ms, median = 0.00708008 ms, percentile(90%) = 0.00830078 ms, percentile(95%) = 0.00878906 ms, percentile(99%) = 0.00952148 ms
[02/26/2025-06:45:44] [I] Total Host Walltime: 3.00015 s
[02/26/2025-06:45:44] [I] Total GPU Compute Time: 2.46675 s
[02/26/2025-06:45:44] [W] * GPU compute time is unstable, with coefficient of variance = 5.93523%.
[02/26/2025-06:45:44] [W] If not already in use, locking GPU clock frequency or adding --useSpinWait may improve the stability.
[02/26/2025-06:45:44] [I] Explanations of the performance metrics are printed in the verbose logs.
[02/26/2025-06:45:44] [I]
&&&& PASSED TensorRT.trtexec [TensorRT v100300] # /usr/src/tensorrt/bin/trtexec --onnx=/usr/src/tensorrt/data/mnist/mnist.onnx
Thanks.
Does WAR-2 download a version of the kernel with the Docker-required flags already enabled? i.e.
CONFIG_IP_SET=m
CONFIG_IP_SET_HASH_NET=m
CONFIG_NETFILTER_XT_SET=m
If not, it would seem an additional step is needed in this process.
I would suggest including the apt hold on those two packages; otherwise any apt upgrade that, e.g., SDK Manager or the user runs will break it again.
Thanks @AastaLLL and @Kangalow for providing a few workaround options. Hoping the NVIDIA team will come up with a final fix soon.
Hi, @hex4def6
Thanks for the suggestion. We have updated the command accordingly.