Getting GPU docker passthrough working

I’m trying to get the containers running on my Jetson Xavier AGX to use the GPU.
I’ve followed these instructions and also these, and I do see everything I should when validating:

$ sudo dpkg --get-selections | grep nvidia
libnvidia-container-tools                       install
libnvidia-container0:arm64                      install
libnvidia-container1:arm64                      install
nvidia-container-runtime                        install
nvidia-container-toolkit                        install
nvidia-docker2                                  install
nvidia-l4t-3d-core                              install
nvidia-l4t-apt-source                           install
nvidia-l4t-bootloader                           install
nvidia-l4t-camera                               install
nvidia-l4t-configs                              install
nvidia-l4t-core                                 install
nvidia-l4t-cuda                                 install
nvidia-l4t-display-kernel                       install
nvidia-l4t-firmware                             install
nvidia-l4t-gputools                             install
nvidia-l4t-graphics-demos                       install
nvidia-l4t-gstreamer                            install
nvidia-l4t-init                                 install
nvidia-l4t-initrd                               install
nvidia-l4t-jetson-io                            install
nvidia-l4t-jetson-multimedia-api                install
nvidia-l4t-jetsonpower-gui-tools                install
nvidia-l4t-kernel                               install
nvidia-l4t-kernel-dtbs                          install
nvidia-l4t-kernel-headers                       install
nvidia-l4t-libvulkan                            install
nvidia-l4t-multimedia                           install
nvidia-l4t-multimedia-utils                     install
nvidia-l4t-nvfancontrol                         install
nvidia-l4t-nvpmodel                             install
nvidia-l4t-nvpmodel-gui-tools                   install
nvidia-l4t-nvsci                                install
nvidia-l4t-oem-config                           install
nvidia-l4t-optee                                install
nvidia-l4t-pva                                  install
nvidia-l4t-tools                                install
nvidia-l4t-wayland                              install
nvidia-l4t-weston                               install
nvidia-l4t-x11                                  install
nvidia-l4t-xusb-firmware                        install

AND

$ sudo docker info | grep nvidia
Runtimes: io.containerd.runc.v2 io.containerd.runtime.v1.linux nvidia runc

However, when I try to run a CUDA base image:

sudo docker run --rm --gpus all nvidia/cuda:11.0.3-base-ubuntu20.04 nvidia-smi

Or via docker-compose:

services:
  test:
    image: nvidia/cuda:10.2-base
    command: nvidia-smi
    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              capabilities: [gpu]

I get the following error:

docker: Error response from daemon: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: Auto-detected mode as 'csv'
invoking the NVIDIA Container Runtime Hook directly (e.g. specifying the docker --gpus flag) is not supported. Please use the NVIDIA Container Runtime instead.: unknown.
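The error message itself points at the fix: don’t go through the `--gpus` hook, use the NVIDIA Container Runtime instead. On JetPack that usually means either passing `--runtime nvidia` to `docker run`, or making the nvidia runtime the default in `/etc/docker/daemon.json`. A sketch of the latter (written to a local `daemon.json` for illustration; the real file is `/etc/docker/daemon.json`, and the `runtimes` entry should match what nvidia-docker2 already installed):

```shell
# Sketch: make the nvidia runtime Docker's default on JetPack.
# Writing to ./daemon.json here for illustration; the real file is
# /etc/docker/daemon.json, and dockerd must be restarted after editing it.
cat > daemon.json <<'EOF'
{
    "default-runtime": "nvidia",
    "runtimes": {
        "nvidia": {
            "path": "nvidia-container-runtime",
            "runtimeArgs": []
        }
    }
}
EOF
python3 -m json.tool daemon.json   # sanity-check the JSON before restarting docker
```

After `sudo systemctl restart docker`, a plain `docker run` (no `--gpus` flag) should go through the nvidia runtime.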

Last thing to mention: the device is running headless (no screen attached), in case it matters.

Any thoughts on how to pass the GPU to the docker containers?

Desktop uses different images compared to arm64. Make sure you’re using the right one.

Also make sure JetPack has a clean install (run SDK Manager to reflash), and make sure the container version you choose is compatible with your version of JetPack.

Finally - be very careful running “apt update” because it will likely break docker.

edit: Oh yeah, there is no “nvidia-smi” on arm64, so what you are trying will never work.
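Since nvidia-smi doesn’t exist on Jetson, one hedged way to validate the GPU is the CUDA deviceQuery sample. Sample paths vary by JetPack/CUDA version; the path below is where older JetPack releases installed them, and this obviously only runs on the Jetson itself:

```shell
# Sketch: validate the GPU without nvidia-smi.
# Path assumes JetPack's bundled CUDA samples; newer CUDA releases ship
# the samples separately instead of under /usr/local/cuda.
cd /usr/local/cuda/samples/1_Utilities/deviceQuery
sudo make
./deviceQuery        # should list the integrated Xavier GPU

# Host-side alternative: tegrastats shows live GPU (GR3D) utilization
sudo tegrastats
```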

Oh… so any idea about how I can validate whether the GPU passthrough in Docker is working (or not?)

I’m using it with a camera connected, so I just fire up GStreamer and see if the encoder works.

There is other stuff that needs more complicated testing though, like inference models etc. Try running the DeepStream samples.

Read the docs again, and pay special attention to the different instructions given for “dgpu” vs “jetson”. DGPU means desktop systems, and the procedure is a little different.

In my experience, getting the GPU to “work” means nothing; the difficulty comes down to software compatibility. You must have a clean version of JetPack, and you must be running containers that are compatible with it.

The newer versions of JetPack come with the “nvidia container” stuff; the older versions didn’t. The easy fix is to just update JetPack rather than mess around with manually installing packages.

That’s exactly what I did.
I installed the latest JetPack, which should have this built in.
Then I used the validation commands from the docs to confirm the proper packages are installed, and they are…
The idea is to run a Plex server, which knows how to use the GPU for transcoding; however, only the CPU is used.

Maybe it’s working fine already. Run bash instead of nvidia-smi inside the container, then try to run GStreamer.
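A quick encoder smoke test along those lines, runnable inside an L4T-based container without a camera. The pipeline is a sketch: the element names (`nvvidconv`, `nvv4l2h264enc`) are the ones L4T’s GStreamer plugins provide on recent JetPack releases; older releases used `omxh264enc` instead.

```shell
# Sketch: exercise the Jetson's hardware H.264 encoder with a test source.
# If this pipeline runs to completion, the encoder path is reachable
# from inside the container.
gst-launch-1.0 videotestsrc num-buffers=300 \
  ! 'video/x-raw,width=1280,height=720,framerate=30/1' \
  ! nvvidconv \
  ! 'video/x-raw(memory:NVMM)' \
  ! nvv4l2h264enc \
  ! fakesink
```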

I keep getting the same error when adding the GPU device. docker-compose.yaml:

services:
  test:
    image: nvidia/cuda:10.2-base
    command: echo "hello"
    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              capabilities: [gpu]

And output for docker-compose up:

Removing cuda_test_1
Recreating faedc76c10d5_cuda_test_1 ... error

ERROR: for faedc76c10d5_cuda_test_1  Cannot start service test: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: Auto-detected mode as 'csv'
invoking the NVIDIA Container Runtime Hook directly (e.g. specifying the docker --gpus flag) is not supported. Please use the NVIDIA Container Runtime instead.: unknown

ERROR: for test  Cannot start service test: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: Auto-detected mode as 'csv'
invoking the NVIDIA Container Runtime Hook directly (e.g. specifying the docker --gpus flag) is not supported. Please use the NVIDIA Container Runtime instead.: unknown
ERROR: Encountered errors while bringing up the project.
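Since the error complains about the hook being invoked directly, it might be worth selecting the nvidia runtime at the service level instead of using the `deploy.resources` device reservation. A sketch, assuming a Compose version that supports the `runtime` key (the image is the one from the thread; on a Jetson it may also need to be an L4T-compatible image):

```yaml
services:
  test:
    image: nvidia/cuda:10.2-base
    command: echo "hello"
    runtime: nvidia
```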

Follow the example here: Your First Jetson Container | NVIDIA Developer

L4T containers are the ones you want: NVIDIA L4T Base | NVIDIA NGC
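The linked walkthrough boils down to something like the following. The tag `r32.7.1` is an assumption for illustration; it must match the L4T release on the device:

```shell
# Sketch: run the L4T base container via the nvidia runtime.
# The tag is an example; match it to your JetPack's L4T version,
# which you can check with: cat /etc/nv_tegra_release
sudo docker run -it --rm --net=host --runtime nvidia nvcr.io/nvidia/l4t-base:r32.7.1
```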

CUDA works a bit differently on the Jetson, and NVIDIA keeps changing the way things work as well. Don’t follow the same procedure as you would on desktop.

Looks like that’s a non-Tegra container you are trying; use the L4T version instead: NVIDIA L4T CUDA | NVIDIA NGC
