Docker image for deepstream and pytorch

dangraf · March 16, 2021, 9:32am

As the title says, I’m trying to create a docker image with both deepstream and pytorch but am currently failing.

My system setup: Jetson AGX with a clean jetpack 5.1.
My first try was to merge two images as a multi-stage docker file:
FROM nvcr.io/nvidia/l4t-pytorch:r32.5.0-pth1.7-py3
FROM nvcr.io/nvidia/deepstream-l4t:5.1-21.02-sample

But this did not work. I guess it’s because the first image uses jp5.0 and the second 5.1

I then tried to use the deepstream docker container as my starting point and then install pytorch.

FROM nvcr.io/nvidia/deepstream-l4t:5.1-21.02-samples
RUN pip3 install Cython
RUN pip3 install numpy

RUN mkdir torch_install
RUN wget https://nvidia.box.com/shared/static/p57jwntv436lfrd78inwl7iml6p13fzh.whl -O torch_install/torch-1.8.0-cp36-cp36m-linux_aarch64.whl
RUN apt-get install python3-pip libopenblas-base libopenmpi-dev -y
RUN cd torch_install && pip3 install torch-1.8.0-cp36-cp36m-linux_aarch64.whl && cd ..

RUN apt-get install libjpeg-dev zlib1g-dev libpython3-dev libavcodec-dev libavformat-dev libswscale-dev -y
RUN git clone --branch v0.9.0 https://github.com/pytorch/vision /opt/nvidia/deepstream/deepstream-5.1/sources/torchvision
RUN pip3 install PyYAML tqdm
RUN pip3 install requests
RUN pip3 install onnx pycuda
RUN apt-get install libopenblas-dev -y

RUN export BUILD_VERSION=0.9.0 && \
    export LD_LIBRARY_PATH=/usr/local/cuda-10.2/targets/aarch64-linux/lib && \
    python3 setup.py install

But this gives the error:

Step 40/40 : RUN export BUILD_VERSION=0.9.0 && export LD_LIBRARY_PATH=/usr/local/cuda-10.2/targets/aarch64-linux/lib && python3 setup.py install
 ---> Running in a29f9103cbee
Traceback (most recent call last):
  File "setup.py", line 12, in <module>
    import torch
  File "/usr/local/lib/python3.6/dist-packages/torch/__init__.py", line 195, in <module>
    _load_global_deps()
  File "/usr/local/lib/python3.6/dist-packages/torch/__init__.py", line 148, in _load_global_deps
    ctypes.CDLL(lib_path, mode=ctypes.RTLD_GLOBAL)
  File "/usr/lib/python3.6/ctypes/__init__.py", line 348, in __init__
    self._handle = _dlopen(self._name, mode)
OSError: libcurand.so.10: cannot open shared object file: No such file or directory
The command '/bin/sh -c export BUILD_VERSION=0.9.0 && export LD_LIBRARY_PATH=/usr/local/cuda-10.2/targets/aarch64-linux/lib && python3 setup.py install' returned a non-zero code: 1

I then tried commenting out the line “python3 setup.py install” for the torchvision installation, starting the container, and running the command manually.

This succeeds! It’s possible to install torchvision that way.

I would like to understand why the command fails in the Dockerfile but succeeds when I run it inside the container.
My guess is that I have access to the CUDA devices while running the container but not while building the image.

How do I change my Dockerfile so it can install torchvision?

AastaLLL · March 16, 2021, 12:02pm

Hi,

Please note that l4t-pytorch:r32.5.0-pth1.7-py3 indicates that the L4T version is r32.5.
But in deepstream-l4t:5.1-21.02-sample, 5.1 is the DeepStream library version and is not related to the L4T version.

You can build pytorch and torchvision from a Dockerfile.
You can find an example below:

https://github.com/dusty-nv/jetson-containers/blob/master/Dockerfile.pytorch

Would you mind trying it with the base image changed to the DeepStream container to see if it works?

Thanks.

dangraf · March 17, 2021, 5:06pm

I’ve tried to use that Dockerfile as a base but get the same error. It uses an old version of pytorch and a lot of packages that do not exist for JetPack v4.5.1.

You also mention that the deepstream image is not related to L4T, but why does it have l4t in the image name?

AastaLLL · March 18, 2021, 6:56am

Hi,

Since DeepStream supports both Jetson and desktop, the l4t tag is used to distinguish the target environment.

To run the Dockerfile on JetPack 4.5.1, please update the corresponding package versions based on the topic below:

Thanks.

dangraf · March 18, 2021, 10:30am

Thanks, but I have followed the exact steps you suggest, and they work when I run them inside the container but not when I put them in a Dockerfile. (I tried to explain this in my original post.) When I follow these steps in the Dockerfile, I get the error described above.

AastaLLL · March 30, 2021, 7:53am

Hi,

In general, we can get torchvision installed with the pyTorch base.
Let us try it with the deepstream base and share more information with you later.

Thanks.

AastaLLL · April 16, 2021, 3:29am

Hi,

Sorry for the late update.
The OSError: libcurand.so.10 can be solved by setting nvidia as the Docker default runtime, so that docker build uses it as well.
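
To see whether a given environment (a build step or a running container) can actually open the library, a small check like the one below may help. It is only a sketch: can_load is a hypothetical helper name, and it simply repeats the ctypes.CDLL call that fails in the traceback above. The assumption behind it is that on this JetPack release the CUDA libraries are made available inside the container by the nvidia runtime, which docker build does not use unless it is the default runtime.

```python
import ctypes


def can_load(lib_name: str) -> bool:
    """Return True if the dynamic loader can open lib_name, else False."""
    try:
        # Same call that torch makes in _load_global_deps()
        ctypes.CDLL(lib_name, mode=ctypes.RTLD_GLOBAL)
    except OSError:
        return False
    return True


if __name__ == "__main__":
    # Library name taken from the error message in this thread
    name = "libcurand.so.10"
    print(f"{name}: {'found' if can_load(name) else 'NOT found'}")
```

Running this both in a RUN step and inside a started container should show whether the two environments differ.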

1. Edit /etc/docker/daemon.json with the following patch and reboot:

diff --git a/daemon.json b/daemon.json
index ad77732..9afc625 100644
--- a/daemon.json
+++ b/daemon.json
@@ -4,5 +4,7 @@
             "path": "nvidia-container-runtime",
             "runtimeArgs": []
         }
-    }
+    },
+
+    "default-runtime": "nvidia"
 }

2. We can build torchvision within deepstream-l4t:5.1-21.02-samples as below:
Dockerfile (849 Bytes)

$ sudo docker build .
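
If the build still fails with the same error, it may be worth confirming that the daemon.json change from step 1 is in place. The snippet below is a minimal sketch, not an official tool: default_runtime is a hypothetical helper, and the sample JSON assumes the usual layout of the file with the "runtimes" entry around the lines shown in the patch.

```python
import json
from typing import Optional


def default_runtime(daemon_json_text: str) -> Optional[str]:
    """Return the "default-runtime" value from daemon.json text, or None if unset."""
    config = json.loads(daemon_json_text)
    return config.get("default-runtime")


# Sample content mirroring the patched file from step 1 (layout is an assumption)
patched = """
{
    "runtimes": {
        "nvidia": {
            "path": "nvidia-container-runtime",
            "runtimeArgs": []
        }
    },

    "default-runtime": "nvidia"
}
"""

# The same file before the patch, without the default-runtime key
unpatched = '{"runtimes": {"nvidia": {"path": "nvidia-container-runtime", "runtimeArgs": []}}}'

print(default_runtime(patched))    # nvidia
print(default_runtime(unpatched))  # None
```

On the device itself, the text would come from reading /etc/docker/daemon.json instead of the sample strings.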

Thanks.
