Hello all,
I am trying to setup the Jetson Nano using Docker and the existing containers. I have reviewed several pages on this forum but I was not able to fix the issues I am having. I am assuming the Docker container cannot reach the CUDA libraries.
Setup:
- Jetson Nano Development Kit 4 GB
- Jetpack 4.6.1 [L4T 32.7.1]
- NVIDIA (R) Cuda compiler driver Cuda compilation tools, release 10.2, V10.2.300
First attempt
Based on the Dockerfile of GitHub - dusty-nv/jetson-containers: Machine Learning Containers for NVIDIA Jetson and JetPack-L4T, I try to build a container from l4t-base:r32.7.1 with torch and torchvision.
FROM nvcr.io/nvidia/l4t-base:r32.7.1
ENV DEBIAN_FRONTEND=noninteractive
RUN apt-get update && \
apt-get install -y --no-install-recommends \
python3-pip \
python3-dev \
libopenblas-base \
libopenblas-dev \
libopenmpi-dev \
openmpi-bin \
openmpi-common \
gfortran \
libomp-dev \
git \
libjpeg-dev \
zlib1g-dev \
libpython3-dev \
libavcodec-dev \
libavformat-dev \
libswscale-dev \
build-essential \
&& rm -rf /var/lib/apt/lists/* \
&& apt-get clean
RUN pip3 install --upgrade pip
RUN pip3 install --no-cache-dir setuptools Cython wheel
RUN pip3 install --no-cache-dir -U jetson-stats
RUN pip3 install --no-cache-dir --verbose numpy
# PyTorch (for JetPack 4.6 DP)
ARG PYTORCH_URL=https://nvidia.box.com/shared/static/fjtbno0vpo676a25cgvuqc1wty0fkkg6.whl
ARG PYTORCH_WHL=torch-1.10.0-cp36-cp36m-linux_aarch64.whl
RUN wget --quiet --show-progress --progress=bar:force:noscroll --no-check-certificate ${PYTORCH_URL} -O ${PYTORCH_WHL} && \
pip3 install --no-cache-dir --verbose ${PYTORCH_WHL} && \
rm ${PYTORCH_WHL}
# torchvision 0.11.1
ARG TORCHVISION_VERSION=v0.10.0
ARG TORCH_CUDA_ARCH_LIST="5.3;6.2;7.2;8.7;10.2"
RUN printenv && echo "torchvision version = $TORCHVISION_VERSION" && echo "TORCH_CUDA_ARCH_LIST = $TORCH_CUDA_ARCH_LIST"
RUN git clone https://github.com/pytorch/vision torchvision && \
cd torchvision && \
git checkout ${TORCHVISION_VERSION} && \
python3 setup.py install && \
cd ../ && \
rm -rf torchvision
This fails when I try to install torchvision as it cannot find libcurand.so.10
Second attempt
I use the existing torch container provided by NVIDIA:
nvcr.io/nvidia/l4t-pytorch:r32.7.1-pth1.10-py3
If I import torch there it cannot find libcurand.so.10
Another note that I found is stat GPG public key is missing in this torch container and therefore no other packages cannot be installed.
Looking forward to your reply.