Why doesn't the Docker container see CUDA?

Hi, Team!
We have a Jetson Nano 4GB
Ubuntu 18.04
JetPack 4.6.0
OpenCV 4.5.3

When we run the following command, CUDA works great:

	docker run -it --rm --runtime nvidia --network host -v $(pwd):/app/src -v /usr/local/cuda-10.2/:/usr/local/cuda-10.2/:ro -v /usr/lib/aarch64-linux-gnu/:/usr/lib/aarch64-linux-gnu -v /usr/include/aarch64-linux-gnu:/usr/include/aarch64-linux-gnu jetson-build bash

But without manually mounting the CUDA paths in this command, it doesn't work. We get this error:

>>> import torch
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/usr/local/lib/python3.6/dist-packages/torch/__init__.py", line 196, in <module>
  File "/usr/local/lib/python3.6/dist-packages/torch/__init__.py", line 149, in _load_global_deps
    ctypes.CDLL(lib_path, mode=ctypes.RTLD_GLOBAL)
  File "/usr/lib/python3.6/ctypes/__init__.py", line 348, in __init__
    self._handle = _dlopen(self._name, mode)
OSError: libcurand.so.10: cannot open shared object file: No such file or directory
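For reference, torch dlopen()s libcurand.so.10 at import time, so this error means the library simply isn't visible inside the container. A quick, hedged sanity check you can run inside the container (ldconfig is standard on L4T images; nothing here is specific to this setup):

```shell
# torch dlopen()s libcurand.so.10 when it is imported; ask the dynamic
# linker directly whether that library is visible in this environment:
if ldconfig -p 2>/dev/null | grep -q libcurand; then
    echo "libcurand visible - the runtime mounted the CUDA libraries"
else
    echo "libcurand NOT visible - the runtime did not mount CUDA"
fi
```

If this prints "NOT visible" inside a `--runtime nvidia` container, the problem is in the container runtime's mounting, not in PyTorch.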

When we run the following command, the cuDNN header files come up empty, so CUDA isn't being mounted into the Docker container:

	docker run -it --rm --runtime nvidia --gpus all nvcr.io/nvidia/l4t-pytorch:r32.6.1-pth1.9-py3 bash

root@ec3f7b8ddc55:/usr/include/aarch64-linux-gnu# ls
NvCaffeParser.h       NvInferRuntime.h        NvUtils.h         cblas.h               cudnn_ops_infer_v8.h  fpu_control.h      sys
NvInfer.h             NvInferRuntimeCommon.h  a.out.h           cudnn_adv_infer_v8.h  cudnn_ops_train_v8.h  gnu
NvInferImpl.h         NvInferVersion.h        asm               cudnn_adv_train_v8.h  cudnn_v8.h            ieee754.h
NvInferLegacyDims.h   NvOnnxConfig.h          bits              cudnn_backend_v8.h    cudnn_version_v8.h    jconfig.h
NvInferPlugin.h       NvOnnxParser.h          c++               cudnn_cnn_infer_v8.h  expat_config.h        openblas_config.h
NvInferPluginUtils.h  NvUffParser.h           cblas-openblas.h  cudnn_cnn_train_v8.h  f77blas.h             python3.6m
root@ec3f7b8ddc55:/usr/include/aarch64-linux-gnu# cat cudnn_v8.h
root@ec3f7b8ddc55:/usr/include/aarch64-linux-gnu# cat cudnn_version_v8.h
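Some background that may help with debugging: on JetPack 4.x the NVIDIA container runtime decides what to bind-mount into the container from CSV lists under /etc/nvidia-container-runtime/host-files-for-container.d/ on the host (cuda.csv, cudnn.csv, l4t.csv, ...). If those lists are missing or point at files that no longer exist, you get exactly this picture: the paths appear in the container but are empty. A minimal sketch of the CSV line format (the sample line is illustrative, in the cuda.csv style; inspect the real files on your host):

```shell
# Each CSV line is "<type>, <host path>"; the runtime bind-mounts
# <host path> into the container. Parse a sample line (illustrative):
csv_line="lib, /usr/local/cuda-10.2/targets/aarch64-linux/lib/libcurand.so.10"
host_path="${csv_line#*, }"
echo "would mount: ${host_path}"
# On the host, inspect the real mount lists with e.g.:
#   ls /etc/nvidia-container-runtime/host-files-for-container.d/
#   grep curand /etc/nvidia-container-runtime/host-files-for-container.d/cuda.csv
```

If `cuda.csv` on your host lists paths that don't exist, that would explain why the mounts come up empty.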

So, where should we find JetPack on the Ubuntu install on our Jetson Nano 4GB? (/etc/fstab - I don't have that file, and /media/nvidia/NVME - we don't have that directory either.)
Or do you have any other ideas on how to fix this issue?
Thanks.

@s.pinchuk as per your last post:

something with your nvidia-container-runtime had gotten misconfigured/broken at some point and is no longer mounting CUDA/cuDNN/etc. into the container correctly. The most reliable recommendation is to reflash the device with JetPack and check that --runtime nvidia works properly for you from the beginning.

I have already reinstalled the system, and CUDA is present on the host machine, but it still doesn't get mounted into the Docker container.
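One thing worth checking after a reinstall is that the nvidia runtime is actually registered with Docker. On JetPack this lives in /etc/docker/daemon.json. A sketch using a sample config written to a temp file (the sample mirrors the stock JetPack entry; compare it against your real /etc/docker/daemon.json rather than overwriting it):

```shell
# Write a sample of the stock JetPack /etc/docker/daemon.json to a temp
# file (illustrative copy - inspect the real file on your device):
cat > /tmp/daemon.sample.json <<'EOF'
{
    "runtimes": {
        "nvidia": {
            "path": "nvidia-container-runtime",
            "runtimeArgs": []
        }
    }
}
EOF
# Verify the nvidia runtime entry is present in the config:
python3 -c "import json; cfg = json.load(open('/tmp/daemon.sample.json')); print('nvidia runtime registered:', 'nvidia' in cfg.get('runtimes', {}))"
```

If your real daemon.json is missing the `nvidia` entry, `--runtime nvidia` falls back to failing or to the default runtime, and nothing gets mounted.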

Can I reinstall or upgrade JetPack from terminal?

@s.pinchuk please try this sequence, starting after flashing the SD card with a fresh install. The SD card image already comes with CUDA Toolkit and the NVIDIA Container Runtime installed. Try it with the latest JetPack 4.6.4 instead.

  1. Flash the SD card image.
  2. Test the CUDA samples outside of the container:
cd /usr/local/cuda/samples/1_Utilities/deviceQuery
sudo make
./deviceQuery
  3. Test the CUDA samples inside the l4t-base container (note the r32.7.1 tag is for JetPack >= 4.6.1):
sudo docker run --runtime nvidia -it --rm nvcr.io/nvidia/l4t-base:r32.7.1
cd /usr/local/cuda/samples/1_Utilities/deviceQuery
./deviceQuery

If step 3 doesn’t work, reflash again to get your system environment in a known-good state.

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.