https://gitlab.com/nvidia/container-images/cuda
This issue has been fixed in the cuda/dist/12.9.* and cuda/dist/13.0.0 Dockerfiles.
See attachment FilesThatNeedtobeCorrected.txt (136.3 KB)
If you are still building Docker images for any of the earlier CUDA versions
(11.4.3, 11.5.2, 11.6.2, 11.7.1, 11.8.0, 12.0.1, 12.1.1, 12.2.2, 12.3.2, 12.4.1, 12.5.0, 12.5.1, 12.6.3, 12.8.0, 12.8.1),
the issue still needs to be fixed there.
I encountered the error in the nvcr.io/nvidia Docker images listed below. These are the images I have used, and I have just rechecked them. The last image I checked was l4t-cuda:12.6.11-runtime. I remembered that I had found the GitLab cuda repo about a year ago, so I ran git pull to update it. Hence this post.
grep "/usr/local/nvidia" --include=*.Dockerfile -irHn
grep "/usr/local/nvidia" --include=*.j2 -irHn
templates/ubuntu/base-dockerfile.j2:87:RUN echo "/usr/local/nvidia/lib" >> /etc/ld.so.conf.d/nvidia.conf \
templates/ubuntu/base-dockerfile.j2:88: && echo "/usr/local/nvidia/lib64" >> /etc/ld.so.conf.d/nvidia.conf
templates/ubuntu/base-dockerfile.j2:90:ENV PATH /usr/local/nvidia/bin:/usr/local/cuda/bin:${PATH}
templates/ubuntu/base-dockerfile.j2:91:ENV LD_LIBRARY_PATH /usr/local/nvidia/lib:/usr/local/nvidia/lib64
templates/redhat/base-dockerfile.j2:70:RUN echo "/usr/local/nvidia/lib" >> /etc/ld.so.conf.d/nvidia.conf && \
templates/redhat/base-dockerfile.j2:71: echo "/usr/local/nvidia/lib64" >> /etc/ld.so.conf.d/nvidia.conf
templates/redhat/base-dockerfile.j2:73:ENV PATH /usr/local/nvidia/bin:/usr/local/cuda/bin:${PATH}
templates/redhat/base-dockerfile.j2:74:ENV LD_LIBRARY_PATH /usr/local/nvidia/lib:/usr/local/nvidia/lib64
nvcr.io/nvidia/l4t-cuda:12.6.11-runtime
/etc/ld.so.conf.d/nvidia-tegra.conf:/usr/lib/aarch64-linux-gnu/tegra nonexistent
LD_LIBRARY_PATH=/usr/local/nvidia/lib:/usr/local/nvidia/lib64 nonexistent
/etc/ld.so.conf.d/nvidia.conf:/usr/local/nvidia/lib nonexistent
/etc/ld.so.conf.d/nvidia.conf:/usr/local/nvidia/lib64 nonexistent
PATH=/usr/local/nvidia/bin:
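For anyone who wants to repeat this check inside a container, here is a minimal sketch in POSIX shell. The function names check_ld_conf and check_path_var are made up for illustration and are not part of any NVIDIA tooling; the report format simply mimics the lines above.

```shell
#!/bin/sh
# Hypothetical helpers (names are illustrative, not from any NVIDIA tool).

# Print "<conf-file>:<dir> nonexistent" for every directory listed in the
# given ld.so.conf.d-style files that does not exist on this filesystem.
check_ld_conf() {
    for conf in "$@"; do
        [ -f "$conf" ] || continue
        while read -r dir || [ -n "$dir" ]; do
            # Skip blank lines, comments, and "include" directives.
            case "$dir" in
                ''|'#'*|include*) continue ;;
            esac
            [ -d "$dir" ] || printf '%s:%s nonexistent\n' "$conf" "$dir"
        done < "$conf"
    done
}

# Print "<NAME>=<dir> nonexistent" for every entry of a colon-separated
# list (such as PATH or LD_LIBRARY_PATH) that is not an existing directory.
check_path_var() {
    name=$1
    value=$2
    old_ifs=$IFS
    IFS=:
    set -f                      # no globbing while splitting on ':'
    for dir in $value; do
        [ -z "$dir" ] && continue
        [ -d "$dir" ] || printf '%s=%s nonexistent\n' "$name" "$dir"
    done
    set +f
    IFS=$old_ifs
}
```

Inside a running container the helpers could be called as check_ld_conf /etc/ld.so.conf.d/*.conf, check_path_var PATH "$PATH" and check_path_var LD_LIBRARY_PATH "$LD_LIBRARY_PATH". Unlike the summary above, each missing entry is reported on its own line.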
nvcr.io/nvidia/tritonserver:25.07-trtllm-python-py3
cat /etc/ld.so.conf.d/nvidia.conf
/etc/ld.so.conf.d/nvidia.conf:/usr/local/nvidia/lib
/etc/ld.so.conf.d/nvidia.conf:/usr/local/nvidia/lib64
These entries should point to /usr/local/cuda/lib64 and /usr/local/cuda/targets/sbsa-linux/lib/ instead.
PATH contains /usr/local/nvidia/bin, which should also point to /usr/local/cuda/bin.
nvcr.io/nvidia/pytorch:25.06-py3-igpu
/etc/ld.so.conf.d/nvidia.conf has /usr/local/nvidia/lib and /usr/local/nvidia/lib64
/etc/ld.so.conf.d/nvidia-tegra.conf has the nonexistent directory /usr/lib/aarch64-linux-gnu/tegra
The /usr/local/nvidia entry should be removed from PATH.
nvcr.io/nvidia/tritonserver:25.05-vllm-python-py3
Fix /etc/ld.so.conf.d/nvidia.conf as it has nonexistent directories.
/etc/ld.so.conf.d/nvidia.conf:/usr/local/nvidia/lib
/etc/ld.so.conf.d/nvidia.conf:/usr/local/nvidia/lib64
wrong PATH=/usr/local/nvidia/bin
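Until the images themselves are corrected, one possible workaround in a derived image is to prune the stale entries. The sketch below is only an illustration: prune_ld_conf and prune_path are hypothetical helper names, and it assumes nothing mounts the missing directories into the container at run time.

```shell
#!/bin/sh
# Hypothetical helpers (names are illustrative). Assumption: the missing
# directories are not provided later by a runtime mount.

# Rewrite an ld.so.conf.d-style file in place, keeping only the lines whose
# directory exists. Comments, blank lines and "include" lines are kept.
prune_ld_conf() {
    conf=$1
    pruned_tmp=$(mktemp) || return 1
    while read -r dir || [ -n "$dir" ]; do
        case "$dir" in
            ''|'#'*|include*) printf '%s\n' "$dir" ;;
            *) [ -d "$dir" ] && printf '%s\n' "$dir" ;;
        esac
    done < "$conf" > "$pruned_tmp"
    cat "$pruned_tmp" > "$conf"   # overwrite contents, keep owner and mode
    rm -f "$pruned_tmp"
}

# Echo a colon-separated list with every nonexistent directory removed.
prune_path() {
    result=
    old_ifs=$IFS
    IFS=:
    set -f                        # no globbing while splitting on ':'
    for dir in $1; do
        [ -d "$dir" ] || continue
        result=${result:+$result:}$dir
    done
    set +f
    IFS=$old_ifs
    printf '%s\n' "$result"
}
```

A possible use would be prune_ld_conf /etc/ld.so.conf.d/nvidia.conf followed by ldconfig to rebuild the linker cache, and PATH=$(prune_path "$PATH") for the search path. This only removes dead entries; it does not add the /usr/local/cuda directories suggested earlier.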
nvcr.io/nvidia/deepstream-l4t:7.0-triton-multiarch
/etc/ld.so.conf.d/nvidia.conf:/usr/local/nvidia/lib
/etc/ld.so.conf.d/nvidia.conf:/usr/local/nvidia/lib64
PATH=/usr/local/nvidia/bin