I’m working with a fresh install of the official JetPack 5.1.2 SD card image for the Xavier NX. The system has not been modified apart from adding a WiFi connection and running the following commands:
sudo apt update
sudo apt upgrade
# The system is run in headless mode so remove all desktop software
sudo apt remove ubuntu-desktop
sudo apt autoremove
# Jetson Stats (https://github.com/rbonghi/jetson_stats)
sudo apt install python3-pip
sudo pip3 install -U jetson-stats
When running nvidia-container-cli info I get the following output:
$ nvidia-container-cli info
NVRM version: (null)
CUDA version: 11.4
Device Index: 0
Device Minor: 0
Model: Xavier
Brand: (null)
GPU UUID: (null)
Bus Location: (null)
Architecture: 7.2
Why is it not showing the NVRM and GPU information? Is this an error in the JetPack version?
I had the same output from nvidia-container-cli info earlier today, when I could not access the GPU in Docker and thought I had messed up my system, which is why I did a full re-install. But that did not help, and now I’m at a loss as to why I cannot access the GPU inside the NVIDIA container runtime.
Hi @vsaw, Jetson doesn’t support NVRM or nvidia-smi. If you start an l4t container built for JetPack with --runtime nvidia, then the GPU should be accessible. If you start nvcr.io/nvidia/l4t-jetpack:r35.4.1, you can try running the CUDA samples that are in it, like deviceQuery/vectorAdd/etc., to confirm your GPU is working inside containers.
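For example, something along these lines should work (the path to the samples inside the container is an assumption on my part and may differ between releases):
sudo docker run -it --rm --runtime nvidia nvcr.io/nvidia/l4t-jetpack:r35.4.1
# inside the container (assumed sample location):
cd /usr/local/cuda/samples/1_Utilities/deviceQuery
make
./deviceQuery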
@dusty_nv I’ve been doing some more debugging and found out that cudnnCreate fails with error CUDNN_STATUS_NOT_INITIALIZED when running darknet in the Docker container nvidia/cuda:11.4.3-cudnn8-runtime-ubuntu20.04.
The same code works as expected when running natively on my Jetson Xavier NX DevKit running JetPack 5.1.2 (CUDA 11.4, cuDNN 8.6), and I confirmed that cudaGetDevice is called successfully before calling cudnnCreate.
Therefore I doubt that this is a code error; it seems more likely to be an issue with the container or image. However, I don’t know how to fix this from here 🤷‍♂️
Update
I got it working with nvcr.io/nvidia/l4t-jetpack:r35.4.1. The trick was to add -L/usr/local/cuda-11.4/targets/aarch64-linux/lib/stubs to LDFLAGS.
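For reference, the build inside the container ended up looking roughly like this (treat the make flags and the way LDFLAGS gets picked up as assumptions, since they depend on your darknet fork’s Makefile):
sudo docker run -it --rm --runtime nvidia -v $(pwd)/darknet:/darknet nvcr.io/nvidia/l4t-jetpack:r35.4.1
# inside the container, after appending the stubs dir to LDFLAGS in the Makefile:
#   LDFLAGS += -L/usr/local/cuda-11.4/targets/aarch64-linux/lib/stubs
cd /darknet
make GPU=1 CUDNN=1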
Is there an alternative image that’s smaller? 10 GB is pretty hefty just to run Darknet :-/
Once your application is built, you can deploy it in l4t-cuda, which has runtime variants that don’t include the full CUDA Toolkit. On JetPack 5, these components are inside the containers themselves, as opposed to being mounted from the host device (which is why they are bigger than the JetPack 4 containers).
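For example, a deployment run could look something like this (the l4t-cuda runtime tag below is an assumption, so check NGC for the tag matching your L4T release, and make sure the image satisfies whatever libraries your binary links against):
sudo docker run -it --rm --runtime nvidia -v $(pwd)/darknet:/darknet nvcr.io/nvidia/l4t-cuda:11.4.19-runtime /darknet/darknet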
Also, that 10GB will be shared in the Docker cache among any other containers using l4t-jetpack (which most GPU-accelerated containers for Jetson do), so you only need to download it once.