Adding GPU to Docker on Rocky Linux platform

alireza.kahdooee · February 4, 2024, 11:15am

I’m trying to make an NVIDIA GeForce GTX 1050 Ti graphics card available to Docker containers. Following the links below, I installed the graphics driver and CUDA, as well as the NVIDIA Container Toolkit for Docker, on Rocky Linux.

https://docs.nvidia.com/datacenter/tesla/tesla-installation-notes/index.html

https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/latest/install-guide.html

The relevant drivers were installed; the nouveau module is not loaded and the nvidia module is loaded.

But when a Docker container is up and I run the nvidia-smi command on the Rocky host, it reports that no processes were found:

Sun Feb  4 14:49:33 2024
+---------------------------------------------------------------------------------------+
| NVIDIA-SMI 545.23.08              Driver Version: 545.23.08    CUDA Version: 12.3     |
|-----------------------------------------+----------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |         Memory-Usage | GPU-Util  Compute M. |
|                                         |                      |               MIG M. |
|=========================================+======================+======================|
|   0  NVIDIA GeForce GTX 1050 Ti     On  | 00000000:04:00.0 Off |                  N/A |
|  0%   48C    P5              N/A /  75W |      1MiB /  4096MiB |      0%      Default |
|                                         |                      |                  N/A |
+-----------------------------------------+----------------------+----------------------+

+---------------------------------------------------------------------------------------+
| Processes:                                                                            |
|  GPU   GI   CI        PID   Type   Process name                            GPU Memory |
|        ID   ID                                                             Usage      |
|=======================================================================================|
|  No running processes found                                                           |
+---------------------------------------------------------------------------------------+

I started the respective container several times with the following parameters separately, but it didn’t make a difference.

--pid=host --privileged --gpus 'all,capabilities=utility' --runtime=nvidia

The output of the strace nvidia-smi command is as follows:
ou.txt (32.5 KB)

My questions:

I don’t know why the nvidia-smi command doesn’t show any processes, or whether the graphics card is actually available to the Docker container. :(
My next question is whether nvidia-smi should be run on the Rocky host or inside the container, as follows:

docker exec -it af868d81d6f4 nvidia-smi

alireza.kahdooee · February 6, 2024, 6:29am

Is there no one who can help me? I have been dealing with this issue for several days and it has not been resolved yet.

alireza.kahdooee · February 18, 2024, 7:17am

Is there no one at NVIDIA who can help me understand why the processes are not showing up?

nadeemm · February 28, 2024, 7:08pm

We are trying to find someone who could help.
I have not used Rocky or this container, but nvidia-smi reporting no running processes is what I would have expected, unless you are actually running some GPU-accelerated task in the background.
I would suggest running some of the CUDA Samples, something that takes a good few minutes, and then running nvidia-smi to see whether the process shows up.
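
To make that check a bit more concrete, here is a minimal shell sketch, not an official procedure. It assumes the installed nvidia-smi accepts `--query-gpu=utilization.gpu` with `--format=csv,noheader,nounits`, which prints one utilization percentage per GPU per call. The helper name `gpu_busy` and its threshold argument are made up for illustration.

```shell
# gpu_busy: hypothetical helper, not part of any NVIDIA tool.
# Reads GPU utilization samples (one integer percentage per line) from
# stdin and prints "busy" if any sample exceeds the threshold passed as
# $1 (default 0); otherwise it prints "idle".
gpu_busy() {
  awk -v t="${1:-0}" '$1 + 0 > t + 0 { busy = 1 } END { print (busy ? "busy" : "idle") }'
}

# Usage while a CUDA sample is running (30 samples, one per second):
#   for i in $(seq 30); do
#     nvidia-smi --query-gpu=utilization.gpu --format=csv,noheader,nounits
#     sleep 1
#   done | gpu_busy 0
```

A "busy" result only says the GPU did some work during the sampling window. It does not say which process was responsible, so it complements the process table rather than replacing it.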

Sorry again for the lack of response.

alireza.kahdooee · March 2, 2024, 4:42am

Thank you for your answer @nadeemm.
This is how I use it inside the container: Google Earth is pointed at the $DISPLAY of an Xvfb X server, and a VNC server then reads from that X server.

Maybe the GPU doesn’t support this method?
What’s the solution?

alireza.kahdooee · March 2, 2024, 4:06pm

I downloaded the CUDA Samples from https://github.com/nvidia/cuda-samples and ran them in the container as follows:

While they were running, I ran nvidia-smi in the container and it reported varying utilization percentages (3%, 9%, 16%).

I still don’t know whether the GPU is being used, because nvidia-smi doesn’t show any process even though I was checking its output in a loop.

Do you think the GPU is being used?
If it is, why doesn’t the Google Earth application that runs inside the container use the GPU?
What should I do?
