When using cloud vendor machines with GPUs that expect the driver directories and files to be mounted into the container, I am having trouble getting the management library installed so that I can query the GPU. If I build the image with the NVIDIA drivers forced in, along with the standard CUDA software to get a copy of the shared library, then when the machine starts on Azure or AWS I get errors from Kubernetes (which is being used to start these containers) saying that those directories are getting in the way of the nvidia-docker style mounts.
So, if I cannot install the NVIDIA drivers in the image, how on earth do I get the shared library and tools like nvidia-smi into the containers?
Use the NVIDIA container runtime plugin (previously called nvidia-docker 2.0):
[url]https://github.com/NVIDIA/nvidia-container-runtime[/url]
When you do that, you don’t install the driver bits into the container (the runtime injects them for you).
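In case it helps, here’s a rough sketch of the host-side setup, assuming the nvidia-container-runtime package is already installed on the host (package names and paths may differ by distribution):

```shell
# Register the nvidia runtime with Docker (host-side, one-time setup).
cat <<'EOF' | sudo tee /etc/docker/daemon.json
{
    "runtimes": {
        "nvidia": {
            "path": "/usr/bin/nvidia-container-runtime",
            "runtimeArgs": []
        }
    }
}
EOF
sudo systemctl restart docker

# The image itself carries no driver bits; the runtime injects
# libnvidia-ml.so, nvidia-smi, etc. from the host at container start.
docker run --runtime=nvidia --rm nvidia/cuda nvidia-smi
```

The key point is that only the host needs the driver installed; the container image stays driver-free.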
With respect to Kubernetes, this may be useful:
[url]https://kubernetes.io/docs/tasks/manage-gpus/scheduling-gpus/[/url]
Note that this situation is changing pretty rapidly (as indicated on the Kubernetes page), so the “recipe” may be different 6 months from now.
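For reference, a pod that requests a GPU through the device-plugin mechanism looks roughly like this (a sketch, assuming the NVIDIA device plugin DaemonSet is already deployed on the cluster):

```yaml
# Minimal pod requesting one GPU via the device-plugin resource name.
apiVersion: v1
kind: Pod
metadata:
  name: cuda-test
spec:
  restartPolicy: OnFailure
  containers:
    - name: cuda-test
      image: nvidia/cuda
      command: ["nvidia-smi"]
      resources:
        limits:
          nvidia.com/gpu: 1   # GPUs are requested in whole units
```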
Do you know of an Azure recipe for the plugin approach on k8s?
Thanks
A Google search on “gpu kubernetes azure” seemed to turn up several promising hits.
I was not able to find any via Google etc. that mention the plugin approach. Many older articles and blog posts abound for the older-style alpha.kubernetes.io/nvidia-gpu: 1 approach.
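For anyone comparing, the visible difference in the pod spec is mostly the resource name (a sketch; the surrounding container spec is elided):

```yaml
# Older alpha-style request seen in those blog posts (deprecated):
resources:
  limits:
    alpha.kubernetes.io/nvidia-gpu: 1

# Device-plugin style that goes with the nvidia container runtime approach:
resources:
  limits:
    nvidia.com/gpu: 1
```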
I will try Microsoft and see if I can find anything out about the ‘nvidia container runtime plugin’ approach, but it does not seem to have been on the MSDN radar, at least.