As the title suggests, I'm able to run a higher version of CUDA in a container than the host driver supports, and test computations come out fine. I want to understand whether there is some undocumented forward-compatibility support for containers, or whether I'm just lucky it hasn't broken yet. The details:
Host system
OS: Ubuntu 18.04.3 LTS
NVIDIA Driver: 440.33.01 (acquired via nvidia-smi)
CUDA max supported version: 10.2 (acquired via nvidia-smi)
CUDA actual version installed: 9.1.85 (acquired via nvcc --version)
Container
Base Image: nvidia/cuda:11.3-devel or nvcr.io/nvidia/pytorch:21.06-py3 (I've tried both and the results were the same)
NVIDIA Driver: 440.33.01 (acquired via nvidia-smi)
CUDA max supported version: 11.3 (acquired via nvidia-smi)
CUDA actual version installed: 11.3.109 (acquired via nvcc --version)
All GPUs from the host are passed to the container (5 total); a quick visibility check is shown below.
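For context, this is roughly how I confirm from inside the container that PyTorch sees all five GPUs and which CUDA version it was built against (a minimal sketch; it assumes torch is already installed in the image, as it is in the nvcr.io/nvidia/pytorch image):

```python
import torch

# CUDA version PyTorch was built against (not necessarily the host driver's max).
print("torch.version.cuda:", torch.version.cuda)

# Enumerate the GPUs visible inside the container.
print("device count:", torch.cuda.device_count())
for i in range(torch.cuda.device_count()):
    print(f"GPU {i}: {torch.cuda.get_device_name(i)}")
```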
To test whether I could still use CUDA from PyTorch, I ran the usual PyTorch CUDA commands (torch.cuda.is_available(), torch.cuda.device(0), torch.cuda.get_device_name(0), etc.) and the simple example at the top of this PyTorch page. All of these tests passed.
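Roughly, the test looked like this (a minimal sketch along the lines of the checks above, not the exact example from that page):

```python
import torch

# Basic availability checks.
assert torch.cuda.is_available()
print(torch.cuda.get_device_name(0))

# Simple computation on the GPU, then bring the result back to the CPU to inspect it.
device = torch.device("cuda:0")
x = torch.ones(3, 3, device=device)
y = x + x
print(y.cpu())  # expect a 3x3 tensor of 2s
```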
Could I really be running CUDA 11.3 in the container when the host driver doesn’t support it? Could the 11.3 be misreported or is it just luck that I haven’t encountered an error yet?