When running containers, all GPUs are visible in the container regardless of the --gpus option setting

Hi.

I’m using the TensorFlow2 21.09-tf2-py3 and hpc-benchmarks containers from NGC.

Recently, I tried to limit the number of GPUs visible inside the containers for a test by using the "--gpus" option, but it does not work. Regardless of the "--gpus" setting, with any combination of GPU selections, all GPUs are visible in the container. I tried "--gpus" with GPU indices and with UUIDs, as well as the NVIDIA_VISIBLE_DEVICES option, but the container always includes all GPUs in the node.
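For reference, these are the option forms I understand "--gpus" and the NVIDIA runtime to accept (a sketch based on the Docker 19.03+ documentation; the index values and the UUID below are placeholders, not my actual devices):

  # select N GPUs by count
  docker run --rm --gpus 2 nvcr.io/nvidia/tensorflow:21.09-tf2-py3 nvidia-smi
  # select specific GPUs by index
  docker run --rm --gpus '"device=1,2"' nvcr.io/nvidia/tensorflow:21.09-tf2-py3 nvidia-smi
  # select a specific GPU by UUID (placeholder UUID)
  docker run --rm --gpus '"device=GPU-xxxxxxxx"' nvcr.io/nvidia/tensorflow:21.09-tf2-py3 nvidia-smi
  # equivalent selection via the NVIDIA runtime environment variable
  docker run --rm --runtime=nvidia -e NVIDIA_VISIBLE_DEVICES=1 nvcr.io/nvidia/tensorflow:21.09-tf2-py3 nvidia-smi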

How can I limit the number of GPUs visible in the above containers?

PS. The following run correctly shows 3 GPUs:
"docker run --rm --gpus 1,2,3 nvidia/cuda:11.0-base nvidia-smi"
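For completeness, the GPU indices and UUIDs I pass to these options come from the host driver, listed with:

  # list GPU indices and UUIDs on the host
  nvidia-smi -L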

Here is my environment.

OS = Ubuntu 18.04 (4.15.0-162-generic)
NVIDIA Driver = 470.57.02
CUDA version = 11.4
Docker version = 20.10.9, build c2ea9bc (with NVIDIA Docker)
NGC container

  • TensorFlow2 21.09-tf2-py3
  • hpc-benchmarks

The following are the scripts I used to run TensorFlow.

  1. Select all GPUs
    docker run --runtime=nvidia --shm-size=4g --ulimit memlock=-1 -ti --privileged --rm -v $(pwd):/workspace/nvidia-examples/cnn/scripts nvcr.io/nvidia/tensorflow:21.09-tf2-py3

  2. Select one GPU (GPU 0) using "--gpus 1"
    docker run --runtime=nvidia --gpus 1 --shm-size=4g --ulimit memlock=-1 -ti --privileged --rm -v $(pwd):/workspace/nvidia-examples/cnn/scripts nvcr.io/nvidia/tensorflow:21.09-tf2-py3

  3. Select one GPU (GPU 1) using "-e NVIDIA_VISIBLE_DEVICES=1"
    docker run --runtime=nvidia -e NVIDIA_VISIBLE_DEVICES=1 --shm-size=4g --ulimit memlock=-1 -ti --privileged --rm -v $(pwd):/workspace/nvidia-examples/cnn/scripts nvcr.io/nvidia/tensorflow:21.09-tf2-py3

==> Each run still shows all GPUs inside the container.
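For reference, this is a minimal way to check what the container actually exposes (assuming nvidia-smi and the image’s bundled TensorFlow 2 are available, which should be the case for the NGC TensorFlow container):

  # inside the running container: GPUs exposed by the NVIDIA runtime
  nvidia-smi -L
  # GPUs that TensorFlow 2 itself enumerates
  python -c "import tensorflow as tf; print(tf.config.list_physical_devices('GPU'))"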
