I am facing an issue when trying to start a container in a remote environment using NVIDIA AI Workbench after building it. Base environment used:
PyTorch 2.1 with CUDA 12.2 (v1.0.2) | Ubuntu 22.04 | Python 3
Upon launching the environment, I receive the following error message:
No GPUs Available
Not enough GPU resources are available. You can continue without GPUs. Additionally, you can cancel to manually stop projects to free up GPU resources.
This error occurs despite GPUs being available on the server. The output of the nvidia-smi command confirms that both GPUs (an NVIDIA GeForce RTX 3060 and a GeForce RTX 3060 Ti) are detected and show minimal memory usage.
Interestingly, when I run the container directly from the terminal, it successfully recognizes the GPUs. This indicates that there may be an issue with how the GPU resources are allocated or recognized specifically within NVIDIA AI Workbench.
I would appreciate any assistance in troubleshooting this issue to ensure the GPUs are available for the container in the AI Workbench environment.
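For reference, the direct-from-terminal check that does succeed for me looks roughly like this. The image tag below is only a placeholder; substitute the image Workbench built for your project:

```shell
# Run nvidia-smi inside a container to confirm GPU passthrough works
# outside of AI Workbench. IMAGE is a placeholder tag, not the exact
# image Workbench built.
IMAGE="nvcr.io/nvidia/pytorch:23.10-py3"
if command -v docker >/dev/null 2>&1; then
  docker run --rm --gpus all "$IMAGE" nvidia-smi \
    || echo "container could not see the GPUs"
else
  echo "docker is not on PATH"
fi
```

If this works from the terminal but Workbench still reports "No GPUs Available", the problem is presumably in how Workbench itself queries or allocates GPUs, not in the container runtime.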
Please tick the appropriate box to help us categorize your post
[x] Bug or Error
[ ] Feature Request
[ ] Documentation Issue
[ ] Other

logs.txt (7.9 KB)
Hi,
I am having the exact same problem. The GPU is on a remote Ubuntu machine, and I am accessing it from a Workbench environment on my Ubuntu laptop.
I have installed the NVIDIA Container Toolkit, so it doesn't seem to be a Docker issue. Some environment info below:
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 550.107.02 Driver Version: 550.107.02 CUDA Version: 12.4 |
|-----------------------------------------+------------------------+----------------------+
| GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|=========================================+========================+======================|
| 0 NVIDIA GeForce RTX 3080 Off | 00000000:17:00.0 Off | N/A |
| 0% 28C P8 5W / 370W | 10MiB / 10240MiB | 0% Default |
| | | N/A |
+-----------------------------------------+------------------------+----------------------+
+-----------------------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=========================================================================================|
| 0 N/A N/A 2988 G /usr/lib/xorg/Xorg 4MiB |
+-----------------------------------------------------------------------------------------+
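For completeness, this is roughly how I sanity-checked the toolkit install. The paths are the toolkit defaults on Ubuntu; adjust for your system:

```shell
# Confirm the NVIDIA Container Toolkit CLI is installed
if command -v nvidia-ctk >/dev/null 2>&1; then
  nvidia-ctk --version
else
  echo "nvidia-ctk not found"
fi

# After `nvidia-ctk runtime configure --runtime=docker`, the nvidia
# runtime should be registered in Docker's daemon config:
CONFIG=/etc/docker/daemon.json
[ -f "$CONFIG" ] && grep -i nvidia "$CONFIG" \
  || echo "no nvidia runtime entry found in $CONFIG"
```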
I am facing the same issue but with a different configuration. I have installed NVIDIA AI Workbench on my local MacBook Pro (M3), I am trying to run the example RAG repo, and I am using Docker. I am getting the error shown in the attached screenshot.
Hi, it looks like you are working locally on an ARM-based Mac.
If you are working locally on the Mac, please note that you need a dedicated GPU to run this project, and your current system does not have one.
Do you have a remote Ubuntu box you have access to with a GPU? If so you can connect to it from your Mac and use that location for compute for this project. You can read more about how to do so here.
(Also, please note that ARM-based Macs are generally unsupported by HF TGI, the base container image for this project. But it seems the project built fine for you, so it may be OK.)
I used Docker Desktop's macOS socket and configured my Workbench to point to that file, which allowed me to build the project. I am using NGC to access a GPU remotely, but I am not sure my NGC setup is configured correctly.
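In case it helps others on macOS: Docker Desktop exposes a per-user socket rather than the traditional system one, so a quick check like this (path is the Docker Desktop default) tells you what to point Workbench at:

```shell
# Docker Desktop on macOS puts its socket under the user's home
# directory; /var/run/docker.sock may be a symlink or absent.
SOCK="$HOME/.docker/run/docker.sock"
if [ -S "$SOCK" ]; then
  echo "Docker Desktop socket found at $SOCK"
else
  echo "No socket at $SOCK; check Docker Desktop's settings"
fi
```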