Can't start environment with GPU

leo_hsiao · June 6, 2024, 5:55am

Please provide the following info (tick the boxes after creating this topic):

Submission Type
Bug or Error
Feature Request
Documentation Issue
Question
Other

Workbench Version
Desktop App v0.50.17
CLI v0.21.3
Other

Host Machine operating system and location
Local Windows 11
Local Windows 10
Local macOS
Local Ubuntu 22.04
Remote Ubuntu 22.04
Other

Summary of the issue
When I start the environment, an error occurs, but when I choose not to use the GPU, the environment starts normally.

Error message
Here is the error log in workbench.log

{“level”:“info”,“time”:“2024-06-06T13:44:10+08:00”,“message”:“project ‘my-project’ is requesting 1 GPUs. Added runtime selection for GPUs.”}
{“level”:“warn”,“runtimeInfoPath”:“/home/rd/.nvwb/project-runtime-info/my-project-e834f8624793c4f0a48dab7b2b5d09801cf98164”,“time”:“2024-06-06T13:44:10+08:00”,“message”:“No git remote operation output files were found.”}
{“level”:“error”,“error”:“input: startProject exit status 125”,“time”:“2024-06-06T13:44:11+08:00”,“message”:“GIN-Graphql request failed”}
{“level”:“info”,“time”:“2024/06/06 - 13:44:11”,“status”:200,“latency”:“261.362116ms”,“client-ip”:“127.0.0.1”,“method”:“POST”,“path”:“/v1/query”,“time”:“2024-06-06T13:44:11+08:00”,“message”:“GIN-Request”}

Screenshots

twhitehouse · June 11, 2024, 1:44pm

Hi Leo

sorry for taking so long to get back to you.

Can you please do the following and then send us the logs?

Open a terminal and activate your local context in debug mode
- nvwb --debug activate local
Open the Project and start the container as you usually do in the UI with the GPU enabled
- Or you can do it in the CLI as follows:
  - nvwb open <project_name>
  - nvwb start jupyterlab

This will give us more information on what’s happening.

Thanks

leo_hsiao · June 12, 2024, 2:03am

Hi

Here is my logs.

workbench.log (18.7 KB)

Thanks!

twhitehouse · June 12, 2024, 1:37pm

Hi Leo,

I didn’t give you full instructions. Sorry.

In order to set the --debug mode you first need to completely shutdown Workbench on your local machine.

You can do this by fully closing and quitting the Desktop application.

Or if you have an active session in your terminal you can do the following:

nvwb -f --shutdown deactivate

This will force close everything, including running Projects.

Then, activate the local context in debug mode and fire up the container with the GPUs enabled again.

Sorry for missing this.

leo_hsiao · June 13, 2024, 1:45am

Hi

Here is my logs. I didn’t set the debug mode correctly, sorry.

workbench.log (1.1 MB)

Thanks!

bfurtaw · June 14, 2024, 3:28pm

Hi there,
Your logs are showing a problem engaging the GPU a GeForce 3080 in Ai Workbench. Meanwhile your CUDA runtime and driver are current. Could we run a docker test independent of AI Workbench to help diagnose? Please try nvidia-smi in the docker container environment test and see if the docker environment can recognize the GPU. Please run

docker run --gpus all -it --rm nvcr.io/nvidia/pytorch:23.07-py3 nvidia-smi

and send the results.

If this test fails to see CUDA initialize in the pytorch container, next steps are to troubleshoot libnvidia-container TK Troubleshooting — NVIDIA Container Toolkit 1.15.0 documentation

leo_hsiao · June 17, 2024, 9:47am

Hi
Here is my result running the command.

Thanks!

bfurtaw · June 18, 2024, 2:22pm

Lets try a fix for this I found helpful. Edit the /etc/nvidia-container-runtime/config.toml using vi or nano as-in

sudo vi /etc/nvidia-container-runtime/config.toml

Modify the line

no-cgroup = true

to

no-cgroup = false

and save the file.

then

sudo systemctl restart docker

and retry the container

docker run --gpus all -it … nvidia-smi

cmd from above.

leo_hsiao · June 19, 2024, 9:59am

Thanks for your replying, I reinstalled my system and solved my problem.

Thanks for your help in these days.

Topic		Replies	Views
No GPUs Available NVIDIA AI Workbench	10	568	April 8, 2025
NVIDIA AI workbench NVIDIA AI Workbench	1	90	October 7, 2024
No GPUs Available error NVIDIA AI Workbench	1	87	August 29, 2024
An Error Occured while executing wb-svc-quiet start-container-tool NVIDIA AI Workbench	5	121	January 22, 2025
NVIDIA AI Workbench Error When Running Local NVIDIA AI Workbench nim	3	144	June 30, 2025
Local compute unavailable NVIDIA AI Workbench	21	619	August 2, 2024
Workbench service not reachable NVIDIA AI Workbench	7	209	August 26, 2024
Docker not detecting Nvidia GPU NVIDIA AI Workbench	3	4785	February 17, 2025
Can't connect to local (the machine i am working on) NVIDIA AI Workbench	7	310	February 4, 2025
Issue while trying to install ngc NVIDIA AI Workbench	1	255	April 4, 2024

Can't start environment with GPU

Related topics