AI Workbench project stuck in rebuild mode

If you are reporting a bug or error, consider submitting a Support Bundle to aiworkbench-ea@nvidia.com. This will help us solve your issue more quickly.

Please describe your issue or request: (tick the boxes after creating this topic):

Please tick the appropriate box to help us categorize your post
Bug or Error
Feature Request
Documentation Issue
Other

I am using a nvidia nim anywhere project in workbench configuring mounts for the project. I kept seeing an error when I launched jupyter notebook saying docker was not found at /var/host-run/ Well I knew I had changed that in the environment mounts so found it in the postbuild script and changed it.

When I closed the edit, the project went into rebuild mode and got stuck on step 6. I closed the project & workbench, but it was still building when I relaunched. I even restarted the computer. When I press the stop build button it says there is a problem stopping. So I went to the logs and it said it is trying to stop a container that isn’t running.

So do I have an option other than to remove the project and start over?

Thanks for posting and sorry for the late reply.

You don’t need to delete the project.

Instead, you can delete the build-output.<state> file from the project’s runtime info folder. the state of the container build is in the last part of the file name, i.e. success, error, or building.

Yours probably is build-output.building. If you delete it and restart the build, things should work.

You can see more about this file here: Log and Runtime Files - NVIDIA Docs

Thanks for the response. I suspected there was something like that but didn’t know where to look.

1 Like