Disk space not free even after removing the project from AI workbench

raghums534 · September 7, 2024, 7:50pm

If you are reporting a bug or error, consider submitting a Support Bundle to aiworkbench-ea@nvidia.com. This will help us solve your issue more quickly.

Please describe your issue or request: (tick the boxes after creating this topic):

Please tick the appropriate box to help us categorize your post
[*] Bug or Error
Feature Request
Documentation Issue
Other

Hi Team,
I was trying out the example-hybrid-rag example of AI workbench . Firstly the build was successful, however when I tried to open the chat it gave me a connection failure error. I tried to fetch the logs and I did not find the necessary logs to debug the issue.

Secondly I tried to clear delete the project , however even after the deleting the project the disk space did not free because of which I do not have sufficient space in my c drive. It took around 80 GB this activity but I’m not able to free up the space because I’m not sure why this project is occupying my disk space even when I have deleted the project, I’m pretty sure the I had at least 80 GB before building this project. Would appreciate if this issue is fixed on priority.

Thanks and Regards,
Raghu

twhitehouse · September 8, 2024, 8:18pm

You need to delete the container image, which is typically quite large.

I believe you are on Windows.

If you go into the WSL distro, NVIDIA-Workbench and run the command docker images you will see the different images you have and their sizes.

Does that account for the size on disk?

raghums534 · September 8, 2024, 8:41pm

Yes you are right, I’m on windows. I don’t find WSL distro from my command prompt . I only see WSL…
wsl_command

twhitehouse · September 8, 2024, 8:43pm

sorry.
open the terminal from a windows shell with:
wsl -d NVIDIA-Workbench

Then run:
docker images

raghums534 · September 8, 2024, 8:47pm

edwli · September 9, 2024, 8:08pm

Hi,

When you pull a container image to your machine, those image layers get cached on your system, which can take up space. Further, when you download large files like model weights from hugging face inside those containers, those resources may get cached as well. This can take up several gigabytes of space.

When you delete a project, the build cache typically remains such that if you were to re-pull a project, you do not necessarily need to undergo another build process. However, if you would like to delete the contents of the cache, you can try to run the following:

docker system prune -a. This also should work with podman.

This should clear out the cache of all unused images and resources on your system and typically can free up a good amount of space. Hope this helps!

raghums534 · September 9, 2024, 8:28pm

Hi,
I ran the command that you have mentioned , it says …

5mwmz5h3uvmvn0wz84335cetp
tinor552hil7pzi7wn4ke3nl8
z1qtgj2h3l7j0n0dp27wufz99
bj7hafflwh6socifa43qh2h1b

Total reclaimed space: 18.31GB

But I don’t see the disk space being cleared…

Br,
Raghu

edwli · September 9, 2024, 8:55pm

Ah I see, so from this thread,

WSL2 works like a virtual machine; the guest OS doesn’t directly store individual files in your host filesystem – it has a dedicated virtual “disk” that is stored in a single large .vhdx file as seen by Windows.

This file grows dynamically as the guest OS writes data to new “disk” areas, but does not shrink when those areas are no longer used. For example, if the virtual disk had sectors 10000-20000 in use by a large file and you removed it, the space for those sectors (and their data, too) is still consumed by the VHDX file.

So over time, as Ubuntu places new files all over the disk area, the VHDX will continue growing until it eventually reaches the complete “disk size” that was given to the VM.

There are some other threads you can find online to optimize your virtual disk used by WSL2, such as optimize-vhd. @bfurtaw can help you locate the virtual disk used by the NVIDIA-Workbench WSL distro.

raghums534 · September 9, 2024, 9:59pm

Oh ok, I tried to go through the threads but not so clear as to what the solution is. Is it possible to maybe have a small debug call and clarify it / get it fixed because I want to create a new project and I might need more disk space… Also I want to know how to get the disk space back to normal when I no longer need the project…

twhitehouse · September 10, 2024, 3:32pm

For sure. This is something we want to figure out as well.

Please email aiworkbench-ea@nvidia.com to set it up.

bfurtaw · September 11, 2024, 6:23pm

What is the size of the ext4.vhdx file in C:\ProgramData\NVIDIA Corporation\workbench ?

bwc-daniel · September 29, 2024, 5:49pm

One of the good things about AI Workbench is that it takes care of a lot of the admin related work for you. For example, you don’t have to manage or worry about WSL.

The bad news is that having everything abstracted away also means you’re kind of limited to solutions through the interface, and as has been pointed out, in this case the limitation is really with how WSL is greedy when it comes to disk space.

There are a lot of good reasons for this, but of course it is in not ideal when you run into issues. Not only do you have to go down this rabbit hole, but because you’re embarking on this adventure you’re also going to encounter the pains related to all of the overhead that was so nicely abstracted away. To say it another way, the more you chase this down the more likely the problem is to become larger and much more challenging.

As a general approach I’d recommend using your local computer as a orchestrater and running all of your workloads on remote instances. If you’re not using remote instances, the overhead of workbench is high and the benefit is low. That is more true on Windows. I have a tiny little laptop so I’m doing this with remote instances where $1 will allow me to run for hours. Aside from being easy and relatively cheap, I can also decide I want something more powerful and it only takes a few minutes to setup. I don’t have to worry about disk space or anything else about my local environment, and when I’m done I can just throw everything away (or commit it) and let AI workbench manage all the painful things on the next round.

I’ll be publishing a blog article about this for a related hackathon so I’ll provide those instructions once that goes live. Remind me if you don’t see an update by Thursday.

All that said, the WSL disk concern is a known issue that unfortunately has been outstanding for some time. Like the other link you might have some luck trying the things that have worked for others (but prepare for a likelihood of additional challenges with docker and/or workbench when trying these things):

github.com/microsoft/WSL

WSL 2 should automatically release disk space back to the host OS

opened 02:00PM - 20 Nov 19 UTC

dsmaher

feature wsl2

**Is your feature request related to a problem? Please describe.** I occasional…ly have workloads that consume large amounts of disk space for temporary use in `/tmp`. For example, transcoding large video files or processing large datasets exported from a database. Since WSL2 stores its filesystem on a `.vhdx`, that file grows when the dataset is processed and never releases that space. Also, it doesn't appear to re-use released disk space in the `.vhdx` when files are deleted... it seems to prefer to grow the file rather than re-use existing empty space. (I haven't explicitly tested that theory, but my workload deletes temporary files as they are used and I never had enough of them existing simultaneously to reach 250GB, but the size of the `.vhdx` eventually expanded to that size. When I'm done with that workload, I have to: ``` wsl --shutdown optimize-vhd -Path .\ext4.vhdx -Mode full ``` This is annoying since it basically means that my system backups include a huge `.vhdx` file that is mostly empty. Also, until about 2 weeks ago, it was consuming the entire remainder of my system disk. I had only 50GB free, which was enough to handle the workload, but since it continued to grow even after files were removed, that didn't matter. (I figured it was time to upgrade the disk size anyway) **Describe the solution you'd like** Automatic compaction of the `.vhdx`, or a way to do so while WSL2 is still running so that I can schedule it for frequent cleanups. **Describe alternatives you've considered** * Going back to WSL1 since it uses the host filesystem and is faster accessing that than WSL2. However, this alternative makes docker less useful. Docker on WSL2 is actually _almost_ perfect; not so much on WSL1. * Exporting/Importing WSL 2 to move it to another disk. Not even sure if this is possible, but if it is, it would _probably_ be fine for my desktop, but not really an option for my laptop. Not really worth the time either since that entire process would probably take about 30 minutes, but compacting is usually under 3.

twhitehouse · September 29, 2024, 8:10pm

Hi - thanks a lot for this feedback. looking forward to the blog.

I think we actually found a fairly simple solution for this problem with this user. If that’s the case we can post it here.

Also, would be great to chat with you to help us scope how to open things up for people to trouble shoot these kinds of issues on their own more easily.

If you want to chat, just email us at aiworkbench-ea@nvidia.com

Tyler

raghums534 · September 29, 2024, 9:39pm

Hi,
Do you mean AI workbench works better for GPUs located in remote locations by just spending 1$ rather than using the resources/location available locally?

twhitehouse · October 11, 2024, 6:29pm

This has been fixed in the 4.34.0 docker desktop release.

You can’t update to the fix and you have do a full uninstall and reinstall.

see release notes here: Release notes | Docker Docs

raghums534 · October 12, 2024, 10:27am

That is good to know, thank you for the update!!

Br,
Raghu

system · October 26, 2024, 10:28am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Application Error, Unable to install NVIDIA AI Workbench	20	846	March 31, 2025
Can i delete . cache file on my jetson nano to increase available space? Jetson Nano linux	17	1885	January 22, 2024
Speed Up Your AI Development: NVIDIA AI Workbench Goes GA Technical Blog	2	246	September 6, 2024
Unable to complete installment of NVIDIA AI Workbench NVIDIA AI Workbench workbench-v-02829	16	439	September 20, 2024
Ai Workbench, WSL, Sudo NVIDIA AI Workbench	8	542	June 28, 2024
Unable to install AI Workbench Desktop WSL2 v0.28.29 on Windows 11 (x86_64) NVIDIA AI Workbench workbench-install , workbench-podman , workbench-desktop-app , workbench-v-02829	8	1145	March 6, 2024
Installing Error Application Error: Something happened, please contact support or try restarting the application and installer NVIDIA AI Workbench	17	1060	June 13, 2024
Now Available: CUDA on WSL Public Preview CUDA on Windows Subsystem for Linux	31	4274	July 8, 2020
Project never finishes building nor can I stop it from building NVIDIA AI Workbench nvidia-ai-workbench , workbench-desktop-app , workbench-example-project	7	341	September 16, 2024
Local compute unavailable NVIDIA AI Workbench	21	409	August 2, 2024

Disk space not free even after removing the project from AI workbench

Related topics