Example project that works on a consumer-sized card (or cards)?


The AI Workbench example READMEs document the memory requirements, which is awesome. It is less awesome that none of the examples run on a single consumer card.

  1. Can someone point me at a project that fits in 8GB, 12GB, 16GB, or 20GB of VRAM?
  2. Are there any examples that run across multiple consumer cards in one machine?

Here is a full list of the example projects we offer.

The fine-tuning projects generally require more GPU memory and compute, so it may be difficult to run those on a small consumer card (especially the ones using the NeMo Framework). You may be able to fit the Mistral fine-tuning project as long as you run at lower quantization, e.g. 4-bit. Additionally, the SDXL customization project may be able to run on a smaller consumer card as well, e.g. 16GB or lower.

The Hybrid RAG project does not require a GPU if running with cloud endpoints; if you would like to run inference with a model locally, you can quantize a smaller model down to 4 bits and fit it onto a 16GB or lower card.
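As a rough illustration of why 4-bit quantization makes this possible, here is a back-of-the-envelope VRAM estimate. This is only a sketch: the ~20% overhead factor for activations and KV cache is an assumption, and real usage varies by framework, model, and context length.

```python
def estimated_vram_gb(n_params_billion: float, bits: int, overhead: float = 1.2) -> float:
    """Rough inference VRAM estimate: weight memory plus ~20% overhead
    for activations/KV cache (assumed factor; varies in practice)."""
    weight_gb = n_params_billion * bits / 8  # 1B params at 8 bits ~= 1 GB
    return weight_gb * overhead

# A 7B-parameter model at different quantization levels:
for bits in (16, 8, 4):
    print(f"{bits}-bit: ~{estimated_vram_gb(7, bits):.1f} GB")
```

By this estimate a 7B model needs roughly 17GB at 16-bit but only around 4GB at 4-bit, which is why quantized models fit comfortably on 12GB or 16GB consumer cards.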

The data science projects should be able to use smaller GPUs to accelerate libraries like pandas and sklearn.
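For the data science projects, GPU acceleration is typically a drop-in change; for example, RAPIDS `cudf.pandas` can accelerate an unmodified pandas script on a supported GPU. A minimal sketch (the dataset and columns here are made up for illustration):

```python
# This runs on plain pandas on CPU. On a machine with a RAPIDS-capable
# GPU, the same unmodified script can be GPU-accelerated by launching:
#   python -m cudf.pandas this_script.py
import pandas as pd

df = pd.DataFrame({
    "gpu": ["RTX 3060", "RTX 4080", "RTX 3060", "RTX 4090"],
    "vram_gb": [12, 16, 12, 24],
})
summary = df.groupby("gpu")["vram_gb"].max()
print(summary)
```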

You can assign multiple GPUs to a project under Environment > Hardware > GPUs. For many examples the default is 1, but this can be adjusted to assign more GPUs to a project at runtime.

Thanks. I was looking at the projects at this link, NVIDIA AI Workbench Examples, which look to be the same projects.

All but one require 40GB or 80GB of VRAM. The RAG example could run in 12GB but probably needs at least 24GB.

Would a 40GB model run on a pair of 24GB cards? I don’t understand when it has to fit on one card.

Some README specs may need updating, but in general they reflect recommended rather than minimum requirements.

If working on a multi-GPU system, you can attach more GPUs to a project under Environment > Hardware. They should show up under nvidia-smi inside the project container, and you can use them to run models as on any other multi-GPU system.
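To the earlier question about a 40GB model on two 24GB cards: inference frameworks can shard a model layer-by-layer across GPUs (for example, Hugging Face Accelerate's `device_map="auto"` does this automatically), so the whole model does not have to fit on one card. A toy sketch of the greedy placement idea; the layer sizes and per-card headroom below are illustrative, not measured:

```python
def place_layers(layer_sizes_gb, gpu_budgets_gb):
    """Greedily assign consecutive layers to GPUs until each fills up.
    Toy model of layer-wise sharding; real frameworks also account for
    activations and reserve extra headroom."""
    placement, gpu = [], 0
    free = list(gpu_budgets_gb)
    for size in layer_sizes_gb:
        while gpu < len(free) and free[gpu] < size:
            gpu += 1  # current GPU is full, spill to the next one
        if gpu == len(free):
            raise MemoryError("model does not fit across the given GPUs")
        free[gpu] -= size
        placement.append(gpu)
    return placement

# 40 layers of 1 GB each (~40 GB of weights) across two 24 GB cards,
# keeping ~4 GB of headroom per card for activations:
plan = place_layers([1.0] * 40, [20.0, 20.0])
print(plan.count(0), "layers on GPU 0,", plan.count(1), "layers on GPU 1")
```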

I was able to run the Hybrid RAG project locally on my gaming card using the default ungated model.

Project: GitHub - NVIDIA/workbench-example-hybrid-rag: An NVIDIA AI Workbench example project for Retrieval Augmented Generation (RAG)

