nvFP4 training - Playbook request

Could you please create a playbook for stable nvFP4 training using Transformer Engine? A basic template would be very useful. Thanks!
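
To make the ask concrete, here is the rough shape of the template we're hoping for. This is only a sketch under assumptions: the NVFP4 recipe class name below is a guess on my part (check the transformer_engine.common.recipe module in your TE build for the recipe it actually exposes), while the fp8_autocast pattern around it is the standard TE low-precision flow.

```python
# Sketch of the template we're after (PyTorch + Transformer Engine).
# ASSUMPTION: the NVFP4 recipe class name may differ per TE release;
# look in transformer_engine.common.recipe for what your build exposes.
import torch
import transformer_engine.pytorch as te
from transformer_engine.common import recipe

# Assumed NVFP4 recipe; on builds without it, an FP8 recipe such as
# recipe.DelayedScaling(fp8_format=recipe.Format.HYBRID) is the usual fallback.
low_precision_recipe = recipe.NVFP4BlockScaling()  # assumption: name may vary

# Toy model built from TE modules so the matmuls run under the recipe.
model = torch.nn.Sequential(
    te.Linear(1024, 4096),
    te.Linear(4096, 1024),
).cuda()
optimizer = torch.optim.AdamW(model.parameters(), lr=3e-4)

x = torch.randn(8, 1024, device="cuda")
target = torch.randn(8, 1024, device="cuda")

for step in range(10):
    # fp8_autocast is TE's low-precision context manager; it also drives
    # the newer block-scaled recipes despite the "fp8" in its name.
    with te.fp8_autocast(enabled=True, fp8_recipe=low_precision_recipe):
        out = model(x)
    loss = torch.nn.functional.mse_loss(out, target)
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()
```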


They are very likely not going to bother.

Why not?

Sorry, I’m a bit frustrated with the slow pace of support for the Spark.

I have passed this request along to the playbook team.

@vgoklani Can you tell me which model you want to train, and why you want to use the GB10 for training instead of fine-tuning?

Sure, we want to pre-train Andrej Karpathy’s nanoChat model using nvFP4.

The repo is here:

This is the best use case for the DGX Spark: the models are small (~500M parameters), which gives us a playground to test different model types before deploying to a cloud instance for a larger training run. The kind of integration we have in mind is sketched below.
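
For illustration, the integration we imagine is small, since nanoChat's projections are plain nn.Linear modules. A rough, untested sketch (the swap_linears helper is hypothetical, my own construction rather than anything in nanoChat or TE):

```python
# Hypothetical helper (not part of nanoChat): recursively replace nn.Linear
# modules with te.Linear so TE can run the matmuls under a low-precision recipe.
import torch
import torch.nn as nn
import transformer_engine.pytorch as te

def swap_linears(module: nn.Module) -> None:
    """Replace every nn.Linear in `module` with an equivalent te.Linear."""
    for name, child in module.named_children():
        if isinstance(child, nn.Linear):
            replacement = te.Linear(
                child.in_features,
                child.out_features,
                bias=child.bias is not None,
            )
            # Carry over the trained (or initialized) parameters.
            with torch.no_grad():
                replacement.weight.copy_(child.weight)
                if child.bias is not None:
                    replacement.bias.copy_(child.bias)
            setattr(module, name, replacement)
        else:
            swap_linears(child)
```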

FYI, there are several threads in the nanoChat GitHub Discussions where users are training on a DGX Spark. This playbook would be incredibly helpful to a lot of people.

Thanks!

Thanks for the info. A playbook for nanoChat is already in the works and will be published soon. We will also post an update on the forum, so stay tuned.

Will it use nvFP4? That’s the whole point of this request!!!

It will not use nvFP4; the nanoChat repo does not appear to offer nvFP4 support.

The point of this exercise is to train something in nvFP4, and we are proposing nanoChat since it’s a small model and a good base case. And there is clearly demand (just look at the super-long threads in the nanoChat discussion forum).

We don’t care about nanoChat or the nanoChat playbook; the goal is to pre-train in nvFP4, and that is the request. It sounds like we are not communicating properly.

Sorry for the confusion. There is no plan to build an nvFP4 training playbook yet. However, I have passed the request along, and the team will evaluate and prioritize it in the future.