CUDA 13.2 DGX Spark impact

adi-sonusflow · March 11, 2026, 6:28pm

My recap of what CUDA 13.2 brings that matters for GB10:

The Big Wins for DGX Spark / SM121

cuBLASLt: NVFP4 and MXFP8 performance improvements on DGX Spark. This is the headline item: cuBLASLt now delivers up to 3× performance improvement for NVFP4 and MXFP8 data types on DGX Spark systems for large M and N problem sizes. Also, cuBLASLt’s experimental Grouped GEMM API now supports MXFP8 inputs on GPUs with Compute Capability 10.x and 11.0.

Critical bug fix: A cublasLtMatmul issue that could lead to incorrect results when running concurrently with another kernel that uses Tensor Memory has been fixed. This affected Compute Capability 10.x and 11.x since cuBLAS 12.8. It could be related to the quality degradation people were seeing.

CUDA Tile — Now on SM120/SM121

CUDA Tile is now supported on compute capability 8.X (Ampere and Ada), as well as 10.X and 12.X architectures (Blackwell). This is the new tile-based programming model NVIDIA introduced in CUDA 13.1. cuTile Python (the Python DSL) now supports recursive functions, closures, custom reductions, and enhanced array slicing. This could eventually become the cleaner path to writing optimized NVFP4 kernels for SM121 vs the current CUTLASS patch-and-pray approach.
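To make the architecture list above easier to check against a given target, here is a minimal Python sketch. The helper names (`parse_sm`, `tile_listed`) are made up for illustration, and the set of major versions only mirrors the families named in this recap (8.x, 10.x, 12.x). It is not an authoritative support matrix, so a `False` result just means the family is not mentioned above.

```python
import re

# Compute capability majors named in the recap above (8.x, 10.x, 12.x).
# Assumption: this mirrors the quoted sentence only, not NVIDIA's full matrix.
TILE_MAJORS = {8, 10, 12}

def parse_sm(name):
    """Turn an architecture name like 'sm_121', 'SM120' or 'sm_100a' into (major, minor)."""
    match = re.fullmatch(r"sm_?(\d{2,3})[a-z]?", name.strip(), flags=re.IGNORECASE)
    if match is None:
        raise ValueError(f"not an SM architecture name: {name!r}")
    digits = match.group(1)
    # The last digit is the minor version, everything before it is the major.
    return int(digits[:-1]), int(digits[-1])

def tile_listed(name):
    """True if the recap lists this architecture family for CUDA Tile in 13.2."""
    major, _minor = parse_sm(name)
    return major in TILE_MAJORS

if __name__ == "__main__":
    for arch in ("sm_86", "sm_100", "SM120", "SM121"):
        print(arch, parse_sm(arch), tile_listed(arch))
```

For the Spark, `tile_listed("SM121")` comes out true because GB10 reports compute capability 12.1.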

Unified Tegra + Desktop Toolkit

CUDA 13.2 delivers a single unified toolkit for Tegra and desktop GPUs, reducing overhead for containers and libraries. This is relevant for DGX Spark since GB10 is an aarch64 Tegra-derived SoC: fewer divergences between the Tegra and desktop CUDA paths means less chance of hitting SM121-specific bugs that only appear on the Spark.

Other Notable Items

PTX ISA 9.2 — new PTX features, worth checking if there are any SM121-specific instruction improvements.

Compiler: support for new host compilers including VS 2026, plus improved nvcc host compilation support on aarch64 systems, including fixes for ARM Neon intrinsics when using newer GCC versions.

CUDA_DISABLE_PERF_BOOST env var added: lets you disable GPU power state boosting, which can be useful for power management, e.g. in a rack enclosure.
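If you want to try the variable for a single job rather than system-wide, one option is to set it only in the environment of the child process. This is a rough sketch: the helper name is hypothetical, and the value "1" is an assumption on my part, so check the 13.2 release notes for the accepted values.

```python
import os
import subprocess
import sys

def env_with_perf_boost_disabled(base_env=None, value="1"):
    """Return a copy of the environment with CUDA_DISABLE_PERF_BOOST set.

    Assumption: "1" is the value that turns the behaviour on; the recap above
    only names the variable, not the values it accepts.
    """
    env = dict(os.environ if base_env is None else base_env)
    env["CUDA_DISABLE_PERF_BOOST"] = value
    return env

if __name__ == "__main__":
    # Stand-in child process that just echoes the variable back; replace the
    # command with your actual CUDA workload.
    child = subprocess.run(
        [sys.executable, "-c",
         "import os; print(os.environ['CUDA_DISABLE_PERF_BOOST'])"],
        env=env_with_perf_boost_disabled(),
        capture_output=True,
        text=True,
        check=True,
    )
    print(child.stdout.strip())
```

Because the helper copies the environment instead of mutating `os.environ`, other jobs launched from the same script keep the default boost behaviour.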

Is it worth upgrading to CUDA 13.2?
What do you guys think?

jwarner · March 11, 2026, 8:33pm

Definitely look forward to upgrading - once it arrives in the official system updates.

chrm · March 12, 2026, 8:30pm

Dear @aniculescu /NVIDIA,
I wonder when we will be able to install and use CUDA 13.2 (CUDA Toolkit 13.2 - Release Notes), or whether our GB10 devices (e.g., DGX Spark) will be upgraded to CUDA 13.2 automatically, since this version contains multiple DGX Spark improvements.
Many thanks!

Digital_David · March 19, 2026, 5:35pm

I hope this gets released very soon. Here is what nvcc --version reports after today’s latest update:

nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2025 NVIDIA Corporation
Built on Wed_Aug_20_01:57:39_PM_PDT_2025
Cuda compilation tools, release 13.0, V13.0.88
Build cuda_13.0.r13.0/compiler.36424714_0
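For anyone who wants to script this check after each system update, here is a small Python sketch that pulls the release number out of `nvcc --version` output and compares it against 13.2. The function names are my own, and the sample text is just the output pasted above.

```python
import re

def parse_nvcc_release(version_text):
    """Return (major, minor) from `nvcc --version` output, or None if not found."""
    match = re.search(r"release (\d+)\.(\d+)", version_text)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))

def meets_minimum(version_text, minimum=(13, 2)):
    """True if the reported toolkit release is at least `minimum`."""
    release = parse_nvcc_release(version_text)
    return release is not None and release >= minimum

# Output copied from the post above.
SAMPLE = """nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2025 NVIDIA Corporation
Built on Wed_Aug_20_01:57:39_PM_PDT_2025
Cuda compilation tools, release 13.0, V13.0.88
Build cuda_13.0.r13.0/compiler.36424714_0
"""

if __name__ == "__main__":
    print(parse_nvcc_release(SAMPLE))  # (13, 0)
    print(meets_minimum(SAMPLE))       # False
```

To check a live system, pass in the `stdout` of `subprocess.run(["nvcc", "--version"], capture_output=True, text=True)` instead of `SAMPLE`.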

INJurer · March 21, 2026, 3:27pm

Just saying “hi” while waiting with everyone else for the official GB10 update with CUDA 13.2!

AoE · March 22, 2026, 9:24pm

The community docker is using nvidia/cuda:13.2.0-devel-ubuntu24.04 as a base now.

INJurer · March 29, 2026, 4:12pm

What if you run something outside a docker container? Any official way to update to 13.2 on the Spark?

cosinus · March 29, 2026, 4:36pm

The “official” and safe way is to wait until it gets approved for use with the DGX Spark/GB10 platform. Otherwise you might end up with a broken system, or at least an impaired one.

If you’re comfortable working with Linux systems, you can install “bleeding edge” drivers, provided you know how to quickly roll back to the old driver even without a working console (fallback via SSH).

Nevertheless, you should always have a backup on hand. ;-)

INJurer · March 29, 2026, 4:40pm

Nah, I may feel comfortable, but I still don’t wanna risk everything I have on my Sparks. They are too precious. :)
