cuPQC SDK 0.4.1 crashes on DGX Spark (GB10, sm_121) — Does Blackwell support exist?

cs24dp006 · February 23, 2026, 12:37pm

Hi everyone,

I’m hoping someone here can help me — I’ve been stuck on this for a few days and I’m running out of ideas.

The short version

I’m trying to use cuPQC SDK 0.4.1 on an NVIDIA DGX Spark for a GPU-accelerated PQC project. The library loads fine and CUDA initializes, but the program crashes with double free or corruption the moment the first PQC function is called. After a lot of debugging, I found out the GB10 GPU has compute capability sm_121 — which isn’t in the cuPQC SDK’s supported list (it goes up to sm_90).

My system

Machine: NVIDIA DGX Spark (Rev A.7)
GPU: NVIDIA GB10 — nvidia-smi reports compute cap 12.1
CPU: ARM aarch64 (Cortex-X925 / A725)
Host CUDA: 13.1, Driver 580.95.05
cuPQC SDK: 0.4.1 (aarch64)
Docker container: nvidia/cuda:12.8.0-devel-ubuntu22.04

What I see

Loaded cuPQC library from: /opt/cupqc-lib/libcupqc_wrapper.so
CUDA initialized
double free or corruption (!prev)
Exited with code 133 (SIGABRT)

This happens on the very first call to cupqc_kem_keypair() — right after cudaSetDevice(0) succeeds. No kernel output, just a crash.

Root cause I identified

The cuPQC SDK 0.4.1 precompiles its internal libraries (cupqc-pk_static, cupqc-hash_static) using LTO code for architectures sm_70 through sm_90. The GB10 is sm_121, which has no native or PTX code path in the SDK. So when the kernel tries to launch, it either picks the wrong code or fails to find any.

Related: cuPQC examples fail to compile on Jetson Orin Nano — similar pattern of cuPQC being incompatible with specific platforms.

What I’ve tried

Updated CUDA base image in Docker from 12.6.2 → 12.8.0 in Dockerfile (SDK requires 12.8+)
Added PTX fallback to CMakeLists.txt:

--generate-code=arch=compute_90,code=compute_90

This embeds sm_90 PTX in our wrapper so CUDA might JIT it for sm_121. Waiting to confirm if this works with the cuPQC LTO internals.

Confirmed aarch64 SDK matches machine architecture ✓

My questions for the community / NVIDIA team

Has anyone successfully run cuPQC SDK on a Blackwell GPU (sm_100, sm_121)? What did you do?
Does embedding sm_90 PTX in the wrapper help? Or will the cuPQC LTO libraries still fail to JIT on sm_121?
Is there a newer SDK version being worked on with Blackwell support?
Any other workaround you’d recommend for getting GPU-accelerated PQC running on a DGX Spark?

Thanks in advance — any help is hugely appreciated!

sreeves · February 24, 2026, 6:12pm

Hello there,

You are trying to execute the examples for 0.4.1, correct?

Can you try editing line 2 the makefile for the public_key examples and change arch=native to arch= and see if that changes things?

I don’t think you need to build the example to PTX and then JIT.

Thanks!

cs24dp006 · February 26, 2026, 7:26am

Hello,

Yes, we were trying to build a custom C++ wrapper for the 0.4.1 PQC SDK to deploy as a Python extension.

You are completely correct! Setting arch= (or explicitly setting arch=sm_90) solved the issue perfectly. Since nvcc in CUDA 12.8 doesn’t officially recognize compute_121 yet, using native on the Blackwell DGX was causing the compiler to fail because it couldn’t find the 121 specific targets. By dropping the architecture flag (and letting LTO default to Hopper sm_90), the linker successfully assembled the .cubin and the Blackwell driver executed the JIT flawlessly at runtime via backwards compatibility.

Thanks for your help!

sreeves · February 26, 2026, 4:42pm

Hi,

Great, I am glad it worked out!

Topic		Replies	Views
Dearest CUTLASS TEAM, When the hell are you going to properly fix tcgen05 FP4 support for DGX Spark / GB10 (SM121)? DGX Spark / GB10	3	206	February 4, 2026
cuPQC examples fail to compile on Jetson Orin Nano cuPQC jetson	4	98	October 25, 2025
Does the spark support `tcgen05`? DGX Spark / GB10	2	174	December 10, 2025
DGX Spark: CUDA Install Pitfalls on Ubuntu 24.04 (ARM64) – FIXED DGX Spark / GB10	4	796	November 4, 2025
To NVIDIA Staff: Stop leeching off community developers, Get your act together and start shopping fixes the broken VLLM & TensorLLM Packes DGX Spark / GB10	3	177	January 29, 2026
GB10 and Docling DGX Spark / GB10	2	174	February 17, 2026
Confidential computing support for DGX Spark / GB10 DGX Spark / GB10	4	353	October 16, 2025
CUDA Toolkit 12.8 what GPU is 'sm_120'? CUDA NVCC Compiler	8	9196	February 8, 2025
Julia CUDA on DGX Spark DGX Spark / GB10 cuda , cublas , cusolver , cufft , cusparse , curand	2	180	October 27, 2025
Hardware engineering-focused article on GB10 and resulting software implications DGX Spark / GB10	2	156	February 20, 2026

cuPQC SDK 0.4.1 crashes on DGX Spark (GB10, sm_121) — Does Blackwell support exist?

Related topics