System 1:
- DGX Spark
- Connected through NVIDIA Sync
System 2:
- Windows 11
- NVIDIA RTX 3070
Issue:
On system 1, my code below crashes with a segmentation fault:
python numba-test.py
…/.venv/lib/python3.12/site-packages/numba/cuda/dispatcher.py:536: NumbaPerformanceWarning: Grid size 4 will likely result in GPU under-utilization due to low occupancy.
warn(NumbaPerformanceWarning(msg))
Segmentation fault (core dumped)
On system 2, the code runs fine:
[2. 2. 2. ... 2. 2. 2.]
…
Code:
# %%
from numba import cuda
import numpy as np

@cuda.jit
def vector_add(a, b, c):
    idx = cuda.grid(1)
    if idx < a.size:
        c[idx] = a[idx] + b[idx]

N = 1024
a = np.ones(N, dtype=np.float32)
b = np.ones(N, dtype=np.float32)
c = np.zeros_like(a)

threads_per_block = 256
blocks_per_grid = (N + threads_per_block - 1) // threads_per_block

vector_add[blocks_per_grid, threads_per_block](a, b, c)
cuda.synchronize()

print(c)
Question/Help With:
The only thing I can think of at this hour is that the DGX Spark has unified memory, but that really shouldn't cause this sort of problem. I've also never used Numba before, so it could be some strange user error on my end. Was hoping to see if anyone else has run into this, or if it's a known issue.
I was going to try CuPy next, but as a last resort I will fall back to the more traditional route and write a CUDA/C++ kernel bound with pybind11.