This error is typically caused by a low-level memory allocation failure on the Jetson due to resource constraints or memory fragmentation, which then triggers the crash in PyTorch's memory manager (CUDACachingAllocator). You could either reduce host memory usage or convert the PyTorch model to TensorRT, which is specifically designed to run on the Jetson with maximum efficiency and minimal memory overhead.
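A common way to do this conversion (a sketch only; the model name, input, and file paths here are placeholders, and your export call will need your model's actual example input) is to export the model to ONNX and then build a TensorRT engine with `trtexec`, which ships with JetPack:

```shell
# Step 1 (in Python, on any machine): export the model to ONNX, e.g.
#   torch.onnx.export(model, example_input, "model.onnx")
#
# Step 2 (on the Jetson): build a TensorRT engine from the ONNX file.
# trtexec is installed with JetPack at this standard location.
/usr/src/tensorrt/bin/trtexec \
    --onnx=model.onnx \
    --saveEngine=model.engine \
    --fp16   # FP16 further reduces memory footprint on Jetson GPUs
```

The resulting `model.engine` can then be loaded with the TensorRT runtime instead of PyTorch, which avoids the CUDACachingAllocator entirely.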
This issue is related to the r36.4.7 update and does not reproduce on every run.
You might see either 'cudaMalloc failed: out of memory' or 'unable to allocate CUDA0 buffer'.
The underlying error for both cases is (in the Ollama log):
NvMapMemAllocInternalTagged: 1075072515 error 12
Our internal team is working on the issue.
We will share more information with you later.
Thank you all for the testing and sharing.
We are really sorry about the inconvenience that r36.4.7 brings.
Although our internal team is still working on the issue, here are some updates about the issue that we can share with you:
The recent updates (r38.2.1->r38.2.2, r36.4.4->r36.4.7, r35.6.2->r35.6.3) contain a security fix for CVE-2025-33182 & CVE-2025-33177:
…
The security fix adds a mechanism to prevent the allocation from going into the OOM path (to prevent a denial of service attack).
This introduces some limits on the amount of memory that can be allocated.
We are discussing how to minimize the impact of this security fix.
Will keep you all updated on the latest status.