NVIDIA PyTorch + CUDA wheel produces only NaNs on CPU

Hi,

I’m seeing something odd on my Jetson Orin Nano 4GB (it’s actually a Jetson AGX Orin Developer Kit running the emulated Orin Nano 4GB image).

I installed the NVIDIA PyTorch wheel (I tried 1.12 and 1.11, from here). Running with CUDA works great, but if I try to run on the CPU with this version of PyTorch, inference is very slow (~3 s per inference on a 224x224 image) and the entire output tensor is NaNs.

If I install the PyTorch wheel from PyPI instead, I get reasonable times (~0.05 s/inference) and results comparable to what I get with CUDA. However, the PyPI wheel for aarch64 isn’t built with CUDA, so I have to switch venvs to test CPU vs. CUDA, which feels awkward.
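For reference, this is roughly how I’m timing a single CPU inference (it loads the TorchScript model I describe below; the 224x224 input size matches my test, and the warm-up run and timing are just illustrative):

import time
import torch

m = torch.jit.load("mobilenetv2.pt")
m.eval()

x = torch.rand((1, 3, 224, 224))
with torch.no_grad():
    m(x)  # warm-up run so one-time JIT work doesn't skew the measurement
    start = time.time()
    out = m(x)
print(f"inference took {time.time() - start:.3f} s")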

Things I have tried:

  • upgrading numpy
  • different pytorch versions

I’m running a TorchScript MobileNetV2 model, generated like this:

import torch
from torchvision.models import mobilenet_v2

# MobileNetV2 with randomly initialized weights
net = mobilenet_v2()
torch.save(net, "mobilenetv2.pth")

# export the same model as TorchScript
script_module = torch.jit.script(net)
torch.jit.save(script_module, "mobilenetv2.pt")

Is this expected behavior? Any ideas of things to try?

Hi,

It looks like you are using Nano rather than Orin Nano.
I’m moving your topic to the Nano board first.

Sorry, my bad.
You are using Orin Nano. Moving your topic back.

Thanks.

Hi,

Could you share the JetPack version you used?
Since we have PyTorch 1.13 for JetPack 5.0.2, we recommend giving it a try:
https://developer.download.nvidia.com/compute/redist/jp/v502/pytorch/

Thanks.

Hi, I’m using JetPack 5.0.2-b231 (installed using apt).

I just tried the PyTorch 1.13 wheel you linked (I used torch-1.13.0a0+08820cb0.nv22.07-cp38-cp38-linux_aarch64.whl), and I’m seeing the same result.
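For reference, I double-checked which build is actually loaded with something like:

import torch

# should report 1.13.0a0+08820cb0.nv22.07 for the NVIDIA wheel,
# plus the CUDA version it was built against
print(torch.__version__)
print(torch.version.cuda, torch.cuda.is_available())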

Hi,

Thanks for your testing.

Could you share a simple source and model that can reproduce the NaN output?
We want to reproduce this issue in our environment first.

Thanks.

Sure! The model is generated using:

import torch
from torchvision.models import mobilenet_v2

net = mobilenet_v2()
torch.save(net, "net.pth")

script_module = torch.jit.script(net)
torch.jit.save(script_module, "mobilenetv2.pt")

And I’m running it using:

import torch
import numpy
import random

use_cuda = False  # flip to True to run the same model on the GPU

torch.manual_seed(0)

# load the exported TorchScript model
m = torch.jit.load("mobilenetv2.pt")
if use_cuda:
    m.cuda()
m.eval()

# single inference on a random 224x224 input,
# with autograd anomaly detection enabled
with torch.autograd.set_detect_anomaly(True):
    input_t = torch.rand((1, 3, 224, 224))
    if use_cuda:
        input_t = input_t.cuda()
    out = m(input_t)
    print(out)
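
To rule out a printing artifact, I also check the tensor directly; a quick check like the following reports that every element is NaN on the CPU run:

# confirm the output really is all NaN rather than a display issue
print(torch.isnan(out).all().item())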

I haven’t used TorchScript much, so please let me know if I’m making any obvious errors.

Thanks!

Hi,

Thanks for sharing.
We will reproduce this issue in our environment first.

Hi,

Thanks for your patience.

We confirmed that we can reproduce the same issue in our environment
(JetPack 5.0.2 + l4t-pytorch:r35.1.0-pth1.12-py3 container).

We are checking this issue with our internal team.
Will share more information with you later.

Thanks.

Hi,

Is there any update? I also get this behaviour with the latest JetPack.

Hi,

The CPU inference issue is still under investigation.
Since we expect users to run the model on GPU, the priority of this issue is relatively low.

Thanks.

Hi,

Here is an update on this issue.
Due to limited resources, we will focus on GPU-mode support in the prebuilt PyTorch package.

Thanks.