Jetson AGX Orin don't work cuda with pytorch

sznt.norbi · August 7, 2024, 1:34pm

Hello.

I installed the prebuild pytorch version on the Jetson. The jetpack version is 6.0.
The cuda version

nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2023 NVIDIA Corporation
Built on Tue_Aug_15_22:08:11_PDT_2023
Cuda compilation tools, release 12.2, V12.2.140
Build cuda_12.2.r12.2/compiler.33191640_0

The pytorch have cuda support

Python 3.10.12 (main, Jul 29 2024, 16:56:48) [GCC 11.4.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import torch
>>> torch.cuda.is_available()
True

And the torch version

$ pip list | grep torch
torch 2.3.0
torchvision 0.18.0a0+6043bc2

i have a sample program:

from torchvision.io.image import read_image
from torchvision.models.detection import fasterrcnn_resnet50_fpn_v2, FasterRCNN_ResNet50_FPN_V2_Weights
from torchvision.utils import draw_bounding_boxes
from torchvision.transforms.functional import to_pil_image
import torch

device = torch.device('cuda:0' if torch.cuda.is_available() else 'cpu')
print(f"Using device: {device}")

img = read_image("demo.jpg")

# Step 1: Initialize model with the best available weights
weights = FasterRCNN_ResNet50_FPN_V2_Weights.DEFAULT
model = fasterrcnn_resnet50_fpn_v2(weights=weights, box_score_thresh=0.9).to(device)
model.eval()

# Step 2: Initialize the inference transforms
preprocess = weights.transforms()

# Step 3: Apply inference preprocessing transforms
batch = [preprocess(img).to(device)]

# Step 4: Use the model and visualize the prediction
prediction = model(batch)[0]
labels = [weights.meta["categories"][i] for i in prediction["labels"]]
box = draw_bounding_boxes(img, boxes=prediction["boxes"],
                          labels=labels,
                          colors="red",
                          width=4)
im = to_pil_image(box.detach())
im.save("out.jpg")
# save_image(im, "out.img")
# im.show()

and the output of the program is

python test.py 
Using device: cuda:0
/home/tkp/.local/lib/python3.10/site-packages/torch/nn/modules/conv.py:456: UserWarning: Plan failed with a cudnnException: CUDNN_BACKEND_EXECUTION_PLAN_DESCRIPTOR: cudnnFinalize Descriptor Failed cudnn_status: CUDNN_STATUS_NOT_SUPPORTED (Triggered internally at /opt/pytorch/aten/src/ATen/native/cudnn/Conv_v8.cpp:919.)
  return F.conv2d(input, weight, bias, self.stride,

Thanks for the help.

AastaLLL · August 8, 2024, 4:26am

Hi,

Thanks for sharing this issue.
We will give it a try and provide more info to you later.

Thanks.

AastaLLL · August 8, 2024, 5:15am

Hi,

Confirmed that we can reproduce the same error:

$ python3 test.py
Using device: cuda:0
Downloading: "https://download.pytorch.org/models/fasterrcnn_resnet50_fpn_v2_coco-dd69338a.pth" to /home/nvidia/.cache/torch/hub/checkpoints/fasterrcnn_resnet50_fpn_v2_coco-dd69338a.pth
100.0%
/home/nvidia/.local/lib/python3.10/site-packages/torch/nn/modules/conv.py:456: UserWarning: Plan failed with a cudnnException: CUDNN_BACKEND_EXECUTION_PLAN_DESCRIPTOR: cudnnFinalize Descriptor Failed cudnn_status: CUDNN_STATUS_NOT_SUPPORTED (Triggered internally at /opt/pytorch/aten/src/ATen/native/cudnn/Conv_v8.cpp:919.)
  return F.conv2d(input, weight, bias, self.stride,
/home/nvidia/.local/lib/python3.10/site-packages/torchvision/utils.py:210: UserWarning: boxes doesn't contain any box. No box was drawn
  warnings.warn("boxes doesn't contain any box. No box was drawn")

Based on the discussion here, the issue comes from PyTorch 2.3.0 but is fixed in 2.3.1.

When testing with 24.07 our PyTorch container, the error doesn’t appear.
Please give it a try.

$ sudo docker run -it --rm --runtime nvidia --network host -v /home/nvidia/topic_302606:/home/nvidia/topic_302606 nvcr.io/nvidia/pytorch:24.07-py3-igpu
# python3 test.py
Using device: cuda:0
Downloading: "https://download.pytorch.org/models/fasterrcnn_resnet50_fpn_v2_coco-dd69338a.pth" to /root/.cache/torch/hub/checkpoints/fasterrcnn_resnet50_fpn_v2_coco-dd69338a.pth
100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 167M/167M [00:05<00:00, 30.8MB/s]
/usr/local/lib/python3.10/dist-packages/torchvision/utils.py:211: UserWarning: boxes doesn't contain any box. No box was drawn
  warnings.warn("boxes doesn't contain any box. No box was drawn")

Thanks.

sznt.norbi · August 9, 2024, 3:22pm

Thank for the answer that works well.

system · September 16, 2024, 6:12am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Cannot install Pytorch 2.x with CUDA support Jetson Orin Nano cuda , pytorch	9	893	September 2, 2024
Error flashing Jetson Orin including Cuda Jetson AGX Orin reflash , cuda	9	1112	July 11, 2022
Incompatible torch2.2+Cuda12.2 wheel with other python libraries for AGX Orin Jetpack6.0 Jetson AGX Orin cuda , pytorch	9	1769	May 17, 2024
Nvcc is not installed in jetson orin nano although nvidia-toolkit is installed Jetson Nano cuda , ubuntu , jetson-inference	6	99	January 14, 2025
Jetson Nano Torch 1.6.0 PyTorch Vision v0.7.0-rc2 Runtime Error Jetson Nano pytorch	4	1439	October 18, 2021
Torchvision on Jetson AGX Orin DevKit Jetson AGX Orin pytorch	7	3039	June 2, 2023
Install Pytorch with cuda on Jetson Orin nano Devloper Kit Jetson Orin Nano cuda , pytorch	13	2850	July 30, 2024
Error RuntimeError: CUDA error: no kernel image is available for execution on the device when doing != operation on Jetson orin agx Jetson AGX Orin cuda	7	380	February 10, 2025
Jetson orin nano Cuda Cudnn torch torchauido torchvision Jetson Orin Nano cuda	11	836	May 27, 2024
Pytorch for jetson nano orin does not work Jetson Orin Nano pytorch	5	1393	August 22, 2023

Jetson AGX Orin don't work cuda with pytorch

Related topics