Issue with onnxruntime when using CUDAExecutionProvider

Hello,

I am working on inference with an ONNX model. The code runs successfully on my laptop, but I have an issue when running it on a Jetson.

My environment:
- Jetson AGX Orin
- JetPack 5.1.4
- onnx 1.17.0
- onnx-graphsurgeon 0.3.12
- onnxruntime-gpu 1.16.3 (installed using the wheel from the Jetson Zoo)
- CUDA 11.4

Issue:
I checked the available providers, and all of them are listed: ['TensorrtExecutionProvider', 'CUDAExecutionProvider', 'CPUExecutionProvider'].

When I set the provider to 'CPUExecutionProvider', it works normally. However, when I switch to 'CUDAExecutionProvider', it gets stuck at the line "outputs = ort_sess.run(None, ort_inputs)" and only shows the inference results 2-3 minutes later. While it is stuck, I check the GPU status with "tegrastats", which shows the GPU load at around 99%.
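For reference, here is a minimal sketch of how I set up the session (the model path and the input shape are placeholders, not my actual files):

```python
import time
import numpy as np
import onnxruntime as ort

# Confirm which execution providers this build of onnxruntime offers.
print(ort.get_available_providers())
# ['TensorrtExecutionProvider', 'CUDAExecutionProvider', 'CPUExecutionProvider']

# "model.onnx" is a placeholder for my object detection model.
ort_sess = ort.InferenceSession("model.onnx",
                                providers=["CUDAExecutionProvider"])

# Dummy input; the 1x3x640x640 shape is an example, not my real input.
inp = ort_sess.get_inputs()[0]
ort_inputs = {inp.name: np.random.rand(1, 3, 640, 640).astype(np.float32)}

# This is the call that hangs for 2-3 minutes on the Jetson with
# CUDAExecutionProvider, while tegrastats shows ~99% GPU load.
start = time.time()
outputs = ort_sess.run(None, ort_inputs)
print(f"first run: {time.time() - start:.2f} s")
```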

Are there any recommendations to solve this issue? I would appreciate any suggestions. Thank you.

Dear @nrodwarna ,
Could you share the model, the repro code, and the steps?
Also, which GPU do you have on x86? What is the inference time on x86?

Does that mean the inference time on CPU is lower than on GPU?

Thank you for your response.

The device I'm using is a Jetson AGX Orin Developer Kit, and the link below contains the object detection model and code.
I did not record the inference time on the GPU, as it did not produce any result. I believe there is an issue with the setup or a bottleneck somewhere.

https://drive.google.com/drive/folders/1BZdSHiikksJg0f_VD3uaZ2fnKQ1-6iYK?usp=drive_link

May I know which GPU is used on the x86 machine?

NVIDIA Tegra Orin (nvgpu)/integrated
OpenGL Version: 4.6.0 NVIDIA 35.5.0

Dear @nrodwarna ,
I am asking about the GPU used in your laptop to get an idea of the expected performance on the laptop vs. the Jetson.

Sorry for the misunderstanding.

It's an NVIDIA GeForce RTX 3060 6 GB.

Dear @nrodwarna ,
Does this issue still need support? Is it possible to test the issue on the latest release?

Comparing the specs of the GPU and the Jetson devkit, the 3060 is more powerful (roughly 2x the CUDA cores).

Yes, I haven't tried the latest release yet, but I will.
By the way, it now works with the TensorRT model, but not with the ONNX one.