Problem deploying YOLOv8 on Jetson Orin with JetPack 5.1.1

Hi, I am trying to develop something with a Jetson Orin Nano module. I trained and exported the model on another computer following the Export - Ultralytics YOLOv8 Docs guide, and when I tried to deploy it on the Jetson, I got this:

2023-12-27 20:20:08.470905524 [W:onnxruntime:Default, tensorrt_execution_provider.h:75 log] [2023-12-27 12:20:08 WARNING] onnx2trt_utils.cpp:375: Your ONNX model has been generated with INT64 weights, while TensorRT does not natively support INT64. Attempting to cast down to INT32.
2023-12-27 20:21:14.354644247 [W:onnxruntime:Default, tensorrt_execution_provider.h:75 log] [2023-12-27 12:21:14 WARNING] Tactic Device request: 538MB Available: 443MB. Device memory is insufficient to use tactic.
2023-12-27 20:21:14.424114857 [W:onnxruntime:Default, tensorrt_execution_provider.h:75 log] [2023-12-27 12:21:14 WARNING] Skipping tactic 3 due to insufficient memory on requested size of 538 detected for tactic 0x0000000000000004.
2023-12-27 20:21:14.452337772 [W:onnxruntime:Default, tensorrt_execution_provider.h:75 log] [2023-12-27 12:21:14 WARNING] Tactic Device request: 538MB Available: 532MB. Device memory is insufficient to use tactic.
2023-12-27 20:21:14.473431019 [W:onnxruntime:Default, tensorrt_execution_provider.h:75 log] [2023-12-27 12:21:14 WARNING] Skipping tactic 8 due to insufficient memory on requested size of 538 detected for tactic 0x000000000000003c.
2023-12-27 20:21:14.498303943 [W:onnxruntime:Default, tensorrt_execution_provider.h:75 log] [2023-12-27 12:21:14 WARNING] Tactic Device request: 538MB Available: 531MB. Device memory is insufficient to use tactic.
2023-12-27 20:21:14.518971600 [W:onnxruntime:Default, tensorrt_execution_provider.h:75 log] [2023-12-27 12:21:14 WARNING] Skipping tactic 13 due to insufficient memory on requested size of 538 detected for tactic 0x0000000000000074.
2023-12-27 20:21:56.396434906 [W:onnxruntime:Default, tensorrt_execution_provider.h:75 log] [2023-12-27 12:21:56 WARNING] Tactic Device request: 806MB Available: 409MB. Device memory is insufficient to use tactic.
2023-12-27 20:21:56.417556252 [W:onnxruntime:Default, tensorrt_execution_provider.h:75 log] [2023-12-27 12:21:56 WARNING] Skipping tactic 3 due to insufficient memory on requested size of 806 detected for tactic 0x0000000000000004.
2023-12-27 20:21:56.457616974 [W:onnxruntime:Default, tensorrt_execution_provider.h:75 log] [2023-12-27 12:21:56 WARNING] Tactic Device request: 806MB Available: 409MB. Device memory is insufficient to use tactic.
2023-12-27 20:21:56.478414558 [W:onnxruntime:Default, tensorrt_execution_provider.h:75 log] [2023-12-27 12:21:56 WARNING] Skipping tactic 8 due to insufficient memory on requested size of 806 detected for tactic 0x000000000000003c.

The setup of the Jetson module is as follows:
JetPack: 5.1.1
TensorRT: 8.5.2.2
onnxruntime: 1.15.1 (from Jetson Zoo - eLinux.org)
Python: 3.8.10

The computer on which I trained and exported the model:
OS: Rocky Linux 9.3
GPU: Quadro RTX 4000 with CUDA 12.1
onnx: 1.15.1
python: 3.8.18

I exported the model with only one argument (format='onnx'); maybe I should export it to an engine file instead? Before trying that, I found some posts explaining that TensorRT would be called by onnxruntime, so I haven't tried the engine file yet.

Hi,

Do you get the inference results?

Based on your logs, these are just warnings rather than errors.
The messages mainly indicate that some algorithms (tactics) are not available due to the memory limit.

This is common on memory-limited Jetson platforms like the Orin Nano.
Thanks.

Hi AastaLLL, thanks for your attention.
I did get the inference result, but it took a really long time. Maybe that was the cost of warming up? The first warning log made me worry that something was wrong with the environment setup or with the parameters for the model export. As far as I know, the weights of the exported YOLOv8 model should be FP32 rather than INT64. Please forgive the silly question; I am not a skilled DL developer.

Hi, AastaLLL. I just tried to export the model to an engine file and perform inference on the same device (the one with the Quadro RTX 4000). During export, the same warning:

onnx2trt_utils.cpp:374: Your ONNX model has been generated with INT64 weights, while TensorRT does not natively support INT64. Attempting to cast down to INT32.

came up again. Although I got the engine file successfully, the TensorRT API runtime.deserialize_cuda_engine(serialized_engine) returned None, and I got a serialization error (rt::kPLAN_MAGIC_TAG failed.) from runtime.cpp. I'm not sure whether these things are related.

Hi,

INT64 might be used by PyTorch for some constants.
However, since the values are usually within the range of INT32, the cast doesn't really lead to an issue.
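To make that concrete, the cast is lossless whenever every INT64 value fits in the INT32 range, which is essentially always the case for shape and index constants. A small sketch of that check (the sample values 8400 and 84 are just illustrative output dimensions, not taken from any specific model):

```python
# INT64 constants in exported graphs are typically small values
# (tensor shapes, axis indices), so casting to INT32 loses nothing
# as long as every value fits in the INT32 range.
import numpy as np

INT32_MIN = np.iinfo(np.int32).min
INT32_MAX = np.iinfo(np.int32).max

def safe_cast_to_int32(values):
    """Cast INT64 values to INT32, refusing if any value would overflow."""
    arr = np.asarray(values, dtype=np.int64)
    if arr.min() < INT32_MIN or arr.max() > INT32_MAX:
        raise OverflowError("value out of INT32 range; cast would corrupt it")
    return arr.astype(np.int32)

# Typical shape/index constants survive the cast unchanged:
print(safe_cast_to_int32([1, -1, 8400, 84]).tolist())  # -> [1, -1, 8400, 84]
```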

The second error, related to kPLAN_MAGIC_TAG, indicates that you are trying to run an engine in an environment that differs from the one where it was created.
Please note that you need to re-convert the TensorRT engine whenever the software/hardware environment changes.

Thanks.

Hi, I'm getting the same issue, with the casting and engine build taking about 2.5 minutes; afterwards the model runs fine. Did anybody find a way of performing this conversion once and then saving the result? Exporting the model with any of the flags Ultralytics provides (half, dynamic, simplify, int8) did not work for me. This is on a Jetson Orin:

Orin
Package: nvidia-jetpack
Version: 5.1-b147
Architecture: arm64
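One configuration worth trying for the "build once, reuse later" case: onnxruntime's TensorRT execution provider can cache the built engine on disk, so the INT64 cast and tactic search happen only on the first run. A configuration sketch, assuming the Jetson Zoo onnxruntime-gpu wheel with the TensorRT EP; the model and cache paths are placeholders:

```python
import onnxruntime as ort

providers = [
    ("TensorrtExecutionProvider", {
        # Persist the built engine so subsequent runs skip the ~2.5 min build.
        "trt_engine_cache_enable": True,
        "trt_engine_cache_path": "./trt_cache",
        # Optionally cap the workspace to reduce the memory-tactic warnings
        # on memory-limited boards (value in bytes).
        "trt_max_workspace_size": 1 << 29,
    }),
    "CUDAExecutionProvider",
]
session = ort.InferenceSession("yolov8n.onnx", providers=providers)
```

Note the same caveat AastaLLL mentioned for engine files applies to the cache: it is tied to the TensorRT version and GPU, so it must be rebuilt when the environment changes.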

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.