Trouble building onnxruntime with tensorrt

Hi All, first time poster~

I’m trying to build onnxruntime with TensorRT support on my Jetson AGX Xavier with JetPack 4.6. I’m following the instructions on this page: Build with different EPs - onnxruntime, but my build fails. The most common error is:

onnxruntime/gsl/gsl-lite.hpp(1959): warning: calling a __host__ function from a __host__ __device__ function is not allowed

I’ve tried with the latest CMake version, 3.22.1, and with version 3.21.1 as mentioned on the website.

See attachment for the full text log.
jetstonagx_onnxruntime-tensorrt_install.log (168.6 KB)

The end goal of this build is to create a .whl binary to then use as part of the installation process of another program in a docker container. Any help and insight is appreciated, thank you!
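For context, my build invocation looks roughly like the following sketch (the paths are assumptions for a stock JetPack install and may not match my exact command; see the attached log for the real one):

```shell
# Sketch of a TensorRT-enabled onnxruntime wheel build on Jetson.
# CUDA/cuDNN/TensorRT paths assume a stock JetPack 4.6 install;
# verify them on your own device before running.
./build.sh --config Release --update --build --build_wheel \
    --use_tensorrt \
    --cuda_home /usr/local/cuda \
    --cudnn_home /usr/lib/aarch64-linux-gnu \
    --tensorrt_home /usr/lib/aarch64-linux-gnu
```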



You can find some prebuilt packages for JetPack 4.6 at the link below:
Does it meet your requirements, or do you want to build from source?


I started off there, and found the link I included in my first post on that page too, under “Build from Source”. The prebuilt wheels work fine, but they do not include the TensorRT backend. I’m trying to build onnxruntime so that it includes the TensorRT backend. Has anyone else tried or achieved this? Does anything in the attached build log stand out?


Thanks for your feedback.

Ideally, it should work.
We are going to try to reproduce this issue first and will share more information with you later.


We just double-checked the wheel package shared on the eLinux page.
With v1.10.0 + JetPack 4.6, we can run onnxruntime with the TensorrtExecutionProvider successfully.

Would you mind giving it a try?

$ python3
Python 3.6.9 (default, Dec  8 2021, 21:08:43)
[GCC 8.4.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import onnxruntime as ort
>>> sess = ort.InferenceSession('/usr/src/tensorrt/data/mnist/mnist.onnx', providers=['TensorrtExecutionProvider', 'CUDAExecutionProvider'])
2022-01-25 03:08:29.992372812 [W:onnxruntime:Default, tensorrt_execution_provider.h:53 log] [2022-01-25 08:08:29 WARNING] /home/onnxruntime/onnxruntime-py36/cmake/external/onnx-tensorrt/onnx2trt_utils.cpp:364: Your ONNX model has been generated with INT64 weights, while TensorRT does not natively support INT64. Attempting to cast down to INT32.
2022-01-25 03:08:31.460957667 [W:onnxruntime:Default, tensorrt_execution_provider.h:53 log] [2022-01-25 08:08:31 WARNING] Detected invalid timing cache, setup a local cache instead


Hi AastaLLL,

This helped! I was using the wheel for ort v1.8.0. The latest v1.10.0 wheel for Jetson seems to include the TensorRT provider out of the box. Thank you!

I was expecting a speed-up from using TRT with my models. Instead I’m seeing a significant (15-20x) slowdown. What am I missing? (Please let me know if I should make a new post for this question continuation).

The following runs show the seconds it took to run an inception_v3 and an inception_v4 model on 100 images using the CUDAExecutionProvider and the TensorrtExecutionProvider, respectively. The models were trained and converted to ONNX with PyTorch on a different computer. The runs are executed through Docker on the Jetson AGX device in MAXN mode.
Using jtop, I can see that with the CUDAExecutionProvider the GPU is always fully engaged, while with the TensorrtExecutionProvider the GPU is only intermittently engaged, like it’s sputtering.

      inception_v3  inception_v4
CUDA           11s           16s
TRT           223s          257s

So the best throughput I’m getting is ~9 img/sec. Shouldn’t I be able to crank out more frames per second?
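In case the methodology matters, I’m timing roughly like the sketch below (`run_once` is a hypothetical stand-in for a call like `sess.run(None, feeds)`). I understand TensorRT builds its engine on or before the first inference, so the warm-up iterations are meant to keep that one-time cost out of the numbers:

```python
import time

def benchmark(run_once, n_iters=100, warmup=3):
    """Time n_iters calls of run_once, excluding warm-up calls.

    run_once is a zero-argument callable performing one inference.
    The warm-up loop absorbs one-time costs (e.g. TensorRT engine
    building on the first call) before timing starts.
    """
    for _ in range(warmup):
        run_once()
    start = time.perf_counter()
    for _ in range(n_iters):
        run_once()
    elapsed = time.perf_counter() - start
    # Return total seconds and throughput in images/sec.
    return elapsed, n_iters / elapsed
```

For example, `benchmark(lambda: sess.run(None, {"input": batch}))`, where `sess` and the input name come from your own model.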


Yes, it would be good to open a new topic for the performance issue.

Ideally, you should see some acceleration when deploying with TensorRT.
Let’s dig into this in depth in the new topic.
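One thing worth checking in the meantime: the “invalid timing cache” warning in your earlier log suggests the TensorRT engine is being rebuilt. A sketch along these lines (option names per onnxruntime’s TensorRT Execution Provider options; the cache path is just an example) enables the engine cache so the build cost is paid once:

```python
# Hypothetical sketch: pass TensorRT EP options so the built engine is
# cached on disk and reused across sessions, instead of being rebuilt.
providers = [
    ('TensorrtExecutionProvider', {
        'trt_engine_cache_enable': True,
        'trt_engine_cache_path': '/tmp/trt_cache',  # example path
    }),
    'CUDAExecutionProvider',  # fallback for unsupported nodes
]

# With onnxruntime installed, the session would be created like:
# import onnxruntime as ort
# sess = ort.InferenceSession('model.onnx', providers=providers)
```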


This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.