Performance DECREASE with tensorRT under onnxruntime

user37927 · January 28, 2022, 6:53pm

Hi All,

I’m working on deploying an ONNX-format image classifier (Inception) on a Jetson AGX Xavier. I’ve gotten it to work with onnxruntime in a Docker container with the CUDAExecutionProvider and TensorrtExecutionProvider providers.

I was expecting a speed-up from using TensorRT with my models. Instead I’m seeing a significant (15-20x) slowdown. What am I missing?

The following runs show the seconds it took to run an inception_v3 and inception_v4 model on 100 images using CUDAExecutionProvider and TensorrtExecutionProvider respectively. The models were trained and converted to onnx using pytorch on a different computer. The runs are executed through docker on the Jetson AGX device in MAXN mode.
Using jtop I can see that with CUDAExecutionProvider the GPU is always fully engaged, while with TensorrtExecutionProvider the GPU is only intermittently engaged, as if it’s sputtering.

      inception_v3  inception_v4
CUDA           11s           16s
TRT           223s          257s

So the best speed I’m getting is ~9 img/sec. Shouldn’t I be able to crank out more frames per second?
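For reference, the throughput and slowdown implied by the table can be worked out as in the sketch below. It uses only the timings quoted above; the helper names `throughput` and `slowdown` are illustrative, not part of any library.

```python
# Wall-clock seconds for 100 images, copied from the table above.
N_IMAGES = 100
seconds = {
    "inception_v3": {"CUDA": 11, "TRT": 223},
    "inception_v4": {"CUDA": 16, "TRT": 257},
}


def throughput(n_images, secs):
    """Images per second for a run of n_images that took secs seconds."""
    return n_images / secs


def slowdown(results):
    """How many times slower the TRT run is than the CUDA run."""
    return results["TRT"] / results["CUDA"]


for model, results in seconds.items():
    cuda_ips = throughput(N_IMAGES, results["CUDA"])
    trt_ips = throughput(N_IMAGES, results["TRT"])
    print(f"{model}: CUDA {cuda_ips:.2f} img/s, TRT {trt_ips:.2f} img/s, "
          f"TRT is {slowdown(results):.1f}x slower")
```

That gives about 9.1 img/s (inception_v3) and 6.3 img/s (inception_v4) with CUDA, and a TRT slowdown of roughly 16x to 20x.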

If you need more details to dig into the specifics, let me know!
Thanks for your help!

AastaLLL · February 7, 2022, 2:39am

There has been no update from you for a while, so we assume this is no longer an issue.
We are therefore closing this topic. If you need further support, please open a new one.
Thanks

Hi,

Sorry for the late update and thanks for opening a new topic.

We want to reproduce this issue internally.
Would you mind sharing the ONNX model and a simple script to reproduce the CUDA and TensorRT results?

Thanks.
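A reproduction script along these lines might look like the sketch below. It is not the original poster's code: the model path, the input shape `(1, 3, 299, 299)`, and the helper names `benchmark` and `make_ort_runner` are all assumptions for illustration. It times the first few calls separately from the rest, so that any one-time setup cost in a provider shows up on its own line instead of being folded into the total. Whether that explains the numbers in this thread is not established here.

```python
import time


def benchmark(run_once, n_warmup=5, n_iters=100):
    """Time run_once(), reporting warm-up and steady-state runs separately."""
    t0 = time.perf_counter()
    for _ in range(n_warmup):
        run_once()
    warmup_s = time.perf_counter() - t0

    t0 = time.perf_counter()
    for _ in range(n_iters):
        run_once()
    steady_s = time.perf_counter() - t0

    # img_per_s assumes one image per call (batch size 1).
    img_per_s = n_iters / steady_s if steady_s > 0 else float("inf")
    return {"warmup_s": warmup_s, "steady_s": steady_s, "img_per_s": img_per_s}


def make_ort_runner(model_path, providers, input_shape=(1, 3, 299, 299)):
    """Return a zero-argument callable that runs one inference.

    input_shape is an assumption; adjust it to the exported model's input.
    """
    # Imported lazily so benchmark() can be used without onnxruntime installed.
    import numpy as np
    import onnxruntime as ort

    sess = ort.InferenceSession(model_path, providers=providers)
    input_name = sess.get_inputs()[0].name
    x = np.random.rand(*input_shape).astype(np.float32)
    return lambda: sess.run(None, {input_name: x})


# Usage (hypothetical model path):
# for providers in (["CUDAExecutionProvider"],
#                   ["TensorrtExecutionProvider", "CUDAExecutionProvider"]):
#     run = make_ort_runner("inception_v3.onnx", providers)
#     print(providers[0], benchmark(run))
```

Random input is used only to keep the sketch self-contained; timing real images would also include preprocessing cost.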

system · March 8, 2022, 8:26am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.
