I use the TensorRT C++ API for inference on my Jetson Xavier, and I use deserializeCudaEngine to create the engine object from .plan files. Everything works fine except for these two problems:
1. runtime->setDLACore(1) does not use a DLA core but the GPU, whereas trtexec uses the DLA without this problem.
2. GPU utilization never goes above 50-60% (I load a lot of images from a folder and run inference with batch size 1, image by image), while trtexec with batch size 1 manages to use 100% of the GPU.
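For reference, the load path in my app is roughly the following (a simplified sketch; the plan file name, the logger, and the inference loop are placeholders, error checks omitted):

```cpp
#include <NvInfer.h>
#include <fstream>
#include <iostream>
#include <iterator>
#include <vector>

class Logger : public nvinfer1::ILogger {
    // TensorRT 7-style signature; add noexcept on TensorRT 8+.
    void log(Severity severity, const char* msg) override {
        if (severity <= Severity::kWARNING)
            std::cerr << msg << std::endl;
    }
} gLogger;

int main() {
    // Read the serialized engine (.plan) into memory.
    std::ifstream file("model.plan", std::ios::binary);
    std::vector<char> blob((std::istreambuf_iterator<char>(file)),
                           std::istreambuf_iterator<char>());

    nvinfer1::IRuntime* runtime = nvinfer1::createInferRuntime(gLogger);
    runtime->setDLACore(1);  // this is what I expect to select DLA core 1

    nvinfer1::ICudaEngine* engine =
        runtime->deserializeCudaEngine(blob.data(), blob.size(), nullptr);
    nvinfer1::IExecutionContext* context = engine->createExecutionContext();

    // ... loop over the images, copy each one to device memory,
    //     then run context->execute(1, bindings) per image ...
    return 0;
}
```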
1. This may be model-dependent.
Not all operations are supported by DLA.
TensorRT will run a layer on the GPU if DLA doesn't support it (when GPU fallback is enabled; see the build-config sketch after this reply).
2. May I know if you use the same model for your app and for trtexec?
If yes, it looks like there is something incorrect in your application.
You can check the trtexec sample code for more information.
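For example, a minimal sketch of how DLA plus GPU fallback is requested at build time, assuming the IBuilderConfig API of TensorRT 6 and later (the helper name and core index are just illustrative):

```cpp
#include <NvInfer.h>

// Illustrative helper: ask TensorRT to place layers on DLA where possible
// and let the remaining, unsupported layers fall back to the GPU.
void configureForDLA(nvinfer1::IBuilderConfig* config, int dlaCore) {
    config->setFlag(nvinfer1::BuilderFlag::kFP16);          // DLA runs FP16/INT8 only
    config->setDefaultDeviceType(nvinfer1::DeviceType::kDLA);
    config->setDLACore(dlaCore);
    config->setFlag(nvinfer1::BuilderFlag::kGPU_FALLBACK);  // unsupported layers -> GPU
}
```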
1. Suppose not all the layers are supported by DLA.
Then there are some I/O transfers between the GPU and DLA.
Do you see any fallback log when building the engine with TensorRT? (A sketch for listing the fallback layers follows this reply.)
2. You may need to minimize the multimedia pipeline's overhead (e.g. memcpy, bandwidth, ...).
Please check if our DeepStream SDK can help: https://developer.nvidia.com/deepstream-sdk
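If it helps, a rough way to list the layers that would fall back before building the engine (this assumes the IBuilderConfig::canRunOnDLA query of recent TensorRT releases; on older releases the equivalent query lives on IBuilder):

```cpp
#include <NvInfer.h>
#include <iostream>

// Print every layer that DLA cannot take, i.e. the layers that will cause
// GPU fallback and extra GPU<->DLA transfers.
void reportDlaFallbacks(nvinfer1::INetworkDefinition* network,
                        nvinfer1::IBuilderConfig* config) {
    for (int i = 0; i < network->getNbLayers(); ++i) {
        nvinfer1::ILayer* layer = network->getLayer(i);
        if (!config->canRunOnDLA(layer))
            std::cout << "GPU fallback: " << layer->getName() << std::endl;
    }
}
```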
It looks like you are using different models in your app and in trtexec.
Even though the models achieve the same thing, they may contain different operations and lead to different TensorRT implementations.
Would you mind testing them with the same model first?
Thanks.
When I use the exact same model (uff) in my app and in trtexec, trtexec can use the DLA but my app does not.
However, if I generate a .plan file from that same uff with trtexec (using --use-dla=0) and then load the resulting .plan file, my app can use the DLA.
The only difference is that for my app's .plan I used the uff_to_plan.cpp converter, while in the second case I used trtexec to produce the .plan file.
Please note that when you create the TensorRT PLAN, the implementation (including hardware and memory layout, ...) is already decided.
So the PLAN cannot be used across systems, and the same applies to DLA: the DLA placement is fixed when the PLAN is built, not when it is deserialized.
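For example, a DLA-enabled PLAN has to be requested when the engine is built, roughly like this (a sketch only: the input/output names, dims and workspace size are placeholders, and it assumes the TensorRT 6+/7 UFF parser and IBuilderConfig API; cleanup omitted):

```cpp
#include <NvInfer.h>
#include <NvUffParser.h>
#include <fstream>
#include <iostream>

static class : public nvinfer1::ILogger {
    void log(Severity severity, const char* msg) override {  // add noexcept on TRT 8+
        if (severity <= Severity::kWARNING) std::cerr << msg << std::endl;
    }
} gLogger;

// Minimal uff-to-plan sketch that selects DLA at build time.
bool uffToDlaPlan(const char* uffFile, const char* planFile, int dlaCore) {
    auto builder = nvinfer1::createInferBuilder(gLogger);
    auto network = builder->createNetworkV2(0U);   // UFF requires implicit batch
    auto config  = builder->createBuilderConfig();
    auto parser  = nvuffparser::createUffParser();

    // Placeholder tensor names and dims - use the ones from your model.
    parser->registerInput("input", nvinfer1::Dims3(3, 224, 224),
                          nvuffparser::UffInputOrder::kNCHW);
    parser->registerOutput("output");
    if (!parser->parse(uffFile, *network, nvinfer1::DataType::kFLOAT))
        return false;

    builder->setMaxBatchSize(1);
    config->setMaxWorkspaceSize(1 << 28);
    config->setFlag(nvinfer1::BuilderFlag::kFP16);            // DLA needs FP16/INT8
    config->setDefaultDeviceType(nvinfer1::DeviceType::kDLA); // place layers on DLA
    config->setDLACore(dlaCore);
    config->setFlag(nvinfer1::BuilderFlag::kGPU_FALLBACK);    // unsupported layers -> GPU

    auto engine = builder->buildEngineWithConfig(*network, *config);
    if (!engine) return false;

    // Serialize the DLA-enabled engine to a .plan file.
    auto blob = engine->serialize();
    std::ofstream out(planFile, std::ios::binary);
    out.write(static_cast<const char*>(blob->data()), blob->size());
    return true;
}
```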