Hey,
I might not have been very clear with my question.
I do get that this “engine”:
engine = trt_runtime.deserialize_cuda_engine(engine_data)
is an object created by that call inside the function “build_engine”.
What I don’t get is that this function isn’t defined as a method of a class “Engine”, so why does the author use this import statement
import engine as eng
at the beginning of the script and then use:
engine = eng.build_engine(onnx_path,engine_name)
It doesn’t make sense to me.
Either they created a class named “Engine” and implemented a method called “build_engine”, but then they would have to create an instance first, like:
eng = Engine()
engine = eng.build_engine(onnx_path,engine_name)
Or they call the function “build_engine” directly, like this:
engine = build_engine(onnx_path,engine_name)
I don’t understand the mix being made here, and that’s one of the reasons I can’t reproduce the execution.
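The only way I can make those two lines work on my side is to assume that “engine” is simply a Python file, engine.py, sitting next to the main script, with “build_engine” defined at the top level of that file. Roughly like this (this is only my guess at the layout, not the author’s actual code; the TensorRT calls in the body are just the usual ONNX parsing steps):

# engine.py -- my guess at the tutorial's layout: a plain module, no Engine class
import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)
trt_runtime = trt.Runtime(TRT_LOGGER)

def build_engine(onnx_path, engine_name):
    # module-level function, so no class instance is ever created
    builder = trt.Builder(TRT_LOGGER)
    network = builder.create_network(
        1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
    parser = trt.OnnxParser(network, TRT_LOGGER)
    builder.max_workspace_size = 1 << 30
    with open(onnx_path, "rb") as f:
        parser.parse(f.read())
    engine = builder.build_cuda_engine(network)
    with open(engine_name, "wb") as f:
        f.write(engine.serialize())          # save the serialized engine to disk
    return engine

If that is the case, then “import engine as eng” just imports that file as a module, and “eng.build_engine(onnx_path, engine_name)” calls the module-level function, with no class or instance involved. Could you confirm whether that is how the author’s code is organized?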
Also, did you have a look at the error I get in the linked issue?
Thanks
[EDIT]: I went through the workflow shown in the "Speeding up Deep Learning" link, step by step, to spot which command is the problem, and I found it.
>>> context.execute(batch_size=1,bindings=[int(dinput1),int(doutput)])
[TensorRT] ERROR: ../rtSafe/safeContext.cpp (133) - Cudnn Error in configure: 7 (CUDNN_STATUS_MAPPING_ERROR)
[TensorRT] ERROR: FAILED_EXECUTION: std::exception
False
Loading the engine works, and so do the creation and allocation of the buffers used to send the image to the GPU, but the problem occurs when I try to execute the inference.
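For reference, here is roughly the full sequence I am running, condensed from the tutorial; the engine path and the input/output shapes are placeholders for my actual values:

import numpy as np
import pycuda.autoinit          # creates the CUDA context
import pycuda.driver as cuda
import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)
trt_runtime = trt.Runtime(TRT_LOGGER)

engine_path = "model.engine"                                # placeholder for my engine file
input_shape, output_shape = (1, 3, 224, 224), (1, 1000)     # placeholders for my model's shapes

# 1) load the serialized engine -- this step works
with open(engine_path, "rb") as f:
    engine_data = f.read()
engine = trt_runtime.deserialize_cuda_engine(engine_data)
context = engine.create_execution_context()

# 2) allocate buffers and copy the image to the GPU -- this also works
h_input = np.random.random(input_shape).astype(np.float32)  # stands in for my preprocessed image
h_output = np.empty(output_shape, dtype=np.float32)
dinput1 = cuda.mem_alloc(h_input.nbytes)
doutput = cuda.mem_alloc(h_output.nbytes)
cuda.memcpy_htod(dinput1, h_input)

# 3) run inference -- this is the call that fails with CUDNN_STATUS_MAPPING_ERROR
print(context.execute(batch_size=1, bindings=[int(dinput1), int(doutput)]))   # prints False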
What do you think of this error?
Thanks