How to perform inference using a serialized TensorRT engine (*.plan) on Jetson Nano?

I have a serialized TensorRT engine (*.plan) file that I’ve created from a pre-trained PyTorch RetinaNet model, which I’ve further fine-tuned on a custom dataset. The model and code I’ve based this on are provided by NVIDIA here.

This NVIDIA RetinaNet model is intended to be run within the NVIDIA PyTorch Docker container. However, that isn’t an option on a Jetson Nano, since nvidia-docker is not yet supported on ARM64. So in order to use this model on the Jetson Nano I need to perform inference with the TensorRT engine outside of the Docker container. I’ve not yet found documentation that clearly explains how to do this. (For example, this guide is about as clear as mud to a rookie like me.)
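For concreteness, here is roughly what I imagine the standalone inference would look like using the TensorRT Python API plus PyCUDA. This is only a sketch of my understanding, not working code: I’m assuming TensorRT 7.x on a recent JetPack (older versions use `execute_async` with an implicit batch instead), that the Python bindings and PyCUDA are installed on the Nano, that the engine has fixed input shapes, and that the first binding is the input. The file name `retinanet.plan` is just a placeholder for my engine.

```python
import numpy as np
import pycuda.autoinit  # creates a CUDA context on import
import pycuda.driver as cuda
import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)

# Deserialize the engine from the .plan file
with open("retinanet.plan", "rb") as f, trt.Runtime(TRT_LOGGER) as runtime:
    engine = runtime.deserialize_cuda_engine(f.read())

context = engine.create_execution_context()
stream = cuda.Stream()

# Allocate pinned host buffers and device buffers for every binding
host_bufs, dev_bufs, bindings = [], [], []
for binding in engine:
    shape = engine.get_binding_shape(binding)              # assumes fixed shapes
    dtype = trt.nptype(engine.get_binding_dtype(binding))
    host_mem = cuda.pagelocked_empty(trt.volume(shape), dtype)
    dev_mem = cuda.mem_alloc(host_mem.nbytes)
    host_bufs.append(host_mem)
    dev_bufs.append(dev_mem)
    bindings.append(int(dev_mem))

def infer(image_chw):
    """Run one inference on a preprocessed float32 array matching the input binding."""
    np.copyto(host_bufs[0], image_chw.ravel())              # assumes binding 0 is the input
    cuda.memcpy_htod_async(dev_bufs[0], host_bufs[0], stream)
    context.execute_async_v2(bindings=bindings, stream_handle=stream.handle)
    for h, d in zip(host_bufs[1:], dev_bufs[1:]):
        cuda.memcpy_dtoh_async(h, d, stream)
    stream.synchronize()
    return [np.array(h) for h in host_bufs[1:]]
```

Is this roughly the right shape of solution, or am I missing a step?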

My goal is to read image frames from a video stream and use the model to perform inference on each frame for object detection. I have this working as planned on a laptop using the fine-tuned PyTorch RetinaNet (*.pth) model, and TensorRT on the Jetson Nano is my next frontier.
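The per-frame loop I have in mind would look something like the sketch below, reusing the `infer()` helper from above. The stream URL, input size, and preprocessing here are placeholders; they would need to match whatever the engine was exported with.

```python
import cv2
import numpy as np

cap = cv2.VideoCapture("rtsp://camera/stream")  # or a device index / file path
INPUT_W, INPUT_H = 1280, 800                    # placeholder; must match the engine's input binding

while True:
    ok, frame = cap.read()
    if not ok:
        break
    # Resize, convert BGR->RGB, scale to [0,1], and reorder to CHW with a batch dimension
    img = cv2.resize(frame, (INPUT_W, INPUT_H))
    img = cv2.cvtColor(img, cv2.COLOR_BGR2RGB).astype(np.float32) / 255.0
    chw = np.transpose(img, (2, 0, 1))[None]
    outputs = infer(chw)
    # 'outputs' are the raw engine outputs (e.g. scores/boxes/classes); decoding them
    # into detections depends on how the engine was exported.

cap.release()
```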

Thanks in advance for any comments or suggestions.

Hi, did you end up succeeding with using a TensorRT engine file to perform inference on multiple frames?

Not yet, because I haven’t worked out how to convert the various models I’ve trained on my custom dataset into TensorRT engine files. These have typically been PyTorch or Keras/TensorFlow models, and TensorRT seems to work most easily with Caffe-based models. How to get TensorRT engine files for my custom-trained models that I can then use with the DeepStream SDK is still unclear to me. Is there a simple recipe for mortals? This seems to be poorly documented, and/or I just haven’t yet found a good guide that I can follow to the end for my situation.
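For what it’s worth, the rough recipe I keep seeing suggested (but haven’t followed through to the end myself) is to export the PyTorch model to ONNX and then build the engine with trtexec on the Nano itself. Something like the sketch below, where the input shape, tensor names, and file names are just placeholders for my model:

```python
import torch

model = ...  # the fine-tuned PyTorch model
model.eval()

dummy = torch.randn(1, 3, 800, 1280)  # placeholder input shape
torch.onnx.export(
    model, dummy, "model.onnx",
    input_names=["input"], output_names=["output"],
    opset_version=11,
)

# Then on the Jetson Nano (trtexec ships with TensorRT):
#   trtexec --onnx=model.onnx --saveEngine=model.plan --fp16
```

If anyone can confirm whether this is the intended path for custom PyTorch or Keras/TensorFlow models, or point to a guide that walks through it end to end, I’d appreciate it.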