Description
I am trying to understand the differences between the various ways to compile/export a PyTorch model to a TensorRT engine. I’m using PyTorch 2.2.
Background: My end goal is to export my detectron2-trained PyTorch model to a TensorRT .engine file so that I can use it in NVIDIA DeepStream afterwards.
This got me reading about TorchScript, torch.fx, torch.export, torch.compile/TorchDynamo with its different backends (e.g. the torch_tensorrt backend, whose output apparently cannot be serialized?), as well as the standalone torch_tensorrt project.
Since the model (Mask2Former with a Swin Transformer backbone) and the surrounding codebase include complex Python constructs and dynamic control flow, I’ve ruled out torch.fx and all tracing-based methods (please correct me if my thinking is wrong).
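To illustrate why I ruled out tracing (a toy example, not my actual model): with data-dependent control flow, torch.jit.trace only records the branch taken for the example input, while torch.jit.script keeps the if/else:

```python
import torch
import torch.nn as nn

class ToyBranch(nn.Module):
    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Data-dependent control flow: which branch runs depends on the values in x.
        if x.sum() > 0:
            return x * 2
        return x - 1

m = ToyBranch()
traced = torch.jit.trace(m, torch.ones(3))   # records only the "x * 2" branch
scripted = torch.jit.script(m)               # preserves the if/else

x_neg = -torch.ones(3)
print(traced(x_neg))    # wrong: still multiplies by 2
print(scripted(x_neg))  # correct: subtracts 1
```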
I’m now left with these questions:
- Should I first convert the model to TorchScript using torch.jit.script? Is scripting the only “easy” option, given graph breaks and the need to run outside a Python runtime?
- Is torch.compile (TorchDynamo), applied directly to the PyTorch model, suitable for my goal (eventually serializing to a TensorRT engine file for use in DeepStream), or should I first convert the model to TorchScript?
- After compiling the model with any of the above methods, my understanding is that I still need torch_tensorrt to actually serialize a TensorRT engine. Is there another way? (See the sketch after this list for what I have in mind.)
- I’ve also stumbled upon the torch2trt project, but I’m not sure whether it’s a better option.
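For reference, here is a minimal sketch of the flow I have in mind, based on my reading of the torch_tensorrt docs; the stand-in model, the 1x3x800x800 input shape and the output file names are placeholders, not my real setup:

```python
import torch
import torch.nn as nn
import torch_tensorrt

# Stand-in model; in my case this would be the detectron2 Mask2Former model in eval mode.
model = nn.Sequential(nn.Conv2d(3, 8, 3, padding=1), nn.ReLU()).eval().cuda()

# Placeholder input spec, not my real preprocessing size.
inputs = [torch_tensorrt.Input((1, 3, 800, 800), dtype=torch.float32)]

# Script first (to preserve control flow), then hand the result to torch_tensorrt.
scripted = torch.jit.script(model)

# Option A: a TorchScript module with embedded TensorRT engines.
# This runs through the torch_tensorrt runtime; it is not a standalone .engine file.
trt_ts = torch_tensorrt.compile(
    scripted, ir="ts", inputs=inputs, enabled_precisions={torch.float16})
torch.jit.save(trt_ts, "model_trt.ts")

# Option B (what I believe DeepStream needs): a standalone serialized TensorRT engine.
engine_bytes = torch_tensorrt.ts.convert_method_to_trt_engine(
    scripted, "forward", inputs=inputs, enabled_precisions={torch.float16})
with open("model.engine", "wb") as f:
    f.write(engine_bytes)
```

In particular, is Option B the right way to get a plain .engine file that DeepStream’s nvinfer can deserialize, or does an engine produced this way still depend on the torch_tensorrt runtime?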
Sorry for the long post, and I appreciate any help!
Environment
TensorRT Version: 8.4.1.6
GPU Type: RTX3090
Nvidia Driver Version: 550.54.15
CUDA Version: 12.1
CUDNN Version: 8.9.2
Operating System + Version: Ubuntu 22.04
Python Version (if applicable): 3.8
TensorFlow Version (if applicable): -
PyTorch Version (if applicable): 2.2
Baremetal or Container (if container which image + tag): -