I have a question about TensorRT (TRT) engines on Triton Inference Server (TRTIS).
We all know that we can easily deploy Triton Inference Server with Docker and serve our models (TRT, ONNX, etc.) locally. However, a TRT engine must be generated in the same environment (TensorRT version, GPU) as the Triton server that will run it, so I have to create another container just to build a TRT engine for TRTIS.
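Concretely, my extra step looks roughly like this (container tags, model names, and paths below are just illustrative; I pick a TensorRT container whose yy.mm release tag matches the Triton release):

```shell
# Build the engine inside a TensorRT container matching the Triton release
# (23.05 here is only an example tag):
docker run --gpus all --rm -v $(pwd):/workspace \
  nvcr.io/nvidia/tensorrt:23.05-py3 \
  trtexec --onnx=/workspace/model.onnx \
          --saveEngine=/workspace/model.plan

# Then copy model.plan into the Triton model repository, e.g.:
#   model_repository/my_model/1/model.plan
# and start the matching Triton server:
docker run --gpus all --rm -p 8000:8000 \
  -v $(pwd)/model_repository:/models \
  nvcr.io/nvidia/tritonserver:23.05-py3 \
  tritonserver --model-repository=/models
```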
On NGC, there are two images per release (one for the server, one for the client). I checked the client image, but it doesn’t include the TensorRT packages (libraries), so I cannot build a TensorRT engine in it directly.
I wonder: is there a faster way to generate a TensorRT engine that matches the TRTIS environment?
Thank you so much!!!