Use TensorRT model with TAO Toolkit inference

user44752 · January 26, 2022, 1:44pm

• Network Type: EfficientNet B1
• TLT Version (TAO Toolkit 3-21.11)

Hello,

I’ve been looking into the TAO Toolkit documentation and I’ve seen that in “running inference on a model” there’s a part that says “TensorRT Python inference can also be enabled”.

However, on the sample command, the -m parameter is for the “path to the pretrained model (TAO model)”.

My question is, it could be possible to use a TensorRT generated engine to run the inference using this tao command?
And also, could I use another extension like .etlt?

Thanks in advance.

Morganh · January 26, 2022, 1:54pm

For inference, usually there are 3 ways.

Tao inference. Currently it can only run against tlt model.
With deepstream. Refer to
Issue with image classification tutorial and testing with deepstream-app - #21 by Morganh
With python inference. Refer to tao-toolkit-triton-apps/configuring_the_client.md at main · NVIDIA-AI-IOT/tao-toolkit-triton-apps (github.com) and Issue with image classification tutorial and testing with deepstream-app - #25 by dzmitry.babrovich

user44752 · January 26, 2022, 1:58pm

Thank you.

But what does it mean TensorRT Python inference can also be enabled? Is it an alternative mode for tao inference command?

Morganh · January 26, 2022, 2:04pm

As mentioned above, it should be talking about Integrating TAO CV Models with Triton Inference Server — TAO Toolkit 3.22.05 documentation.

user44752 · January 26, 2022, 2:07pm

Ok, thank you so much!

system · February 9, 2022, 2:07pm

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Inference on .etlt model TAO Toolkit	7	1528	December 7, 2021
Do inference of tao generated engine in python without deepstream TAO Toolkit jetson-inference , python , tf-trt , tao , deepstream	2	1541	February 18, 2022
How can I convert a tlt model to run inference in a light-weight python script on CPU? TAO Toolkit	8	570	July 6, 2022
TAO tensorRT model inferencing using python TAO Toolkit	2	982	December 7, 2021
GPU difference inference TAO Toolkit	11	986	February 18, 2022
Run .egine models of TLT with TensorRT in general TAO Toolkit	2	546	October 12, 2021
TensorRT Inference form a .etlt model on Python TAO Toolkit tensorrt	7	1229	November 16, 2021
How preform inference retinanet using a TLT export .engine file by python TAO Toolkit tensorrt	4	885	October 12, 2021
How to run nvidia pretrained model directly on T4（or similar cards, not edge device）? TAO Toolkit tensorrt , python	3	681	March 10, 2022
Doing tlt inference only with tensorrt TAO Toolkit	3	939	October 9, 2021

Use TensorRT model with TAO Toolkit inference

Related topics