GPU difference inference

rishika.v · January 12, 2022, 10:35am

Hello,

Can .tlt models trained on a GTX 1080 be used for inference on a Triton server running an RTX 3070 or 3080?

Morganh · January 12, 2022, 10:53am

Yes, but please note that what you deploy is the .etlt model. Run tao export, then copy the .etlt model to the Triton server.
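As a rough illustration of where files end up on the Triton side: a TensorRT engine is tied to the GPU it was built on, so the portable .etlt is what gets copied, and the engine is built on the target machine. Triton then loads the engine from a model repository laid out as `<repo>/<model name>/<version>/model.plan` with a `config.pbtxt` next to the version folder. The sketch below only stages that layout. The helper name `stage_model`, the model name `detectnet_demo`, and the placeholder engine bytes and config text are assumptions for illustration, not part of TAO or Triton.

```python
from pathlib import Path
import tempfile


def stage_model(repo_root, model_name, engine_bytes, config_text, version=1):
    """Place an engine and its config in Triton's model repository layout.

    Hypothetical helper: engine_bytes stands in for a TensorRT engine that
    was built from the .etlt file on the target GPU.
    """
    model_dir = Path(repo_root) / model_name
    version_dir = model_dir / str(version)
    version_dir.mkdir(parents=True, exist_ok=True)
    # Triton looks for a TensorRT engine named model.plan by default.
    (version_dir / "model.plan").write_bytes(engine_bytes)
    # config.pbtxt sits beside the version folders, not inside them.
    (model_dir / "config.pbtxt").write_text(config_text)
    return model_dir


if __name__ == "__main__":
    # Placeholder config: a real one also declares inputs and outputs.
    config = 'name: "detectnet_demo"\nplatform: "tensorrt_plan"\n'
    with tempfile.TemporaryDirectory() as repo:
        staged = stage_model(repo, "detectnet_demo", b"placeholder", config)
        for path in sorted(staged.rglob("*")):
            print(path.relative_to(repo))
```

The server is then started with this repository as its model repository path.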

rishika.v · January 12, 2022, 1:01pm

Thank you!

rishika.v · January 24, 2022, 10:30am

Hi,

How do I run a detectnet_v2 model on the Triton server? It is a little confusing.

Morganh · January 24, 2022, 12:45pm

rishika.v · January 25, 2022, 6:21am

I saw this documentation earlier, but it is not clear…
Is there another client that I have to install for Detectnet_v2?

Morganh · January 25, 2022, 6:28am

After you have trained a detectnet_v2 model, you can deploy it to run inference.
There are usually four ways to run inference:

1. Directly run “tao detectnet_v2 inference xxx”.
2. Run it with DeepStream; refer to https://docs.nvidia.com/tao/tao-toolkit/text/object_detection/detectnet_v2.html#deploying-to-deepstream
3. Run it with Triton server; refer to NVIDIA-AI-IOT/tao-toolkit-triton-apps: Sample app code for deploying TAO Toolkit trained models to Triton (github.com). PeopleNet and DashCamNet are actually based on the detectnet_v2 network.
4. Run it with your own standalone inference code. For example, see Run PeopleNet with tensorrt - #21 by carlos.alvarez

rishika.v · January 25, 2022, 6:57am

What about the post-processing: is it a file or a folder?
If it is a file, what is its extension?

The DetectNet_v2 inference sample has 2 components that can be configured

Morganh · January 25, 2022, 9:14am

For running on the Triton server, please refer to the README.
For example, to run the officially released PeopleNet model, see its post-processing config file at tao-toolkit-triton-apps/clustering_config_peoplenet.prototxt at main · NVIDIA-AI-IOT/tao-toolkit-triton-apps (github.com)
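To give a feel for what such a post-processing config parameterizes, here is a minimal sketch of the decode step that usually precedes clustering in DetectNet_v2-style models. It assumes one coverage grid and one 4-channel bbox grid per class, a stride of 16, and the commonly used normalization of 35.0 with a 0.5 cell offset. These values, the threshold, and the function name `decode_detectnet_v2` are assumptions for illustration, so check them against the config file and the sample app. The clustering stage itself (for example DBSCAN) is left out.

```python
def decode_detectnet_v2(cov, bbox, stride=16, bbox_norm=35.0, offset=0.5,
                        threshold=0.4):
    """Turn one class's grid outputs into candidate boxes (no clustering).

    cov:  [rows][cols] coverage scores for the class.
    bbox: [4][rows][cols] normalized offsets (x1, y1, x2, y2) for the class.
    All default values are assumptions, not taken from the TAO docs.
    """
    boxes = []
    for r, cov_row in enumerate(cov):
        for c, score in enumerate(cov_row):
            if score < threshold:
                continue  # drop low-coverage grid cells
            # Grid-cell centre in normalized units.
            cx = (c * stride + offset) / bbox_norm
            cy = (r * stride + offset) / bbox_norm
            # Convert offsets relative to the centre into pixel corners.
            x1 = (bbox[0][r][c] - cx) * -bbox_norm
            y1 = (bbox[1][r][c] - cy) * -bbox_norm
            x2 = (bbox[2][r][c] + cx) * bbox_norm
            y2 = (bbox[3][r][c] + cy) * bbox_norm
            boxes.append((x1, y1, x2, y2, score))
    return boxes


if __name__ == "__main__":
    # Toy 2x2 grid with a single confident cell.
    cov = [[0.0, 0.0], [0.0, 0.9]]
    bbox = [[[0.0, 0.0], [0.0, 1.0]] for _ in range(4)]
    for box in decode_detectnet_v2(cov, bbox):
        print(box)
```

The candidate boxes returned here would normally still be grouped and filtered per class, which is the part the clustering config controls.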

rishika.v · February 3, 2022, 5:02am

Can I run classification with standalone inference code?

Morganh · February 4, 2022, 8:59am

