Triton deployment and inference

Hello.

I’m trying to run inference on Triton Inference Server with a FasterRCNN model trained with TLT and converted to TensorRT.

In the docs I can see:

Note that the models can also be deployed outside of DeepStream using TensorRT but users will need to do image pre-processing and post-process the output Tensor after inference.

What kind of pre-processing and post-processing should I use for FasterRCNN and the other available object detection models?
Currently I do: OpenCV Mat (8UC3) → 32FC3 → channel-wise array (3, width, height) (RRRGGGBBB) → pack the data into a bytestring and send the request.
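For reference, a minimal sketch of that pipeline in Python, assuming OpenCV and NumPy and the 384x1248 input size from the model config further down; whether the network expects RGB or BGR, and any mean/scale normalization, are assumptions to check against the training setup:

```python
import cv2
import numpy as np

def preprocess(image_path, width=1248, height=384):
    """Build a (1, 3, height, width) float32 tensor from an image file."""
    img = cv2.imread(image_path)                # 8UC3, HWC, BGR
    img = cv2.resize(img, (width, height))      # match the network input size
    img = cv2.cvtColor(img, cv2.COLOR_BGR2RGB)  # assumption: model expects RGB order
    img = img.astype(np.float32)                # 8UC3 -> 32FC3
    chw = np.transpose(img, (2, 0, 1))          # HWC -> CHW (RRR...GGG...BBB)
    return np.expand_dims(chw, axis=0)          # add the batch dimension
```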

Then I parse the output, but NMS_1 always equals zero.

Triton generated this config:
{"name":"frcnn_fp16","versions":["1"],"platform":"tensorrt_plan","inputs":[{"name":"input_image","datatype":"FP32","shape":[-1,3,384,1248]}],"outputs":[{"name":"NMS","datatype":"FP32","shape":[-1,1,100,7]},{"name":"NMS_1","datatype":"FP32","shape":[-1,1,1,1]}]}

In TLT 3.0, for post-processing, please refer to https://github.com/NVIDIA-AI-IOT/deepstream_tlt_apps/tree/master/post_processor. For pre-processing, please refer to https://github.com/NVIDIA-AI-IOT/deepstream_tlt_apps/blob/master/configs/frcnn_tlt/pgie_frcnn_tlt_config.txt.
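The pgie config linked above is where the pre-processing parameters live (net-scale-factor, offsets, model-color-format). As an illustration only, nvinfer applies them roughly as y = net-scale-factor * (x - offsets); the values below are placeholders to be replaced with whatever the referenced config actually specifies:

```python
import numpy as np

# Placeholder values -- copy the real ones from pgie_frcnn_tlt_config.txt:
#   net-scale-factor=...   offsets=c1;c2;c3   model-color-format=...
NET_SCALE_FACTOR = 1.0
OFFSETS = np.array([103.939, 116.779, 123.68], dtype=np.float32)  # per-channel means

def normalize(chw):
    """Apply DeepStream-style normalization: y = net_scale_factor * (x - offsets).

    `chw` is a CHW (or NCHW) float32 array; OFFSETS must be in the same
    channel order (RGB vs BGR) as the data.
    """
    return NET_SCALE_FACTOR * (chw - OFFSETS[:, None, None])
```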

In TLT 2.0, for post-processing, please refer to https://github.com/NVIDIA-AI-IOT/deepstream_tlt_apps/tree/release/tlt2.0.1/nvdsinfer_customparser_frcnn_tlt

Reference topics:
- For classification network
- For detectnet_v2 network

Hi to all,
I was able to launch the Inception SSD network plan file via tritonserver and write a Python client to query it. However, while I can get the correct NMS output for the bboxes, NMS_1 (keep count) always gives me a zero value. How is this possible? The strange thing is that if I run the TensorRT example script, which doesn’t use Triton Server, I get the correct output of 100. Thanks in advance.

Hi rob91,

Please open a new topic if this is still an issue. Thanks.