How does TAO prepare input?

nathaniel.tagg · November 21, 2022, 3:18pm

Hi,
I’m trying to use the engine produced by TAO training on an edge device. That means I am no longer working in the TAO framework, but I need to ensure I’m feeding inputs into the network the same way they were fed during training. How do I figure this out?

For example, in the Mask-RCNN training, there is a setting for data_config.image_size which the documentation simply describes as “indicates the dimension of the resized and padded input”. This size is actually set as the hardcoded input size to the network when running the .engine file, so whatever I feed it has to conform.

Does the scaling preserve aspect ratio? How does a 1920x1080 image get scaled and padded? What are the padded values? (Black? or Zeros after image normalization?)

Similarly, how do I know what image normalization values to use? I’m using some typical values, which seem to work, but it would be good to actually verify.

I’m asking both
a) what are the answers to these questions, and
b) where can I find the source code that actually does these manipulations so I can reverse-engineer them?

Is the code even available? It’s a complex mess with some things handled at the tao host layer and most of the code inside the (undocumented) container.

All of this is aimed at checking the ACTUAL latency of the network on a jetson device - time from image taken to time the inference is available on the GPU. Preprocessing is not an option in this case.

Morganh · November 22, 2022, 8:38am

As mentioned previously, for running inference against the Mask_rcnn TensorRT engine, please refer to the peoplesegnet sample in the NVIDIA-AI-IOT/tao-toolkit-triton-apps repository on GitHub (sample app code for deploying TAO Toolkit trained models to Triton). The postprocessing code can be found in configuring_the_client.md in that repository.
For preprocessing, please refer to frame.py in the same repository.
Peoplesegnet is a purpose-built model based on the Mask_rcnn network, so its code can be reused here.
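
To make the resize-and-pad question concrete, here is a minimal Python sketch of one common convention. It is not taken from the TAO source, and every choice in it is an assumption to be checked against frame.py: the aspect ratio is preserved, the padding goes on the right and bottom, and normalization uses the widely used ImageNet mean/std values. The 1344x832 target is only an example value for data_config.image_size, and the names letterbox_geometry and normalize_pixel are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Letterbox:
    scale: float     # factor applied to both width and height
    new_w: int       # width of the resized image content
    new_h: int       # height of the resized image content
    pad_right: int   # padding columns added on the right (assumed placement)
    pad_bottom: int  # padding rows added at the bottom (assumed placement)


def letterbox_geometry(src_w, src_h, dst_w, dst_h):
    """Aspect-preserving resize into dst_w x dst_h, padding the remainder."""
    # The smaller ratio guarantees the resized image fits in both dimensions.
    scale = min(dst_w / src_w, dst_h / src_h)
    new_w = min(dst_w, round(src_w * scale))
    new_h = min(dst_h, round(src_h * scale))
    return Letterbox(scale, new_w, new_h, dst_w - new_w, dst_h - new_h)


# Assumption: typical ImageNet statistics, NOT confirmed for any TAO model.
IMAGENET_MEAN = (0.485, 0.456, 0.406)
IMAGENET_STD = (0.229, 0.224, 0.225)


def normalize_pixel(rgb, mean=IMAGENET_MEAN, std=IMAGENET_STD):
    """Scale an 8-bit RGB pixel to [0, 1], then subtract mean and divide by std."""
    return tuple((c / 255.0 - m) / s for c, m, s in zip(rgb, mean, std))


geo = letterbox_geometry(1920, 1080, 1344, 832)
print(geo.new_w, geo.new_h, geo.pad_right, geo.pad_bottom)  # 1344 756 0 76
print(normalize_pixel((0, 0, 0)))  # a black pixel is negative after normalization
```

Under these assumptions a 1920x1080 frame is scaled by 0.7 to 1344x756 and padded with 76 rows. The last line shows why the order of padding and normalization matters: padding with black before normalization and padding with zeros after it produce different input values.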

For “All of this is aimed at checking the ACTUAL latency of the network on a jetson device”, the fps is usually checked with /usr/src/tensorrt/bin/trtexec.
On a Jetson device, for example:
/usr/src/tensorrt/bin/trtexec --loadEngine=your_maskrcnn_tensorrt.engine --fp16 --batch=1 --useSpinWait --avgRuns=1000

Then check the “GPU Compute Time” in the log.
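
The trtexec figure covers the network itself. For the end-to-end time from captured image to available result, a rough sketch is to time the whole pipeline in the application. The snippet below is only an illustration: measure_latency_ms and dummy_pipeline are hypothetical names, and the stand-in pipeline would have to be replaced by the real preprocessing plus engine call, which must block until the GPU result is ready.

```python
import statistics
import time


def measure_latency_ms(pipeline, frame, warmup=10, runs=100):
    """Time pipeline(frame) end to end and return summary statistics in ms."""
    # Warm-up iterations let caches, clocks and lazy initialisation settle.
    for _ in range(warmup):
        pipeline(frame)

    samples = []
    for _ in range(runs):
        start = time.perf_counter()
        pipeline(frame)  # must not return before the result is available
        samples.append((time.perf_counter() - start) * 1000.0)

    return {
        "mean": statistics.mean(samples),
        "median": statistics.median(samples),
        "max": max(samples),
    }


def dummy_pipeline(frame):
    # Stand-in for: preprocess -> copy to GPU -> run engine -> synchronize.
    return sum(frame)


stats = measure_latency_ms(dummy_pipeline, list(range(1000)))
print(stats)
```

Reporting the median and maximum alongside the mean helps spot occasional slow frames that an average alone would hide.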

nathaniel.tagg · November 22, 2022, 4:34pm

Thanks, I’ll look at those!

The benchmark you mention seems to give similar results to the one I was using - thanks for that.

system · December 6, 2022, 4:34pm

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.
