TAO-Converter TRT engine inference results are blank

Sorry for this question; I’ve looked around the forum and haven’t been able to fix this issue yet.

I’ve trained a RetinaNet model via the TAO framework and tested its inference, and it produced satisfactory results.

However, when I exported the .tlt model, converted it to a TensorRT engine, and ran inference on that engine, the results were empty.
Potential problems I’m contemplating:

  1. What is the actual image preprocessing before feeding it to the network? Where can I find it?
  2. Maybe I’m not converting the model correctly?

Here’s the command I used to export the .tlt model to .etlt (on a workstation with an RTX 2070 Super):

export DOCKER_REGISTRY="nvcr.io"
export DOCKER_NAME="nvidia/tao/tao-toolkit"
export DOCKER_TAG="4.0.0-tf1.15.5"
export DOCKER_CONTAINER=$DOCKER_REGISTRY/$DOCKER_NAME:$DOCKER_TAG

export KEY="nvidia_tlt"
export EXPERIMENT_NAME="mobilenet_v2_qat_2"
export SPEC_FILE="retinanet_train_mobilenet_v2.txt"
export EXPORT_EPOCH="022"
docker run -it --rm --gpus all --ipc=host --ulimit memlock=-1 --ulimit stack=67108864 -v /home/developer/apps/tao-training/workspace:/workspace \
$DOCKER_CONTAINER \
retinanet export   -o /workspace/retinanet/models/$EXPERIMENT_NAME/export/retinanet_mobilenet_v2_epoch_$EXPORT_EPOCH.etlt \
                         -e /workspace/retinanet/specs/$SPEC_FILE \
                         -m /workspace/retinanet/models/$EXPERIMENT_NAME/weights/retinanet_mobilenet_v2_epoch_$EXPORT_EPOCH.tlt \
                         -k $KEY \
                          --data_type int8 \
                          --cal_cache_file /workspace/retinanet/models/$EXPERIMENT_NAME/export/cal.bin \
                          --cal_json_file /workspace/retinanet/models/$EXPERIMENT_NAME/export/cal.json   

Here’s the command I used to convert the .etlt model to a TensorRT engine (run on an Orin AGX, using nvcr.io/nvidia/l4t-ml:r35.2.1-py3 as the base image):

./tao-converter -k $KEY \
            -d 3,544,736 \
            -o NMS \
            -m 1 \
            -t int8 \
            -c /workspaces/convert_to_trt/etlt_models/sorter_object_detection/retinanet/tao_etlt_retinanet_mobilenet_v2_int8/cal.bin \
            -e /workspaces/convert_to_trt/etlt_models/sorter_object_detection/retinanet/tao_etlt_retinanet_mobilenet_v2_int8/retinanet_mobilenet_v2_epoch_022.engine \
            /workspaces/convert_to_trt/etlt_models/sorter_object_detection/retinanet/tao_etlt_retinanet_mobilenet_v2_int8/retinanet_mobilenet_v2_epoch_022.etlt
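
As a quick sanity check after conversion, the engine’s binding names and shapes can be listed with the TensorRT Python API. This is only a minimal sketch, assuming TensorRT 8.x and the engine path from the command above; a TAO RetinaNet engine is expected to expose an "Input" binding plus the NMS plugin outputs "NMS" and "NMS_1", but verify against your own engine:

import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)
engine_path = "retinanet_mobilenet_v2_epoch_022.engine"  # engine produced by tao-converter above

with open(engine_path, "rb") as f, trt.Runtime(TRT_LOGGER) as runtime:
    engine = runtime.deserialize_cuda_engine(f.read())

# Print every binding: name, shape, dtype, and whether it is an input or an output.
for i in range(engine.num_bindings):
    kind = "input" if engine.binding_is_input(i) else "output"
    print(engine.get_binding_name(i),
          engine.get_binding_shape(i),
          engine.get_binding_dtype(i),
          kind)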

retinanet_train_mobilenet_v2.txt (1.7 KB)
retinanet_mobilenet_v2_epoch_022.tlt (38.2 MB)
retinanet_mobilenet_v2_epoch_022.etlt (20.8 MB)
cal.bin (2.3 KB)
retinanet_mobilenet_v2_epoch_022.engine (13.6 MB)

To narrow down, could you please generate an fp16 or fp32 TensorRT engine and try again?

I tried with fp32 and the inference results are still empty :(

./tao-converter -k $KEY \
            -d 3,544,736 \
            -o NMS \
            -e /workspaces/convert_to_trt/etlt_models/sorter_object_detection/retinanet/tao_etlt_retinanet_mobilenet_v2_no_data_type/retinanet_mobilenet_v2_epoch_022.engine \
            /workspaces/convert_to_trt/etlt_models/sorter_object_detection/retinanet/tao_etlt_retinanet_mobilenet_v2_no_data_type/retinanet_mobilenet_v2_epoch_022.etlt  

How did you run inference?

Here’s the code I use to run inference:
test_detection_trt_engine.py (4.7 KB)
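
For context, a typical TensorRT Python inference loop for an engine like this looks roughly like the sketch below. This is not the attached script; the binding names ("Input", "NMS", "NMS_1") and the explicit-batch execute_async_v2 call are assumptions, so adjust them to what the engine actually reports (see the binding-listing sketch above). For an implicit-batch (UFF-based) engine, context.execute_async(batch_size=1, ...) would be used instead.

import numpy as np
import pycuda.autoinit  # noqa: F401  (creates a CUDA context)
import pycuda.driver as cuda
import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)

with open("retinanet_mobilenet_v2_epoch_022.engine", "rb") as f, trt.Runtime(TRT_LOGGER) as rt:
    engine = rt.deserialize_cuda_engine(f.read())
context = engine.create_execution_context()

# Allocate pinned host buffers and device buffers for every binding.
buffers, bindings = {}, []
for i in range(engine.num_bindings):
    dtype = trt.nptype(engine.get_binding_dtype(i))
    host = cuda.pagelocked_empty(trt.volume(engine.get_binding_shape(i)), dtype)
    dev = cuda.mem_alloc(host.nbytes)
    buffers[engine.get_binding_name(i)] = (host, dev)
    bindings.append(int(dev))

# 'image' must already be preprocessed (resized, mean-subtracted, CHW float32);
# see the preprocessing notes further down in the thread.
image = np.zeros((1, 3, 544, 736), dtype=np.float32)  # placeholder input
np.copyto(buffers["Input"][0], image.ravel())

stream = cuda.Stream()
for name, (host, dev) in buffers.items():
    if engine.binding_is_input(engine.get_binding_index(name)):
        cuda.memcpy_htod_async(dev, host, stream)
context.execute_async_v2(bindings=bindings, stream_handle=stream.handle)
for name, (host, dev) in buffers.items():
    if not engine.binding_is_input(engine.get_binding_index(name)):
        cuda.memcpy_dtoh_async(host, dev, stream)
stream.synchronize()

detections = buffers["NMS"][0]    # flattened detections (see postprocessing notes below)
keep_count = buffers["NMS_1"][0]  # number of valid detections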

Could you try to run inference with the tao deploy docker?
Refer to RetinaNet with TAO Deploy - NVIDIA Docs

@Morganh I’m not sure what the game plan is here… I need to be able to run inference on a Jetson platform (Orin AGX) using Python, and I believe TAO Deploy doesn’t work on Jetson platforms yet…
Can you please elaborate on the thought process for troubleshooting this issue, what could have gone wrong, and any reference examples/documents that could help?
My initial thoughts were:

  1. The export step was not run correctly → I don’t understand how the model dimensions get set; where does the command get them from? (see the spec snippet after this list)
  2. tao-converter was not run correctly
  3. The preprocessing step before running inference is wrong → the only thing I found was to make sure the image is CHW. Is there a proper example that demonstrates the preprocessing? Is it dependent on the backbone/encoder?
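
For context on point 1: for RetinaNet the input resolution is fixed at training time and typically comes from the augmentation_config block of the training spec, so the -d 3,544,736 passed to tao-converter has to match it. A hedged illustration of that block follows; the values are assumptions inferred from the dimensions used above, and the attached retinanet_train_mobilenet_v2.txt has the actual ones:

augmentation_config {
  output_width: 736     # width    -> last value of -d 3,544,736
  output_height: 544    # height   -> middle value of -d 3,544,736
  output_channel: 3     # channels -> first value of -d 3,544,736
}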

Officially, there are two ways to run inference outside the TAO docker.
One is deepstream_tao_apps. Another is tao-toolkit-triton-apps.
For RetinaNet, you can refer to GitHub - NVIDIA-AI-IOT/deepstream_tao_apps: Sample apps to demonstrate how to deploy models trained with TAO on DeepStream. The preprocessing parameters are at https://github.com/NVIDIA-AI-IOT/deepstream_tao_apps/blob/master/configs/retinanet_tao/pgie_retina_tao_config.txt#L25-L27
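
In Python terms, the DeepStream preprocessing referenced above amounts to y = net-scale-factor * (x - offsets) on a resized BGR image in CHW order. A minimal sketch, assuming a scale factor of 1.0 and ImageNet BGR mean offsets; take the actual values, and the color format, from the config lines linked above:

import cv2
import numpy as np

def preprocess(image_path,
               width=736, height=544,
               net_scale_factor=1.0,                     # assumption: use the value from the linked config
               offsets=(103.939, 116.779, 123.68)):      # assumption: per-channel BGR offsets from the linked config
    """Approximate the DeepStream preprocessing: y = net_scale_factor * (x - offsets)."""
    img = cv2.imread(image_path)                         # OpenCV loads BGR, HWC, uint8
    img = cv2.resize(img, (width, height)).astype(np.float32)
    img -= np.array(offsets, dtype=np.float32)           # per-channel mean subtraction
    img *= net_scale_factor
    img = img.transpose(2, 0, 1)                         # HWC -> CHW
    return np.ascontiguousarray(img[np.newaxis])         # add batch dim -> NCHW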

Also, you can refer to the link below.
https://github.com/NVIDIA-AI-IOT/tao-toolkit-triton-apps/blob/ae6b5ec41c3a9651957c4dddfc262a43f47e263c/tao_triton/python/entrypoints/tao_client.py#L323

The postprocessing is in https://github.com/NVIDIA-AI-IOT/tao-toolkit-triton-apps/blob/ae6b5ec41c3a9651957c4dddfc262a43f47e263c/tao_triton/python/postprocessing/retinanet_postprocessor.py
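
For reference, the linked postprocessor decodes the NMS plugin output, which is a flat list of 7-value detections plus a keep count (NMS_1). A hedged sketch of that decoding step; the field order [image_id, label, confidence, xmin, ymin, xmax, ymax] and the normalized coordinates are assumptions, so confirm them against retinanet_postprocessor.py:

import numpy as np

def decode_detections(nms_out, keep_count, model_w=736, model_h=544, conf_threshold=0.3):
    """Decode the flattened NMS plugin output into (label, confidence, box) tuples."""
    dets = np.array(nms_out, dtype=np.float32).reshape(-1, 7)[: int(keep_count)]
    results = []
    for _image_id, label, conf, xmin, ymin, xmax, ymax in dets:
        if conf < conf_threshold:
            continue
        # Coordinates are assumed normalized to [0, 1]; scale to the model input size.
        box = (xmin * model_w, ymin * model_h, xmax * model_w, ymax * model_h)
        results.append((int(label), float(conf), box))
    return results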

Thank you for the pointer. Got it working :)

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.