Hi!
We trained a Faster RCNN model with a ResNet18 backbone in the TLT 3.0 container; training, evaluation tests, and inference work perfectly with INT8 calibration. Here is the TLT config:
faster_rcnn_config.txt (4.3 KB)
We export the model to an .etlt file and name our output tensor NMS with the -o option. After that we build the engine with the tlt-converter tool on a Jetson NX, passing the calibration options and input sizes; that step seems OK. We are using TensorRT in a C++ environment to run inference. The tensor input/output sizes are:
0 - input_image: 3 x 1080 x 1920
1 - NMS: 1 x 100 x 7
2 - NMS_1: 1 x 1 x 1
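(These bindings can be dumped with the TensorRT 7 binding API; a minimal sketch, assuming engine is the deserialized nvinfer1::ICudaEngine* and that <NvInfer.h> and <iostream> are included:)
for (int i = 0; i < engine->getNbBindings(); ++i){
    nvinfer1::Dims dims = engine->getBindingDimensions(i);
    std::cout << i << " - " << engine->getBindingName(i)
              << (engine->bindingIsInput(i) ? " (input): " : " (output): ");
    for (int d = 0; d < dims.nbDims; ++d)
        std::cout << dims.d[d] << (d + 1 < dims.nbDims ? " x " : "\n");
}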
At this point we get fewer detections than with the inference inside the TLT container, and we think the problem is the image preprocessing before the data is copied into the input tensor. We have an OpenCV Mat (image) holding the RGB source image.
As a result we get only 20-30% of the detections compared with the TLT tests. As you can see below, we reverse the RGB order to BGR, subtract the per-channel mean, and divide by 1.0, as the TLT documentation specifies for the input_image_config parameters.
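In OpenCV terms, the transform we are trying to apply should be equivalent to this sketch (our assumptions: image is a CV_8UC3 RGB Mat already at the network resolution of 1920 x 1080, hostDataBuffer / C / H / W are the same variables as in the loop below, and <opencv2/imgproc.hpp>, <vector> and <cstring> are included):
cv::Mat bgr;
cv::cvtColor(image, bgr, cv::COLOR_RGB2BGR);        // reverse RGB -> BGR
bgr.convertTo(bgr, CV_32FC3);                       // to float (scale factor 1.0)
bgr -= cv::Scalar(103.939f, 116.779f, 123.68f);     // subtract the B, G, R channel means
std::vector<cv::Mat> planes(3);
cv::split(bgr, planes);                             // interleaved HWC -> planar CHW
for (int c = 0; c < C; ++c)
    std::memcpy(hostDataBuffer + c * H * W, planes[c].ptr<float>(), H * W * sizeof(float));
The per-pixel loop we actually use to fill the input tensor is: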
float* hostDataBuffer = static_cast<float*>(buffers.getHostBuffer("input_image"));
// Per-channel means in BGR order (B, G, R), as in the TLT input_image_config.
float pixelMean[3]{103.939f, 116.779f, 123.68f};
for (int i = 0, volImg = C * H * W; i < 1; ++i){           // batch of 1
    for (int c = 0; c < C; ++c){                           // destination channel, BGR planar
        for (unsigned j = 0, volChl = H * W; j < volChl; ++j){
            // Read the interleaved RGB source in reverse channel order (RGB -> BGR),
            // subtract the channel mean, and divide by the scale factor 1.0.
            hostDataBuffer[i * volImg + c * volChl + j] =
                (float(image.data[j * C + 2 - c]) - pixelMean[c]) / 1.0F;
        }
    }
}
buffers.copyInputToDevice();
// Synchronous inference with batch size 1.
bool status = context->execute(1, buffers.getDeviceBindings().data());
buffers.copyOutputToHost();
const float* nms = static_cast<const float*>(buffers.getHostBuffer("NMS"));
// Each of the 100 kept detections is a row of 7 floats; index 3 is read as x1.
for (int det_id = 0; det_id < 100; det_id++){
    float x1 = nms[det_id * 7 + 3];
}
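For completeness, if the NMS plugin follows the usual TensorRT DetectionOutput layout (7 floats per detection: image_id, class label, confidence, xmin, ymin, xmax, ymax) and NMS_1 holds the int32 keep count, the full read-out would look roughly like this sketch (the coordinates may be normalized to [0, 1] depending on the plugin configuration; please correct us if the layout is different):
const float* dets      = static_cast<const float*>(buffers.getHostBuffer("NMS"));
const int*   keepCount = static_cast<const int*>(buffers.getHostBuffer("NMS_1"));  // assumed int32
for (int det_id = 0; det_id < keepCount[0]; ++det_id){
    const float* det = dets + det_id * 7;
    int   label      = static_cast<int>(det[1]);
    float confidence = det[2];
    float x1 = det[3], y1 = det[4], x2 = det[5], y2 = det[6];
    if (confidence < 0.5f) continue;   // example threshold
    // ... keep (label, confidence, x1, y1, x2, y2)
}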
Can anyone help?
Thanks
TensorRT Version: 4.
GPU Type: Jetson Xavier NX
JetPack: 4.6 (L4T 32.6)
Cuda: 10.2
cuDNN: 8.2.1
TensorRT: 7.2
Operating System + Version: Ubuntu 18.04 + JetPack