DetectNet_V2 bbox

Description

I trained DetectNet_V2 with the Transfer Learning Toolkit on the KITTI dataset (it was very convenient).

Now I want to use this model with TensorRT (not with DeepStream) in my own C++ application.
The TensorRT engine has two outputs: output_cov/Sigmoid and output_bbox/BiasAdd.
I want to get probabilities and bounding boxes.
As I understand it, I must use some variant of the DBSCAN algorithm on output_cov/Sigmoid and NMS on output_bbox/BiasAdd.
Are there any C++ code sources for postprocessing these outputs (especially for output_cov/Sigmoid)?
Another thing I can't find: does this model require any input preprocessing (e.g., mean subtraction)?

Environment

nvcr.io/nvidia/tlt-streamanalytics:v2.0_dp_py2

Please refer to the sample below in case it helps:
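Regarding the preprocessing question: as far as I know, DetectNet_V2 models trained in TLT expect RGB input in planar NCHW layout, scaled to [0, 1] (net-scale-factor of roughly 0.0039 in the DeepStream configs) with no mean subtraction, but please verify this against your training spec. A minimal sketch (inputW/inputH stand for your network resolution):

// Sketch: BGR cv::Mat -> planar NCHW float buffer, assuming 1/255 scaling
// and no mean offset (the usual DetectNet_V2 convention).
#include <opencv2/opencv.hpp>

void preprocess(const cv::Mat &bgr, float *inputData, int inputW, int inputH)
{
cv::Mat resized, rgb;
cv::resize(bgr, resized, cv::Size(inputW, inputH));
cv::cvtColor(resized, rgb, cv::COLOR_BGR2RGB);

// HWC uint8 -> CHW float in [0, 1]
for (int c = 0; c < 3; ++c)
for (int y = 0; y < inputH; ++y)
for (int x = 0; x < inputW; ++x)
inputData[c * inputH * inputW + y * inputW + x] = rgb.at<cv::Vec3b>(y, x)[c] / 255.0f;
}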

Thanks

I successfully received output_cov/Sigmoid, but I can't get the right bboxes.
Top: my boxes (all predicted boxes from output_bbox/BiasAdd); bottom: the TLT result.

CODE:
outputIndex1 = engine.getBindingIndex("output_bbox/BiasAdd");
float box1[4 * 24 * 78];
cudaMemcpyAsync(box1, buffers[outputIndex1], 4 * 24 * 78 * sizeof(float), cudaMemcpyDeviceToHost, stream);
cudaStreamSynchronize(stream);

/// Postprocessing (assumes 4 interleaved values per cell and a 16-px stride)

for (int i = 0; i < 24 * 78; ++i)
{
int dx = i % 78;
int dy = i / 78;
float x1 = (box1[i * 4 + 0] + dx) * 16;
float y1 = (box1[i * 4 + 1] + dy) * 16;
float x2 = (box1[i * 4 + 2] + dx) * 16;
float y2 = (box1[i * 4 + 3] + dy) * 16;
cv::rectangle(image, cv::Point(x1, y1), cv::Point(x2, y2), cv::Scalar(0, 255, 0), 1, 8, 0);
}
Something is wrong in my output_bbox/BiasAdd postprocessing.
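My current guess is the memory layout: output_bbox/BiasAdd may be planar (an x1 plane, then y1, x2, y2 planes of 24 x 78 each) rather than 4 interleaved values per cell, and the raw values may need de-normalization instead of a plain stride-16 offset. A sketch of the planar indexing I mean (same 4 x 24 x 78 output assumed):

// Sketch: planar (CHW) indexing of output_bbox/BiasAdd for a single class.
const int gridW = 78, gridH = 24, gridSize = gridW * gridH;
for (int i = 0; i < gridSize; ++i)
{
float x1 = box1[0 * gridSize + i]; // plane 0: x1
float y1 = box1[1 * gridSize + i]; // plane 1: y1
float x2 = box1[2 * gridSize + i]; // plane 2: x2
float y2 = box1[3 * gridSize + i]; // plane 3: y2
// the raw offsets still need de-normalization before drawing
}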

Regards.

Please refer to the sample below:

Thanks

Hi mazitov

Have you found the proper way to postprocess the output? I would really appreciate it if you could share your code.

Regards

#include <cmath>
#include <vector>
#include <opencv2/opencv.hpp>

#include <src/cnn/nvdsinfer.h>
#include <src/cnn/nvdsinfer_dbscan.h>

/* Assign each clustered box the classId of the nearest box (by center
distance) from the unclustered list. */
void Cnn::assignClass(std::vector<NvDsInferObjectDetectionInfo> &objectList,
std::vector<NvDsInferObjectDetectionInfo> &objectListRes, int num)
{
int objNum = objectList.size();
if (objNum < 1)
return;

for (int i = 0; i < num; ++i)
{
objectListRes[i].classId = objectList[0].classId;
float ax = objectListRes[i].left + objectListRes[i].width / 2;
float ay = objectListRes[i].top + objectListRes[i].height / 2;

float bx = objectList[0].left + objectList[0].width / 2;
float by = objectList[0].top + objectList[0].height / 2;

float dist = sqrt((ax - bx) * (ax - bx) + (ay - by) * (ay - by));

for (int j = 1; j < objNum; ++j)
{
bx = objectList[j].left + objectList[j].width / 2;
by = objectList[j].top + objectList[j].height / 2;

float distItr = sqrt((ax - bx) * (ax - bx) + (ay - by) * (ay - by));
if (dist > distItr)
{
dist = distItr;
objectListRes[i].classId = objectList[j].classId;
}

}
}
}

void Cnn::doInference(IExecutionContext& context, float* inputData, cudaStream_t &stream)
{

/* Copy the input to the device and run inference */
cudaMemcpyAsync(buffers[inputIndex0], inputData, batch_size * INPUT_D * sizeof(float), cudaMemcpyHostToDevice, stream);
context.enqueue(batch_size, buffers, stream, nullptr);

std::vector<float> probs(classNum * OUT_DIM_H * OUT_DIM_W);
std::vector<float> boxs(classNum * 4 * OUT_DIM_H * OUT_DIM_W);

cudaMemcpyAsync(probs.data(), buffers[outputIndex0], probs.size() * sizeof(float), cudaMemcpyDeviceToHost, stream);
cudaMemcpyAsync(boxs.data(), buffers[outputIndex1], boxs.size() * sizeof(float), cudaMemcpyDeviceToHost, stream);
cudaStreamSynchronize(stream);

std::vector<NvDsInferObjectDetectionInfo> objectList;

int gridW = OUT_DIM_W;
int gridH = OUT_DIM_H;
int gridSize = gridW * gridH;
std::vector<float> gcCentersX(gridW);
std::vector<float> gcCentersY(gridH);
float bboxNormX = 35.0f; /* DetectNet_V2 bbox normalization scale */
float bboxNormY = 35.0f;
float *outputCovBuf = probs.data();
float *outputBboxBuf = boxs.data();

int strideX = DIVIDE_AND_ROUND_UP(INPUT_W, gridW);
int strideY = DIVIDE_AND_ROUND_UP(INPUT_H, gridH);

for (int i = 0; i < gridW; i++)
{
gcCentersX[i] = (float) (i * strideX + 0.5);
gcCentersX[i] /= (float) bboxNormX;

}
for (int i = 0; i < gridH; i++)
{
gcCentersY[i] = (float) (i * strideY + 0.5);
gcCentersY[i] /= (float) bboxNormY;
}

for (int c = 0; c < classNum; c++)
{
/* Planar per-class layout: x1, y1, x2, y2 planes of gridSize each */
float *outputX1 = outputBboxBuf + (c * 4 * gridW * gridH);
float *outputY1 = outputX1 + gridSize;
float *outputX2 = outputY1 + gridSize;
float *outputY2 = outputX2 + gridSize;

float threshold = 0.02f; /* coverage (output_cov) threshold */

for (int h = 0; h < gridH; h++)
{
for (int w = 0; w < gridW; w++)
{
int i = w + h * gridW;
if (outputCovBuf[c * gridSize + i] >= threshold)
{

NvDsInferObjectDetectionInfo object;
object.classId = c;
object.detectionConfidence = outputCovBuf[c * gridSize + i];

/* Decode offsets relative to the normalized grid-cell centers */
float rectX1f = (outputX1[i] - gcCentersX[w]) * -bboxNormX;
float rectY1f = (outputY1[i] - gcCentersY[h]) * -bboxNormY;
float rectX2f = (outputX2[i] + gcCentersX[w]) * bboxNormX;
float rectY2f = (outputY2[i] + gcCentersY[h]) * bboxNormY;

/* Clip object box co-ordinates to network resolution */
object.left = CLIP(rectX1f, 0, INPUT_W - 1);
object.top = CLIP(rectY1f, 0, INPUT_H - 1);
object.width = CLIP(rectX2f, 0, INPUT_W - 1) - object.left + 1;
object.height = CLIP(rectY2f, 0, INPUT_H - 1) - object.top + 1;

objectList.push_back(object);

}
}
}
}

size_t numObjects = objectList.size();
auto unclusteredObjectList = objectList;
NvDsInferDBScanCluster(DBScan, &DBScanParams, objectList.data(), &numObjects);
assignClass(unclusteredObjectList, objectList, numObjects);

for (size_t i = 0; i < numObjects; ++i)
{
auto object = objectList[i];
cv::Scalar color(255, 255, 0);
if (object.classId == 0)
color = cv::Scalar(0, 255, 0);
else if (object.classId == 1)
color = cv::Scalar(255, 0, 0);
else if (object.classId == 2)
color = cv::Scalar(0, 0, 255);

cv::rectangle(visualize, cv::Rect(object.left, object.top, object.width, object.height), color, 2, 8, 0);
}
}
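
Note: DBScan and DBScanParams above are class members. A minimal sketch of how they can be set up with the nvdsinfer_dbscan.h API (the parameter values are illustrative, not tuned):

// Sketch: create the DBSCAN handle and clustering params used in doInference().
// eps/minBoxes are illustrative starting points; tune them for your model.
NvDsInferDBScanHandle DBScan = NvDsInferDBScanCreate();
NvDsInferDBScanClusteringParams DBScanParams = {};
DBScanParams.eps = 0.7f; // neighborhood radius
DBScanParams.minBoxes = 3; // minimum boxes to form a cluster
// ... run NvDsInferDBScanCluster(...), then release the handle:
NvDsInferDBScanDestroy(DBScan);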


Hi mazitov

Thanks for sharing your code.
Where is the source code of this method?

NvDsInferDBScanCluster

I did find the header file in the DeepStream SDK, but I can't find the source code.