Please provide complete information as applicable to your setup.
• Hardware Platform (Jetson / GPU): GPU GTX 1080
• DeepStream Version: 6.0
• TensorRT Version: 8.0.1
• NVIDIA GPU Driver Version (valid for GPU only): 495.29.05
• Issue Type (questions, new requirements, bugs): Bug?
• How to reproduce the issue?
Build a custom DeepStream pipeline using the Python bindings for object detection, drawing bounding boxes from the tensor output meta.
Take a PyTorch YOLOv5 model and optimize it with TensorRT.
Configure the DeepStream pipeline to use the Triton Inference Server and have it load the TensorRT YOLOv5 model.
Run the inference pipeline.
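For context, the pipeline layout is roughly the sketch below. This is a minimal, hypothetical version (file names, stream-mux dimensions, and the nvinferserver config path are placeholders, not our actual code), assuming the DeepStream 6.0 Python/GStreamer bindings and a Triton model repository that already serves the TensorRT YOLOv5 engine:

```python
# Minimal sketch of the pipeline layout (hypothetical names/paths), assuming
# DeepStream 6.0 with the Python (GStreamer) bindings and a Triton model repo
# that serves the TensorRT YOLOv5 engine. The nvinferserver config is expected
# to enable output_tensor_meta so raw tensors are attached for Python parsing.
import gi
gi.require_version("Gst", "1.0")
from gi.repository import Gst

Gst.init(None)

pipeline = Gst.parse_launch(
    "filesrc location=sample_720p.mp4 ! decodebin ! "
    "m.sink_0 nvstreammux name=m batch-size=1 width=1280 height=720 ! "
    "nvinferserver config-file-path=config_infer_triton_yolov5.txt ! "
    "nvvideoconvert ! nvdsosd ! nveglglessink"
)

# Bounding boxes are drawn from the raw tensor output meta in a probe attached
# to the nvinferserver (PGIE) src pad; see the callback sketch further below.
pipeline.set_state(Gst.State.PLAYING)
bus = pipeline.get_bus()
bus.timed_pop_filtered(Gst.CLOCK_TIME_NONE,
                       Gst.MessageType.ERROR | Gst.MessageType.EOS)
pipeline.set_state(Gst.State.NULL)
```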
When there are target objects in the video, they are detected correctly. Later, when the objects leave the scene, the network still outputs boxes from past detections (ghost bounding boxes) until new valid objects come into the scene. The ghost boxes are not random; they repeat old detections.
The TensorRT-optimized model has been tested with the Triton server alone and works fine. The problem only appears when we introduce DeepStream.
I can’t share the model or the complete pipeline at this moment. I know that makes it difficult to reproduce.
Maybe I can work on a reduced version of the code and find a public model to test.
The model is based on:
Please bear with me; maybe you can help me identify the origin of this behavior.
If I had to guess, I would call it some issue with memory management between DeepStream, the Triton TensorRT backend, and the YOLO plugin.
We are still debugging this. It may be related to a TensorRT minor version change, or something like that, affecting the yolo.so lib file. So far we couldn't find where it comes from. Thanks.
Yes, we still have this issue, although it's difficult to tell whether the problem is inside DeepStream/Triton or external to them.
In DeepStream we have rewritten our object detection code for other reasons. It works fine for all models except this one. Our code lives in the PGIE callback of DeepStream that post-processes the tensor meta output in Python.
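To make that concrete, the callback is shaped roughly like the sketch below. It is a trimmed, hypothetical version modeled on the deepstream-ssd-parser Python sample (the YOLOv5 box decoding itself is omitted); it only shows where the raw output layers are read from the tensor output meta:

```python
# Sketch of a PGIE src-pad probe reading NVDSINFER_TENSOR_OUTPUT_META in Python,
# modeled on the deepstream-ssd-parser sample (not our actual post-processing code).
import ctypes
import gi
gi.require_version("Gst", "1.0")
from gi.repository import Gst
import pyds


def pgie_src_pad_buffer_probe(pad, info, u_data):
    gst_buffer = info.get_buffer()
    if not gst_buffer:
        return Gst.PadProbeReturn.OK

    batch_meta = pyds.gst_buffer_get_nvds_batch_meta(hash(gst_buffer))
    l_frame = batch_meta.frame_meta_list
    while l_frame is not None:
        frame_meta = pyds.NvDsFrameMeta.cast(l_frame.data)
        l_user = frame_meta.frame_user_meta_list
        while l_user is not None:
            user_meta = pyds.NvDsUserMeta.cast(l_user.data)
            if user_meta.base_meta.meta_type == pyds.NvDsMetaType.NVDSINFER_TENSOR_OUTPUT_META:
                tensor_meta = pyds.NvDsInferTensorMeta.cast(user_meta.user_meta_data)
                for i in range(tensor_meta.num_output_layers):
                    layer = pyds.get_nvds_LayerInfo(tensor_meta, i)
                    # Raw float pointer to this output layer's data.
                    ptr = ctypes.cast(pyds.get_ptr(layer.buffer),
                                      ctypes.POINTER(ctypes.c_float))
                    # ...decode YOLOv5 detections from ptr and attach display meta...
            l_user = l_user.next
        l_frame = l_frame.next
    return Gst.PadProbeReturn.OK
```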
YOLOv5 is optimized with TensorRT based on the YOLO links above: one for the model, the other for the yololayer.cu file. We are using TensorRT 8.0.1. The Triton Server loads the yolo.so plugin library.
It looks like somewhere in the code, when there are no active objects in the frame, old detections are still present in the output buffer and are parsed as valid.
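One way to get exactly this symptom would be a parser that reads a fixed number of output slots instead of the detection count that the tensorrtx-style yololayer writes as the first element of the output, since the rest of the buffer is not necessarily cleared between inferences. A minimal, count-aware parsing sketch, assuming that layout (one leading count float followed by fixed 6-float records: box, confidence, class id); the constants and function name here are assumptions, not our actual code:

```python
# Hedged sketch: count-aware parsing of a tensorrtx-style YOLO output layer.
# Assumes the layout [num_detections, det_0, det_1, ...] where each detection
# is 6 floats (cx, cy, w, h, confidence, class_id). Slots past num_detections
# may hold leftover data from earlier frames and must not be parsed.
import ctypes
import pyds

DET_SIZE = 6                   # floats per detection record (assumed)
MAX_OUTPUT_BBOX_COUNT = 1000   # plugin build-time constant (assumed)


def parse_yolo_layer(layer_buffer, conf_threshold=0.4):
    ptr = ctypes.cast(pyds.get_ptr(layer_buffer), ctypes.POINTER(ctypes.c_float))
    count = min(int(ptr[0]), MAX_OUTPUT_BBOX_COUNT)
    detections = []
    for i in range(count):
        base = 1 + i * DET_SIZE
        cx, cy, w, h = ptr[base], ptr[base + 1], ptr[base + 2], ptr[base + 3]
        conf = ptr[base + 4]
        class_id = int(ptr[base + 5])
        if conf >= conf_threshold:
            detections.append((cx, cy, w, h, conf, class_id))
    return detections
```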
After some time, I think the problem is related to some mismatch between TensorRT and the yololayer.cu source code, which is outside NVIDIA's domain, so that's why I closed the topic.
I am doing custom post-processing using Python.
I succeeded in doing that, but I have a memory leak issue. @xtianhb.glb, do you have the same issue too? I am currently working on re-implementing the model using nvinfer instead of nvinferserver.
It looks like we are using the same sources for YOLOv5 and building similar pipelines (Python, NvInferServer, etc.). That is great; we are on the same page.
In my case the model runs almost correctly; it doesn't crash or anything. The only problem I noticed is that old bounding boxes show up when there are no new objects. When new objects enter the image, the old ones disappear one by one.
I have the feeling that this is related to a circular memory buffer, or some memory leak, as you said.