Performing multiple inference in the nvinfer backend

hatake_kakashi · April 8, 2021, 8:46am

Hi!

I’m trying to extend the nvinfer backend to run inference on an input surface more than once. For instance, in nvdsinfer_backend.cpp there’s an ImplicitTrtBackendContext::enqueueBuffer method that queues an input buffer for inference:

    if (!m_Context->enqueue(batchDims.batchSize, bindingBuffers.data(), stream,
            (consumeEvent ? &consumeEvent->ptr() : nullptr)))
    {
        dsInferError("Failed to enqueue inference batch");
        return NVDSINFER_TENSORRT_ERROR;
    }

Here, bindingBuffers holds the input and output buffers of the model. My requirement is this: for an input buffer (say an image I1), I would like to perform inference twice, once for I1 and once for flip(I1), where flip is an image flip operation. Finally, I would like to average the two predictions (I think I can figure out this part).

How I imagine this would work:

1. Extract the NvBufSurface, which is the input image.
2. Perform a buffer transform using NvBufferTransform_Flip, setting the flip transformation parameter to NvBufferTransform_FlipX to create a mirror image.
3. Call .enqueue() on this batch.
4. Extract the GPU buffer and perform the average operation.
5. Return the GPU buffer.
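
For illustration, the flip step above amounts to reversing every row of every channel. Below is a minimal host-side sketch, assuming a planar (CHW) float image held in a std::vector. The name flipHorizontalCHW is hypothetical and not part of the DeepStream SDK; in the actual pipeline the same indexing would have to run on device memory rather than on a host vector.

```cpp
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Mirror a planar (CHW) float image left-to-right, in place.
// Hypothetical helper for illustration only; not a DeepStream API.
void flipHorizontalCHW(std::vector<float>& img, std::size_t channels,
                       std::size_t height, std::size_t width)
{
    assert(img.size() == channels * height * width);
    for (std::size_t c = 0; c < channels; ++c) {
        for (std::size_t y = 0; y < height; ++y) {
            float* row = img.data() + (c * height + y) * width;
            // Swap pixels symmetrically around the centre of the row.
            for (std::size_t x = 0; x < width / 2; ++x) {
                std::swap(row[x], row[width - 1 - x]);
            }
        }
    }
}
```

Applied to a 1x2x3 image with rows {1, 2, 3} and {4, 5, 6}, this leaves the rows as {3, 2, 1} and {6, 5, 4}.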

I have worked out in theory how this would work. Now my questions are:

1. Is the approach stated above correct? If not, how should I go about it?
2. How do I access the NvBufSurface in the ImplicitTrtBackendContext::enqueueBuffer method, which uses the CUDA stream to perform inference?

• Hardware Platform: T4 / Jetson NX / Jetson TX2NX
• DeepStream Version: 5.1
• JetPack Version (valid for Jetson only): 4.5.1
• TensorRT Version: 7.2
• NVIDIA GPU Driver Version (valid for GPU only): 455
• Issue Type (questions, new requirements, bugs): question
• Requirement details: Editing nvdsinfer_backend.cpp to perform inference on a given image buffer twice

spolisetty · April 8, 2021, 5:06pm

Hi @hatake_kakashi,

We recommend that you post your query on the DeepStream forum. You may get better help there.

Thank you.

hatake_kakashi · April 8, 2021, 5:11pm

It’s already in the DeepStream forum, right? Am I missing something?

hatake_kakashi · April 10, 2021, 7:48pm

Hi, any update on this?

bcao · April 13, 2021, 7:53am

Hey, we are checking it and will update you ASAP.

hatake_kakashi · April 13, 2021, 9:31am

Thanks, I figured it out. I wrote a custom kernel to perform the flip operation because the FLIP flag in the buffer transform isn’t supported on dGPU. Furthermore, I edited the function that pushes batches to the input thread so that it creates a “duplicate” batch with the new buffers and appends it to the original batch queue. This way every batch was doubled: one part with the original frames and a second with the flipped ones. Later, I wrote a simple probe that parses the tensor metadata and adds the two sets of outputs together. Thanks!
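
The merge step in the probe can be sketched as follows. This is a minimal host-side illustration under stated assumptions, not the code from the actual patch: it assumes the output tensor of a doubled batch has been copied into one contiguous float array, with the rows for the original frames first and the rows for the flipped copies second, in the same order. It also assumes the model output is not spatial (for example, classification scores); detection boxes or segmentation maps would need to be flipped back before merging. The function name averageTtaOutputs is hypothetical, and it averages the two halves rather than summing them.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Average the two halves of a doubled-batch output tensor.
// `output` holds 2*n rows of `perFrame` floats each: rows [0, n) come from
// the original frames, rows [n, 2n) from their flipped copies (assumed layout).
// Hypothetical helper for illustration only; not a DeepStream API.
std::vector<float> averageTtaOutputs(const std::vector<float>& output,
                                     std::size_t perFrame)
{
    assert(perFrame > 0);
    assert(output.size() % (2 * perFrame) == 0);
    const std::size_t half = output.size() / 2;
    std::vector<float> merged(half);
    for (std::size_t i = 0; i < half; ++i) {
        // Element i of an original frame pairs with element i + half,
        // which belongs to the flipped copy of the same frame.
        merged[i] = 0.5f * (output[i] + output[i + half]);
    }
    return merged;
}
```

For two frames with two scores each, the input {1, 3, 5, 7, 3, 5, 7, 9} yields {2, 4, 6, 8}: one averaged row per original frame.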

bcao · April 13, 2021, 3:18pm

Great work! Thanks for sharing. BTW, how do you handle the batch size, since every batch is now twice the size of the original?

hatake_kakashi · April 14, 2021, 12:14pm

My engine supports dynamic batch sizes, so the nvinfer backend worked just fine. In the gst-nvinfer plugin, however, I patched gst_nvinfer_output_loop to “get rid” of the redundant frames of the input:

    if (!batch->frames.empty() && nvinfer->enable_tta) {
        /* TTA is enabled, so half the batch needn't be processed */
        assert(batch->frames.size() % 2 == 0);
        batch->frames.resize(batch->frames.size() / 2);
    }

With this change, other features such as tracking history do not break and continue to work as before.
