TensorRT context.enqueue gives wrong result for all frames other than first

I am trying to run inference on multiple images using the TensorRT API.
A pseudo-code snippet for my application is:

context.enqueue(batchSize, buffers, stream, nullptr);

// buffers[0] holds batchSize * INPUT_C * INPUT_H * INPUT_W input elements
// buffers[1] holds batchSize * outputSize output elements

If I run with batchSize = 1 I get correct output, but with batchSize > 1 the detections for every image other than the first are wrong.
Also, inference time is 7 ms with batchSize = 1 and only around 16 ms with batchSize = 3, so batching is much cheaper than running the images separately; solving this issue would give my application (and others in general) a significant speedup.

Can someone please suggest what I could try in order to solve this issue?
I am allocating memory for "buffers" this way:

for (int b = 0; b < engine.getNbBindings(); b++)
{
    DimsCHW dims = static_cast<DimsCHW&&>(engine.getBindingDimensions(b));
    size_t size = batchSize * dims.c() * dims.h() * dims.w() * sizeof(float);
    std::cout << "size of buff = " << size << std::endl;
    CudaCHECK(cudaMalloc(&buffers[b], size));
}

Is each buffer supposed to be a flat 1D array, or a 2D array indexed by batch?