TensorRT engine giving wrong/different output in DeepStream

There seems to be a bug in the open source code you are following. The kernels in plugin layer should use the same incoming cuda stream which it receives in the call to “enqueue”. This will ensure in-order execution of all the kernels in the entire network.