Hi,
I am running multiple detection engines with TensorRT on Jetson Xavier
The plan is to have GPU and NVDLA running in different threads of the same process to reduce process time.
But what I got is GPU and NVDLA are working in turn, and execution time increased.
Is there anything that I should pay attention to?