Why can't enqueueV2() calls in different threads execute concurrently?

We have 3 TensorRT models that run inference on the same image input. All 3 inference outputs are needed simultaneously for the next processing stage, so each model is loaded in a different thread and has its own engine and execution context.
We find that the total time for concurrent enqueueV2() calls in the 3 threads is equal to that of sequential enqueueV2() calls for the 3 models in a single thread. It seems that multi-threading does not improve performance. Why?

The application runs in a Docker container.

Hardware: RTX 3090
CUDA: 11.0
TensorRT: 8.0.4
OS: Ubuntu 18.04
Docker: 19.03

Moved to TensorRT forum.


Could you please share a minimal reproduction script/model with us so we can try it on our end for better debugging?

Thank you.