enqueue() and enqueueV2() include the following warning in their documentation:
"Calling enqueueV2() from the same IExecutionContext object with different CUDA streams concurrently results in undefined behavior. To perform inference concurrently in multiple streams, use one execution context per stream."
enqueueV3()'s documentation does not. Should it? Are there any locking or other performance concerns I should be aware of?
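For context, here is a minimal sketch of the pattern I assume the warning is prescribing, applied to enqueueV3(): one IExecutionContext per CUDA stream. It assumes a built nvinfer1::ICudaEngine and pre-allocated per-stream device buffers; "input" and "output" are placeholder tensor names, and error handling is omitted.

    // One-context-per-stream pattern with enqueueV3().
    #include <NvInfer.h>
    #include <cuda_runtime_api.h>
    #include <memory>
    #include <vector>

    void inferConcurrently(nvinfer1::ICudaEngine& engine,
                           std::vector<void*> const& inputDev,   // one input buffer per stream
                           std::vector<void*> const& outputDev,  // one output buffer per stream
                           int numStreams)
    {
        std::vector<std::unique_ptr<nvinfer1::IExecutionContext>> contexts;
        std::vector<cudaStream_t> streams(numStreams);

        for (int i = 0; i < numStreams; ++i)
        {
            // One context per stream, per the documented guidance for enqueueV2().
            contexts.emplace_back(engine.createExecutionContext());
            cudaStreamCreate(&streams[i]);

            // enqueueV3() reads tensor addresses from the context rather than
            // taking a bindings array; tensor names here are placeholders.
            contexts[i]->setTensorAddress("input", inputDev[i]);
            contexts[i]->setTensorAddress("output", outputDev[i]);
        }

        // Each enqueueV3() call uses its own context, so no context is ever
        // enqueued on two streams concurrently.
        for (int i = 0; i < numStreams; ++i)
        {
            contexts[i]->enqueueV3(streams[i]);
        }

        for (int i = 0; i < numStreams; ++i)
        {
            cudaStreamSynchronize(streams[i]);
            cudaStreamDestroy(streams[i]);
        }
    }

Note that the buffers are also per-stream in this sketch, since sharing one set of device buffers across concurrently executing streams would race regardless of how the contexts are arranged. My question is whether this one-context-per-stream restriction still applies to enqueueV3(), and if so, whether the docs should say so.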