Is it possible to use a single instance of tensorrt context/CudaEngine with multiple streams concurrently?
In our problem, we don’t know the batchSize a priori; it depends on a value set in a configuration file. As a result, at runtime the batchSize might exceed the maxBatchSize used when serializing the engine.
I was hoping I could split the batch into N chunks of at most maxBatchSize each and process them in parallel, one per stream. However, I get incorrect results back unless I put a cudaDeviceSynchronize between the context->enqueue invocations.
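For what it's worth, my understanding is that a single IExecutionContext holds per-invocation state, so concurrent enqueue() calls on the same context can race; sharing the ICudaEngine across streams is fine, but each stream needs its own context (and its own binding buffers). A sketch of what I mean, assuming the batch-oriented enqueue() API and pre-allocated per-chunk device buffers (runBatches and chunkBindings are hypothetical names):

```cpp
// Sketch only: assumes a pre-built ICudaEngine and the implicit-batch
// enqueue(batchSize, bindings, stream, event) API.
#include <vector>
#include <cuda_runtime.h>
#include <NvInfer.h>

void runBatches(nvinfer1::ICudaEngine* engine,
                int numChunks, int maxBatchSize,
                std::vector<std::vector<void*>>& chunkBindings)
{
    std::vector<nvinfer1::IExecutionContext*> contexts(numChunks);
    std::vector<cudaStream_t> streams(numChunks);

    // One context per stream: contexts hold per-invocation state, so a
    // single shared context cannot service concurrent enqueues safely.
    for (int i = 0; i < numChunks; ++i) {
        contexts[i] = engine->createExecutionContext();
        cudaStreamCreate(&streams[i]);
    }

    // Launch all chunks; each (context, stream) pair is independent,
    // so no cudaDeviceSynchronize between enqueues is needed.
    for (int i = 0; i < numChunks; ++i) {
        contexts[i]->enqueue(maxBatchSize, chunkBindings[i].data(),
                             streams[i], nullptr);
    }

    // Wait per-stream rather than device-wide.
    for (int i = 0; i < numChunks; ++i) {
        cudaStreamSynchronize(streams[i]);
        cudaStreamDestroy(streams[i]);
        contexts[i]->destroy();
    }
}
```

In a real service you would create the contexts and streams once at startup rather than per call, since createExecutionContext() is comparatively expensive.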