TensorRT on Multiple CUDA Streams

Hi.

I’m running an inference application with TensorRT 2.1 on multiple CUDA streams. However, the application shows low stream concurrency. According to my debugging with Visual Profiler, the trtwell_scudnn_128x32_relu_interior_nn kernels launched on the different CUDA streams do not run in parallel; it seems that only one instance of that kernel can run at a time. Is there some mutual exclusion involved?

Because TensorRT appears to involve many CPU-GPU interactions, I created one POSIX thread per CUDA stream so that the CPU-side routines inside TensorRT can run in parallel. Each worker thread repeats the following steps (see the sketch after this list):

  1. sem_wait for a batch input
  2. cudaMemcpyAsync (Host to Device)
  3. nvinfer1::IExecutionContext::enqueue
  4. cudaMemcpyAsync (Device to Host)
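
For reference, here is a minimal sketch of what each worker thread does. The `WorkerCtx` struct, buffer names, and binding order are simplified placeholders rather than my exact code; each thread owns its own `IExecutionContext` and stream:

```cpp
#include <NvInfer.h>
#include <cuda_runtime.h>
#include <semaphore.h>

struct WorkerCtx {
    nvinfer1::IExecutionContext* context;  // one execution context per thread
    cudaStream_t stream;                   // this thread's private stream
    sem_t inputReady;                      // posted by the producer per batch
    void *hostIn, *devIn;                  // input staging buffers
    void *hostOut, *devOut;                // output staging buffers
    size_t inBytes, outBytes;
    int batchSize;
};

void* workerLoop(void* arg) {
    WorkerCtx* w = static_cast<WorkerCtx*>(arg);
    // Binding order assumed here: index 0 = input, index 1 = output.
    void* bindings[] = { w->devIn, w->devOut };
    for (;;) {
        sem_wait(&w->inputReady);                               // 1. wait for a batch
        cudaMemcpyAsync(w->devIn, w->hostIn, w->inBytes,
                        cudaMemcpyHostToDevice, w->stream);     // 2. HtoD copy
        w->context->enqueue(w->batchSize, bindings,
                            w->stream, nullptr);                // 3. async inference
        cudaMemcpyAsync(w->hostOut, w->devOut, w->outBytes,
                        cudaMemcpyDeviceToHost, w->stream);     // 4. DtoH copy
        cudaStreamSynchronize(w->stream);                       // consume the results
    }
    return nullptr;
}
```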

I’m seeing the same behavior on both Quadro GP100 and Jetson TX2.

Thanks.

I’m seeing even worse results with TensorRT 3.2: it does not produce overlapping kernel executions at all, and it issues a lot of HtoD memory transfers on the default stream.
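
In case it matters: the host staging buffers in my setup are allocated as pinned memory, since cudaMemcpyAsync on pageable memory falls back to a synchronous path and cannot overlap with kernel execution. A minimal allocation sketch, reusing the illustrative `WorkerCtx` struct from above:

```cpp
#include <cuda_runtime.h>

// Pinned (page-locked) host buffers are required for cudaMemcpyAsync to
// actually run asynchronously; pageable memory forces a synchronous copy.
bool allocStaging(WorkerCtx* w) {
    if (cudaHostAlloc(&w->hostIn,  w->inBytes,  cudaHostAllocDefault) != cudaSuccess ||
        cudaHostAlloc(&w->hostOut, w->outBytes, cudaHostAllocDefault) != cudaSuccess ||
        cudaMalloc(&w->devIn,  w->inBytes)  != cudaSuccess ||
        cudaMalloc(&w->devOut, w->outBytes) != cudaSuccess)
        return false;
    // One non-default stream per worker, so no copy should land on stream 0.
    return cudaStreamCreate(&w->stream) == cudaSuccess;
}
```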