Whether the model execution using multiple streams in the tensorRT framework USES multicore concurrency

AI & Data Science Deep Learning (Training & Inference) TensorRT

576902280 March 4, 2020, 7:14am 1

Hi all,

I modified the fast- RCNN code to add two streams to run two separate fast- RCNN models。
I learned from cuda’s documentation that multiple streams can generate multicore concurrency to improve performance.But when I tested it, I didn’t see any performance improvements, using the RTX2060.

Topic		Replies	Views
Multithread does not improve inference performance with tensorrt models TensorRT tensorrt	2	1170	May 11, 2021
Performance Comparison: Multiple CUDA Streams with Multiple TensorRT Models vs. Combining Multiple TensorRT Models TensorRT tensorrt , cuda	0	377	December 23, 2023
Use multiple CUDA streams with multiple TensorRT models Jetson AGX Orin tensorrt , cuda	3	390	December 26, 2023
Speedup by increasing # of streams vs. batch size TensorRT	2	692	June 23, 2022
Inference Time When Using Multi Stream Multi Context in TensorRT is Slower than a Single One TensorRT tensorrt , cuda , cudnn	1	30	November 30, 2024
Some questions about using 10x0 and 20x0 cards for DL cuDNN	1	524	December 22, 2018
Running two models in multiple models increases the FPS TensorRT tensorrt , cuda , python	1	1427	October 28, 2020
Multi Stream in TensorRT TensorRT	1	2066	July 28, 2020
Parallel execution of several trt contexts on one GPU TensorRT onnx	1	1124	August 7, 2023
About optimize cuda program and get more throughput on T4 TensorRT	0	290	August 4, 2019

Whether the model execution using multiple streams in the tensorRT framework USES multicore concurrency

Related topics