TensorRT multi-GPU with multiple threads

user139950 · February 18, 2022, 9:07am

Description

Hello all. I am using TensorRT to run inference on an AI model in C++,
but I have run into a problem with multiple GPUs and multiple threads.

First, I build the TensorRT engines from multiple threads (one thread per GPU).
Second, as we know, using TensorRT with multiple GPUs requires calling cudaSetDevice both when creating the engine and when running inference, like this:

cudaSetDevice(m_gpuIndex);

However, I found that when one thread is inside ‘cudaStreamCreate’, ‘cudaMemcpy’, ‘enqueueV2’ (on the inference context), or another CUDA call, and another thread enters such a call at the same time,
the program blocks. If I lock a mutex before every inference call, it works, but the performance is bad. Could anyone help me?

Environment

TensorRT Version: 8.2.2.1
GPU Type: rtx-3070 (notebook)
Nvidia Driver Version: 470.74
CUDA Version: 11.1
CUDNN Version: 11.1
Operating System + Version: ubuntu 18.04 with linux kernel 5.4.0-99
Python Version (if applicable): no
TensorFlow Version (if applicable): no
PyTorch Version (if applicable): no
Baremetal or Container (if container which image + tag):

Relevant Files

To be provided later, if needed.

Steps To Reproduce

NVES · February 18, 2022, 9:37am

Hi,
The links below might be useful for you:
https://docs.nvidia.com/deeplearning/tensorrt/best-practices/index.html#thread-safety

https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__STREAM.html
For multi-threading/streaming, we suggest using DeepStream or Triton.
For more details, we recommend raising the query on the DeepStream or Triton forum.

Thanks!
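
For readers landing here, below is a minimal sketch of the layout that the thread-safety guidance linked above points toward: one host thread per GPU, each binding its device once and owning its own stream, execution context, and buffers, so that no global mutex is taken around inference. `WorkerState`, `DeviceHooks`, and `runPerDeviceWorkers` are hypothetical names, and the hooks are stand-ins for the real calls (`cudaSetDevice`, `cudaMemcpyAsync`, `enqueueV2`) so that the sketch builds without the CUDA toolkit. It only illustrates the structure and is not a confirmed fix for the blocking reported in this thread.

```cpp
#include <cassert>
#include <functional>
#include <thread>
#include <vector>

// Hypothetical per-thread state. In a real program this would also hold the
// cudaStream_t, the IExecutionContext*, and the device buffers for one GPU.
struct WorkerState {
    int deviceIndex = -1;  // GPU this thread is bound to
    int batchesRun = 0;    // number of inference calls issued by this thread
};

// Hypothetical hooks standing in for the real CUDA / TensorRT calls.
// Both are invoked concurrently from several threads, so they must be
// safe to call in parallel.
struct DeviceHooks {
    // Real code (assumption): cudaSetDevice(deviceIndex).
    std::function<void(int)> bindDevice;
    // Real code (assumption): cudaMemcpyAsync + enqueueV2 +
    // cudaStreamSynchronize, all on the calling thread's own stream.
    std::function<void(WorkerState&)> infer;
};

// Starts one thread per device. Each thread binds its device once and then
// works only on its own WorkerState, so no lock is shared between threads.
std::vector<WorkerState> runPerDeviceWorkers(int deviceCount,
                                             int batchesPerDevice,
                                             const DeviceHooks& hooks) {
    assert(deviceCount > 0 && batchesPerDevice >= 0);

    std::vector<WorkerState> states(deviceCount);
    std::vector<std::thread> threads;
    threads.reserve(deviceCount);

    for (int d = 0; d < deviceCount; ++d) {
        threads.emplace_back([&, d] {
            WorkerState& state = states[d];  // this thread's private slot
            hooks.bindDevice(d);             // bind before any other device work
            state.deviceIndex = d;
            // Real code would create the stream, deserialize the engine, and
            // create the execution context here, after binding the device.
            for (int b = 0; b < batchesPerDevice; ++b) {
                hooks.infer(state);
                ++state.batchesRun;
            }
        });
    }
    for (auto& t : threads) {
        t.join();
    }
    return states;
}
```

In a real program, `bindDevice` would call `cudaSetDevice(d)` and `infer` would enqueue work on a stream that the same thread created with `cudaStreamCreate`. Synchronous `cudaMemcpy` and the default stream can serialize work, so asynchronous copies on the per-thread stream are usually the safer choice when threads are meant to overlap.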
