How to run inference with TensorRT on multiple GPUs in Python

Description

Hi, I have two different TensorRT models. I want to run TRT model A on GPU 1 and TRT model B on GPU 2 with Python.

Environment

TensorRT Version: 7.2.3.4
GPU Type: V100
Nvidia Driver Version: 418
CUDA Version: 10.2
CUDNN Version: 8.1
Operating System + Version: Ubuntu 18.04

Relevant Files


Steps To Reproduce


Hi,
The links below might be useful for you:
https://docs.nvidia.com/deeplearning/tensorrt/best-practices/index.html#thread-safety
https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__STREAM.html

For multi-threading/streaming, we suggest using DeepStream or Triton Inference Server.
For more details, we recommend raising the query on the DeepStream or Triton forum.
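
If you want to keep everything in one Python process, here is a minimal sketch of a per-GPU approach: each engine is deserialized and executed inside its own CUDA context created on the target GPU via PyCUDA. This is only an illustration, not an official sample. It assumes explicit-batch serialized engines with static shapes and a single input/output; the file names model_a.trt / model_b.trt, the class name, and the zero-based device IDs are placeholders.

```python
import numpy as np
import tensorrt as trt
import pycuda.driver as cuda   # do NOT import pycuda.autoinit; contexts are managed manually

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)
cuda.init()


class TrtModelOnGpu:
    """One serialized engine pinned to one GPU through its own CUDA context."""

    def __init__(self, engine_path, device_id):
        # Create and activate a CUDA context on the chosen GPU.
        self.cuda_ctx = cuda.Device(device_id).make_context()
        try:
            runtime = trt.Runtime(TRT_LOGGER)
            with open(engine_path, "rb") as f:
                self.engine = runtime.deserialize_cuda_engine(f.read())
            self.context = self.engine.create_execution_context()
            self.stream = cuda.Stream()
            # Allocate host/device buffers for every binding (assumes static shapes).
            self.host_bufs, self.dev_bufs, self.bindings = [], [], []
            for i in range(self.engine.num_bindings):
                size = trt.volume(self.engine.get_binding_shape(i))
                dtype = trt.nptype(self.engine.get_binding_dtype(i))
                host = cuda.pagelocked_empty(size, dtype)
                dev = cuda.mem_alloc(host.nbytes)
                self.host_bufs.append(host)
                self.dev_bufs.append(dev)
                self.bindings.append(int(dev))
        finally:
            self.cuda_ctx.pop()   # never leave this context current on the calling thread

    def infer(self, input_array):
        # Assumes binding 0 is the single input and the last binding is the output.
        self.cuda_ctx.push()      # make this model's GPU current for the duration of the call
        try:
            np.copyto(self.host_bufs[0], input_array.ravel())
            cuda.memcpy_htod_async(self.dev_bufs[0], self.host_bufs[0], self.stream)
            self.context.execute_async_v2(bindings=self.bindings,
                                          stream_handle=self.stream.handle)
            cuda.memcpy_dtoh_async(self.host_bufs[-1], self.dev_bufs[-1], self.stream)
            self.stream.synchronize()
            return np.copy(self.host_bufs[-1])
        finally:
            self.cuda_ctx.pop()

    def destroy(self):
        # Release TensorRT objects while the owning context is current, then detach it.
        self.cuda_ctx.push()
        del self.context
        del self.engine
        self.cuda_ctx.pop()
        self.cuda_ctx.detach()


# Model A on GPU 0 and model B on GPU 1 (device IDs are zero-based).
model_a = TrtModelOnGpu("model_a.trt", device_id=0)
model_b = TrtModelOnGpu("model_b.trt", device_id=1)
# out_a = model_a.infer(preprocessed_input_a)   # input shape/dtype must match each engine
# out_b = model_b.infer(preprocessed_input_b)
```

Pushing and popping the per-model context around every call is what keeps the two engines from interfering with each other; for implicit-batch engines, execute_async would replace execute_async_v2.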

Thanks!

Hi @lizcomeon,

The following link may answer your query:
https://github.com/NVIDIA/TensorRT/issues/322
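
In the same spirit, one common alternative pattern (a hedged sketch, not necessarily the exact approach in the linked thread) is to isolate each model in its own process and expose only one GPU per process via CUDA_VISIBLE_DEVICES, so each worker behaves like an ordinary single-GPU script. The engine file names and GPU indices below are placeholders, and the per-engine buffer/inference code is elided.

```python
import os
import multiprocessing as mp


def worker(gpu_id, engine_path):
    # Restrict this process to a single GPU *before* any CUDA library is loaded.
    os.environ["CUDA_VISIBLE_DEVICES"] = str(gpu_id)
    import pycuda.autoinit          # creates a context on the only visible GPU
    import tensorrt as trt

    logger = trt.Logger(trt.Logger.WARNING)
    with open(engine_path, "rb") as f, trt.Runtime(logger) as runtime:
        engine = runtime.deserialize_cuda_engine(f.read())
    # ... allocate buffers and run inference exactly as in a single-GPU script ...
    print(f"{engine_path}: loaded on GPU {gpu_id}, {engine.num_bindings} bindings")


if __name__ == "__main__":
    mp.set_start_method("spawn")    # each child starts with a clean CUDA state
    procs = [mp.Process(target=worker, args=(0, "model_a.trt")),
             mp.Process(target=worker, args=(1, "model_b.trt"))]
    for p in procs:
        p.start()
    for p in procs:
        p.join()
```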

Thank you.