Inference issue queuing up on one GPU

96gund · April 2, 2024, 1:53am

Description

Hi,
I am continuing to this issue How to do two different inference with TensorRT on two different GPU on same machine or PC.

I am creating as per your mension in this post. Through this i am able to load model on both GPU. But while inferencing both inferences running on one GPU and getting queuing error. Instead of running seperately.

Environment

TensorRT Version: 8.3.2
GPU Type: RTXa2000
Nvidia Driver Version:
CUDA Version: 11.2
CUDNN Version: 8.4
Operating System + Version: windows
Python Version (if applicable):
TensorFlow Version (if applicable):
PyTorch Version (if applicable):
Baremetal or Container (if container which image + tag):

Relevant Files

Please attach or include links to any models, data, files, or scripts necessary to reproduce your issue. (Github repo, Google Drive, Dropbox, etc.)

Steps To Reproduce

Please include:

Exact steps/commands to build your repro
Exact steps/commands to run your repro
Full traceback of errors encountered

AakankshaS · May 31, 2024, 3:45am

Hi @96gund ,
Can you please help us with the logs?

Thanks

Topic		Replies	Views
How to inference with tensorrt on multi gpus in python TensorRT	2	2254	April 9, 2021
How to do two different inference with TensorRT on two different GPU on same machine or PC TensorRT	2	536	September 29, 2023
Can not run two tensorrt models (two dockers) on same GPU TensorRT tensorrt , tensorflow , tf-trt	1	959	September 7, 2021
Latency when running TensorRT engine on two GPU TensorRT	9	1314	August 24, 2020
Not able to inference multiple input models using TRT TensorRT tensorrt , tensorflow , jetson-inference	1	486	August 12, 2021
Bug : Tensorrt Model not loading on same GPU on a different device (slight driver version difference) TensorRT tensorrt , cudnn	1	281	April 30, 2024
Issue with Inferencing with TensorRT on Python TensorRT	7	1254	July 20, 2022
Capable of running multiple Inference concurrently on the same GPU using TensorRT? TensorRT	0	901	March 27, 2019
Is it possible to run multiple TensorRT model inference on a GPU simultaneously and parallelly? TensorRT tensorrt , cuda	3	2170	August 23, 2022
Unable to run two TensorRT models in a cascade manner TensorRT tensorrt , python	7	5091	October 12, 2021

Inference issue queuing up on one GPU

Description

Environment

Relevant Files

Steps To Reproduce

Related topics