TensorRT support for multiple GPUs - URGENT

We are finding that the only way we can use TensorRT (7.2.3.4) on a GPU model we haven't used before is to rebuild the TensorRT engine on that GPU type first.

For example, our software works on an RTX 2070 Max-Q but didn't work on a GTX 1050 Ti. So we got hold of a 1050 Ti to build TRT on that machine, but the result didn't work on a 1050, so we had to buy a 1050 to build yet another version. We thought that our TensorRT engine built on a GTX 1660 would work on an RTX 2080 Ti, but it turned out we were wrong: it returns null when trying to load the engine into memory.

Is this lack of inter-GPU compatibility expected with TensorRT? If so, what is the bare minimum set of GPU models we would need to buy to support all of your GPUs above the GTX 1050?

@NVES @spolisetty Please help ASAP, as we have an unhappy customer because of our lack of RTX 2080 Ti support.

Hi,
The below links might be useful for you:
https://docs.nvidia.com/deeplearning/tensorrt/best-practices/index.html#thread-safety
https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#stream-priorities
https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__STREAM.html
For multi-threading/streaming, we would suggest using DeepStream or Triton.
For more details, we recommend raising the query on the DeepStream or Triton forums.

Thanks!

@NVES thank you for the prompt reply and for the links.

So does TensorRT not transfer well between GPUs without building on that specific GPU type?

My question wasn't about multi-threading/streaming, I don't think?

Hi,

Serialized engines are not portable across platforms or TensorRT versions. Engines are specific to the exact GPU model they were built on (in addition to the platform and the TensorRT version). It is recommended to build serialized engines on the target platforms directly.

Please refer to the below link for the same.
https://docs.nvidia.com/deeplearning/tensorrt/developer-guide/index.html#work

Thank you.

@spolisetty OK, understood. Can you recommend which GPUs to use that will support the largest number of other GPU models?

For example, if we build a TensorRT engine on an RTX 2070, should we then be able to support GPUs x, y, and z? Or do we always need to have the EXACT same GPU as every customer?

Also, if we were to build on the non-Ti version of a GPU, could we use TensorRT on the Ti version (or vice versa)? I.e., if we build on a GTX 1050, would TRT work on a 1050 Ti?
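For context on the Ti/non-Ti question: the relevant grouping is the GPU's CUDA compute capability (SM architecture), which Ti and non-Ti variants of the same family usually share. A small lookup sketch for the cards mentioned in this thread (values from NVIDIA's public specs; the table is illustrative, not exhaustive, and note that on TensorRT 7 matching compute capability is necessary but NOT sufficient for engine reuse across different models):

```python
# CUDA compute capability (SM version) of the GPUs discussed above.
# Caution: with TensorRT 7, two cards sharing a compute capability
# does NOT guarantee an engine built on one will load on the other.
COMPUTE_CAPABILITY = {
    "GTX 1050":    "6.1",  # Pascal (GP107)
    "GTX 1050 Ti": "6.1",  # Pascal (GP107)
    "GTX 1660":    "7.5",  # Turing (TU116)
    "RTX 2070":    "7.5",  # Turing (TU106)
    "RTX 2080 Ti": "7.5",  # Turing (TU102)
}

def same_architecture(gpu_a, gpu_b):
    """True if both GPUs share a compute capability; a necessary (but,
    on TensorRT 7, not sufficient) condition for reusing an engine."""
    return COMPUTE_CAPABILITY[gpu_a] == COMPUTE_CAPABILITY[gpu_b]
```

So a GTX 1050 and 1050 Ti are the same architecture (6.1), while the GTX 1660 and RTX 2080 Ti both report 7.5 yet, as the thread shows, still could not share an engine under TensorRT 7.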

@spolisetty @NVES is there anything you can recommend that would let us benefit from the faster inference TensorRT provides but that is easier to port between machines? For example, should we be using ONNX or something?

Hi,

I believe this may not work reliably. It is always recommended to build the engine on the same host where inference will run, even with the same type of GPU.

Yes, you can use ONNX to port the model across platforms and then use it to build the TensorRT engine on each target.

Please refer to the support matrix for more info on TensorRT hardware/software requirements:
https://docs.nvidia.com/deeplearning/tensorrt/support-matrix/index.html#hardware-precision-matrix

Thank you.