Hi, I have trained my own model (transfer learning with ResNet-50 as the base model) and ran TensorRT inference two ways: through TF-TRT and through the native TensorRT C++ API. I expected higher performance from the TRT C++ API implementation compared to TF-TRT, but I got the opposite result. Could you please assist? See the chart below:
GPU model: Tesla T4
TF-TRT5 Environment: Ubuntu 16.04.5 LTS | NVIDIA driver 410.72 | TensorRT 5.0 | TensorFlow 1.10 | CUDA 10.0 | ImageNet | script inference.py | docker image nvcr.io/nvidia/tensorflow:18.10-py3
Native TRT5 Environment: Ubuntu 16.04.5 LTS | NVIDIA driver 410.79 | TensorRT 5.0.2 | CUDA 10.0 | script trtexec.cpp | docker image nvcr.io/nvidia/tensorrt:18.11-py3
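For context, the native-TRT numbers came from `trtexec`. A sketch of the kind of invocation used is below; the file names (`resnet50.uff`, input/output tensor names) are placeholders for my model, not the exact values, and the precision/batch flags are the knobs that most affect the comparison with TF-TRT:

```
# Benchmark a UFF-exported model with trtexec (TensorRT 5 flag style).
# File name and tensor names are placeholders for my actual model.
trtexec --uff=resnet50.uff \
        --uffInput=input_1,3,224,224 \
        --output=fc1000/Softmax \
        --batch=8 \
        --fp16 \
        --workspace=1024 \
        --iterations=100 \
        --avgRuns=10
```

One thing worth noting for the comparison: if TF-TRT was run with FP16 enabled but `trtexec` was run without `--fp16` (or with a much smaller `--workspace`), the native path can easily look slower even though the underlying engine is the same.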