Is the per-call inference time affected by the frequency of calls?

Description

When I call TensorRT inference at 10 fps, the time per call is 30 ms.
But when I call it at 50 fps, the time per call drops to 13 ms, which is a big reduction.
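
A minimal sketch of this kind of measurement, just to make the setup concrete (the `infer()` stub and the pacing loop are hypothetical stand-ins, not the actual script):

```python
import time

def infer(frame):
    # Placeholder for the actual TensorRT execute call.
    pass

def average_latency_ms(fps, n_calls=200):
    period = 1.0 / fps
    total = 0.0
    for _ in range(n_calls):
        start = time.perf_counter()
        infer(None)
        elapsed = time.perf_counter() - start
        total += elapsed
        # Sleep out the rest of the frame period so calls arrive at `fps`.
        if period > elapsed:
            time.sleep(period - elapsed)
    return total / n_calls * 1000.0

print("10 fps:", average_latency_ms(10), "ms per call")
print("50 fps:", average_latency_ms(50), "ms per call")
```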

How is the per-call inference time affected by the frequency of calls?

Environment

TensorRT Version: TensorRT-7.1.3.4
GPU Type: Tesla T4
Nvidia Driver Version: 418.87.00
CUDA Version: 10.2
CUDNN Version:
Operating System + Version: Ubuntu18.04
Python Version (if applicable):
TensorFlow Version (if applicable):
PyTorch Version (if applicable): 1.4
Baremetal or Container (if container which image + tag):

Hi @zehua.12,
What batch size did you use during model generation? Is there an optimization profile set for your model?
Could you please share the model and script files so we can help better?
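
For reference, an explicit optimization profile is normally added when building the engine, roughly like this (a minimal sketch; the file name, the input name "input", and the shapes below are placeholders, not taken from your model):

```python
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(logger)
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
parser = trt.OnnxParser(network, logger)
with open("model.onnx", "rb") as f:
    parser.parse(f.read())

config = builder.create_builder_config()
config.max_workspace_size = 1 << 30  # 1 GiB

# Only needed for dynamic input shapes; "input" and the shapes are placeholders.
profile = builder.create_optimization_profile()
profile.set_shape("input", (1, 3, 224, 224), (1, 3, 224, 224), (8, 3, 224, 224))
config.add_optimization_profile(profile)

engine = builder.build_engine(network, config)
```

With a fully static input shape (batch size fixed at 1 in the ONNX), TensorRT does not require an explicit profile.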

Thanks

Sorry, I can’t share the private model and script.

The TensorRT engine was generated from an ONNX model whose input batch size is 1, using the default optimization profile. I always run inference on images one by one, not in batches, so I guess the optimization profile has nothing to do with this issue.
Maybe the GPU cache is causing the issue?
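
For reference, single-image inference and per-call timing with a TensorRT 7 engine typically look roughly like the sketch below (not the real script; the engine path, binding order, and tensor shapes are placeholders, and PyTorch tensors are used only as convenient CUDA buffers since PyTorch 1.4 is already in the environment):

```python
import time
import torch
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)

# "model.engine" and the shapes below are placeholders for the real
# serialized engine and its actual input/output bindings.
with open("model.engine", "rb") as f, trt.Runtime(logger) as runtime:
    engine = runtime.deserialize_cuda_engine(f.read())
context = engine.create_execution_context()

inp = torch.empty((1, 3, 224, 224), dtype=torch.float32, device="cuda")
out = torch.empty((1, 1000), dtype=torch.float32, device="cuda")

def infer_one(frame):
    """Run one image through the engine and return the latency in ms."""
    inp.copy_(frame)
    torch.cuda.synchronize()   # start timing from an idle GPU
    start = time.perf_counter()
    # Assumes binding 0 is the input and binding 1 is the output.
    context.execute_v2([inp.data_ptr(), out.data_ptr()])
    torch.cuda.synchronize()   # make sure the kernels have finished
    return (time.perf_counter() - start) * 1000.0
```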