We found that the first 2 inference requests with the Inception v3 model take much more time, while subsequent requests are fast. How can we address this with the model warmup option? What options do we need to set, or is there a configuration for this?
The 1.9.0 version of the inference server (container version 19.12) has a warmup option. See https://docs.nvidia.com/deeplearning/sdk/tensorrt-inference-server-guide/docs/model_configuration.html#model-warmup
We are using TRT IS version 19.11, will this work with that? Secondly, can you share an example configuration file that would help us? I am not finding enough documentation on this.
Warmup was added in 19.12 so you will have to update to use it.
Note that this same issue has been discussed on GitHub: https://github.com/NVIDIA/tensorrt-inference-server/issues/708
There is an L0_warmup CI test that uses the warmup parameters. They are patched into existing config.pbtxt files as here: https://github.com/NVIDIA/tensorrt-inference-server/blob/master/qa/L0_warmup/test.sh#L66.
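As a starting point, a warmup stanza can be added to a model's config.pbtxt. The sketch below is an assumption-laden example, not a verified Inception v3 config: the input tensor name ("input") and the dims (3x299x299, a common Inception v3 input shape) must be replaced with the actual values from your model's configuration.

```protobuf
# Hypothetical warmup sketch for an Inception-v3-style model.
# Tensor name and dims are assumptions -- match them to your model.
model_warmup [
  {
    name: "inception_warmup_sample"
    batch_size: 1
    inputs {
      key: "input"           # replace with your model's input tensor name
      value: {
        data_type: TYPE_FP32
        dims: [ 3, 299, 299 ]  # replace with your model's input dims
        zero_data: true        # send all-zero data for warmup requests
      }
    }
  }
]
```

With this in place, the server runs the warmup request(s) when the model loads, so the one-time initialization cost is paid before the first client request arrives.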