TensorRT engines built with the same IBuilderConfig perform differently. How can I fix this?

Description

I am trying to find an optimized build of my network. I run the same code several times to build the engine and measure the average inference time. The average inference time varies across runs, which indicates that engines built with the same builder configuration perform differently from one experiment to the next.

I have increased the values passed to setMinTimingIterations() and setAvgTimingIterations(), hoping the construction would converge to a single optimized engine, but it did not work. How can I obtain an optimized, reproducible engine with IBuilderConfig? I would prefer not to serialize the engine. Thank you!
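For reference, the average inference time is measured roughly as follows. This is a simplified sketch, not my exact benchmark code; context, bindings, and stream stand in for my execution context, device buffer pointers, and CUDA stream:

// Simplified sketch of the timing loop; `context`, `bindings`, and
// `stream` are placeholders for my actual objects.
cudaEvent_t start, stop;
cudaEventCreate(&start);
cudaEventCreate(&stop);

float total_ms = 0.0f;
const int kRuns = 100;  // number of timed inferences
for (int i = 0; i < kRuns; ++i) {
    cudaEventRecord(start, stream);
    context->enqueueV2(bindings, stream, nullptr);  // one inference
    cudaEventRecord(stop, stream);
    cudaEventSynchronize(stop);
    float ms = 0.0f;
    cudaEventElapsedTime(&ms, start, stop);
    total_ms += ms;
}
std::cout << "average inference time: " << total_ms / kRuns << " ms" << std::endl;

cudaEventDestroy(start);
cudaEventDestroy(stop);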

Environment

TensorRT Version: 8.0.1.6
GPU Type: 1080ti
Nvidia Driver Version:
CUDA Version:
CUDNN Version:
Operating System + Version:
Python Version (if applicable):
TensorFlow Version (if applicable):
PyTorch Version (if applicable):
Baremetal or Container (if container which image + tag):

Relevant Files

Below is the code I use to set up the optimization profile and the builder config.

bool Profiler::construct_s1(
    TRTUniquePtr<nvinfer1::IBuilder>& builder,
    TRTUniquePtr<nvinfer1::INetworkDefinition>& network,
    TRTUniquePtr<nvinfer1::IBuilderConfig>& config,
    TRTUniquePtr<nvonnxparser::IParser>& parser)
{
    auto profile = builder->createOptimizationProfile();
    samplesCommon::OnnxSampleParams params;
    params.dataDirs.emplace_back("./models");

    // Parse the ONNX model into the network definition.
    auto parsed = parser->parseFromFile(
        locateFile("resnet.onnx", params.dataDirs).c_str(),
        static_cast<int>(sample::gLogger.getReportableSeverity()));
    if (!parsed) {
        return false;
    }

    input_dims_s1 = network->getInput(0)->getDimensions();
    input_tensor_names_ = network->getInput(0)->getName();

    // Fix the batch dimension: min, opt, and max all use the same batch
    // size, so the profile describes a single static input shape.
    nvinfer1::Dims min_dims = input_dims_s1;
    min_dims.d[0] = batch_size_s1_;
    nvinfer1::Dims opt_dims = input_dims_s1;
    opt_dims.d[0] = batch_size_s1_;
    nvinfer1::Dims max_dims = input_dims_s1;
    max_dims.d[0] = batch_size_s1_;

    profile->setDimensions(input_tensor_names_.c_str(), nvinfer1::OptProfileSelector::kMIN, min_dims);
    profile->setDimensions(input_tensor_names_.c_str(), nvinfer1::OptProfileSelector::kOPT, opt_dims);
    profile->setDimensions(input_tensor_names_.c_str(), nvinfer1::OptProfileSelector::kMAX, max_dims);

    config->addOptimizationProfile(profile);
    config->setMaxWorkspaceSize(3_GiB);
    // Give the builder more timing samples per tactic, hoping for a more
    // stable tactic selection.
    config->setMinTimingIterations(5);
    config->setAvgTimingIterations(5);
    return true;
}
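For context, the surrounding build flow follows the standard TensorRT 8 sample pattern. A condensed sketch (error handling omitted; profiler is an instance of my Profiler class, and InferDeleter is the usual deleter from the samples' common code):

auto builder = TRTUniquePtr<nvinfer1::IBuilder>(
    nvinfer1::createInferBuilder(sample::gLogger.getTRTLogger()));
const auto flags =
    1U << static_cast<uint32_t>(nvinfer1::NetworkDefinitionCreationFlag::kEXPLICIT_BATCH);
auto network = TRTUniquePtr<nvinfer1::INetworkDefinition>(builder->createNetworkV2(flags));
auto config = TRTUniquePtr<nvinfer1::IBuilderConfig>(builder->createBuilderConfig());
auto parser = TRTUniquePtr<nvonnxparser::IParser>(
    nvonnxparser::createParser(*network, sample::gLogger.getTRTLogger()));

if (profiler.construct_s1(builder, network, config, parser)) {
    auto engine = std::shared_ptr<nvinfer1::ICudaEngine>(
        builder->buildEngineWithConfig(*network, *config), samplesCommon::InferDeleter());
    // create an execution context from `engine` and run the timing loop above
}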

Hi @slzhang1998,

Could you please let us know: is the variation in inference time large, or negligible? Also, if you save the engine and reload it, is the inference time always the same?
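For example, you could serialize the engine once and reload the same plan for every timing run. A minimal sketch using the standard TensorRT serialization APIs (the file name is just an example):

#include <fstream>
#include <iterator>
#include <vector>

// Serialize the built engine to disk once...
TRTUniquePtr<nvinfer1::IHostMemory> plan(engine->serialize());
std::ofstream out("resnet.engine", std::ios::binary);
out.write(static_cast<const char*>(plan->data()), plan->size());
out.close();

// ...and reload the same plan for every timing experiment.
std::ifstream in("resnet.engine", std::ios::binary);
std::vector<char> blob((std::istreambuf_iterator<char>(in)), std::istreambuf_iterator<char>());
TRTUniquePtr<nvinfer1::IRuntime> runtime(
    nvinfer1::createInferRuntime(sample::gLogger.getTRTLogger()));
auto reloaded = std::shared_ptr<nvinfer1::ICudaEngine>(
    runtime->deserializeCudaEngine(blob.data(), blob.size()), samplesCommon::InferDeleter());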
We don’t think engine building is supposed to be deterministic, because tactics are chosen based on observed runtime performance, which can vary from build to build.
Please refer to the following post for more details.

If the time difference is large, could you try building the engine using trtexec?
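For example, something like the following (flag names can vary between trtexec versions, so please check trtexec --help; 3072 is your 3 GiB workspace expressed in MiB):

trtexec --onnx=resnet.onnx --workspace=3072 --avgRuns=100 --saveEngine=resnet.engine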

Thank you.