Deterministic TensorRT optimization

domagoj.krivosic · August 24, 2020, 12:44pm

Description

When I optimize the same model with tensorRT_optimization tool on Nvidia Drive AGX platform twice in a row , I get two binary files of different sizes. Also, it is mentioned here GitHub - NVIDIA/framework-determinism: Providing determinism in deep learning frameworks that TensorRT behaves non-deterministically. Drive OS comes with TensorRT 5.1.2. Is it fixed in newer versions? Can the model be optimized deterministically using TensorRT C++ API? When will solution be available on Nvidia Drive?

Environment

TensorRT Version: 5.1.2
CUDA Version: 10.2

AakankshaS · August 24, 2020, 6:04pm

Hi @domagoj.krivosic
If you are using same engine with same input, TensorRT should be deterministic.
Can you please try this on latest TRT release.

Thanks!

domagoj.krivosic · August 25, 2020, 7:37am

Sorry if I wasn’t clear enough. I am not comparing inference results, I am generating engines from UFF file. When I generate two engines from the same UFF file with the same arguments, the engine files have different sizes.

AakankshaS · August 26, 2020, 10:56am

Hi @domagoj.krivosic,
Can you help me with the model and the script you are using.
Thanks!

domagoj.krivosic · August 26, 2020, 11:25am

Hi @AakankshaS ,

Unfortunately, we can’t share the model at this point (we might prepare a sample model a bit later). However, we don’t see this as a question specific to the model we are currently looking at. We plan to use TensorRT for optimization of various models in future, so at this point we are just wondering about the general behavior of the TensorRT product. If we optimize the same model (with the same parameters - weights) twice, should we expect to get exactly the same engine files? If not, is it possible to achieve this reproducibility?

We are working on the Nvidia Drive platform (Drive OS 5.1.6.1), optimizing with the shipped binary /usr/local/driveworks/tools/dnn/tensorRT_optimization, but the question is not limited to the mentioned tool. We would like to know if the reproducibility is achievable with TensorRT API or any other way.

AakankshaS · August 26, 2020, 5:41pm

Hi @domagoj.krivosic,

The below link will help you answer your query

Thanks!

domagoj.krivosic · August 27, 2020, 11:44am

@AakankshaS thank you. I see that algorithm selector is available from TensorRT version 7, but not in earlier versions. Is it safe to say that fully deterministic engine building is supported only with TensorRT 7.0 and newer?

AakankshaS · August 27, 2020, 2:31pm

Hi @domagoj.krivosic,
Yes, we can say that. With latest TRT Releases, you will find more features with improved performance.
Thanks!

Topic		Replies	Views
Is TensorRT inference deterministic/reproducibile? TensorRT tensorrt	5	2616	October 12, 2021
Trtexec generates different engines when using the same platform/machine with the same onnx model TensorRT	3	1141	March 29, 2022
Using and updating TensorRT on Pegasus platform DRIVE AGX Xavier General	6	763	October 12, 2021
Driveworks sample_dnn_tensor : How can I regenerate engine file? TensorRT driveworks	3	1448	March 24, 2021
Question about TensorRT reproducibility on different architectures TensorRT	3	906	October 12, 2021
TensorRT engines are built so differently with the same IBuilderConfig, how to fix? TensorRT	1	624	September 20, 2021
TensorRT engine dependencies TensorRT	4	1716	March 10, 2022
Non-deterministic TensorRT engine building TensorRT tensorrt	3	567	March 10, 2021
Issue with TensorRT binary TensorRT tensorrt	2	739	March 26, 2021
Question regarding Tensorrt engine build vs inference environment (TensorRT version, Platform, etc) TensorRT	4	905	October 21, 2021

Deterministic TensorRT optimization

Description

Environment

Related topics