TensorRT ICudaEngine serialization size is different for the same uff file INetworkDefinition

NVES · September 5, 2018, 5:29pm

Hello orong13,

In general, the CUDA engine produced by a given INetworkDefinition by a TensorRT builder may depend on various system factors (GPU, OS/kernel, CPU, system load, available memory, etc.) that affect layer implementation availability and timing during the process of building an engine.

The serialization of the engine is dependent on the layer implementations chosen for that engine and therefore is also dependent on these system factors. It is therefore not unexpected behavior if a given INetworkDefinition produces different engines which have different serializations over multiple runs. If the same engine in TensorRT produced different serializations over multiple runs, that would be unexpected behavior.

Topic		Replies	Views
TensorRT doesn't perform properly the Tensorflow concat and\or reshape commands TensorRT	8	3576	December 27, 2018
TensorRT added layer before output TensorRT	3	774	February 13, 2020
Different TensorRT inference results from the same input when batchSize > 1 TensorRT	2	2017	October 12, 2021
paring the UFF network with the uffparser and making an engine with build_cuda_engine fails TensorRT	3	1143	January 30, 2019
Problem with custom layers and Python UFF parser in TensorRT 3.0 RC Jetson TX2	41	7734	October 18, 2021
model accuracy penalty with tensorRT on jetson TX2 Jetson TX2	7	635	October 18, 2021
Low Compute utilization of converted TensorFlow model during inference Jetson TX2	19	1693	October 18, 2021
how to import uff model from a UFF File Jetson TX2	15	3407	October 18, 2021
Tensorflow RNN UFF conversion not yet supported? TensorRT	8	1263	October 12, 2021
I don't get similar results with TensorRT and the trained tensorflow model! Jetson TX2	20	4473	October 18, 2021

TensorRT ICudaEngine serialization size is different for the same uff file INetworkDefinition

Related Topics