Should pruning a model prior to converting it to TensorRT make inference faster?

Hi,

Another thing worth checking is how many layers are actually inferenced with TensorRT.

Please note that the framework you are using is TF-TRT.
TF-TRT integrates TensorRT into the TensorFlow interface, so each layer may be executed by either TensorFlow or TensorRT.

The layer placement can be found in the TensorFlow log.
Could you collect it and share it with us?
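
As a rough sketch (assuming a TF2 SavedModel that has already been converted with TF-TRT; the path and signature name below are just placeholders), you can also count the TRTEngineOp nodes in the converted graph to see how much of the model was placed in TensorRT:

```python
# Sketch: count TRTEngineOp nodes in a TF-TRT converted SavedModel.
# saved_model_dir and the signature key are assumptions for illustration.
import tensorflow as tf

saved_model_dir = "/path/to/trt_converted_saved_model"  # hypothetical path
model = tf.saved_model.load(saved_model_dir)
func = model.signatures["serving_default"]

graph_def = func.graph.as_graph_def()

# TRTEngineOp nodes may sit in the main graph or inside library functions.
nodes = list(graph_def.node)
for f in graph_def.library.function:
    nodes.extend(f.node_def)

trt_engines = sum(1 for n in nodes if n.op == "TRTEngineOp")
print("TRTEngineOp nodes:", trt_engines)
print("Total nodes:", len(nodes))
```

If only a few TRTEngineOp nodes show up relative to the total node count, most of the model is still running in native TensorFlow, which would explain a smaller-than-expected speedup.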

Thanks.