Hi,
I have just switched from the Caffe framework to TensorRT. I have written a plugin for PReLU, but when I test FP32 against FP16, the speed is not much different.
Thanks.
Hi,
Could you share more information about your use case?
Here are some initial suggestions:
1. Please remember to maximize TX2 performance first:
sudo ./jetson_clocks.sh
2. It’s recommended to use the TensorRT profiler to identify the bottleneck layer; a minimal sketch is included after this list.
Please check our native sample for reference:
/usr/src/tensorrt/samples/sampleGoogleNet/sampleGoogleNet.cpp
3. FP16 halves memory use but does not always double performance.
The time to process a specific layer (e.g., an IP/inner-product layer) may even be longer in FP16 mode.
It is encouraged to compare per-layer performance between FP16 and FP32; see the second sketch below.
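
A minimal per-layer profiler sketch for point 2, assuming a TensorRT release where IProfiler exposes reportLayerTime and you already have a built engine; the LayerProfiler name and the context/buffers variables in the usage comment are placeholders, not part of the TensorRT API:

#include <NvInfer.h>
#include <iostream>
#include <map>
#include <string>

class LayerProfiler : public nvinfer1::IProfiler
{
public:
    // TensorRT invokes this once per layer per inference run,
    // so we accumulate the time under each layer's name.
    void reportLayerTime(const char* layerName, float ms) noexcept override
    {
        mTimings[layerName] += ms;
    }

    void print() const
    {
        for (const auto& kv : mTimings)
            std::cout << kv.first << ": " << kv.second << " ms" << std::endl;
    }

private:
    std::map<std::string, float> mTimings;  // accumulated time per layer name
};

// Usage (context is an existing IExecutionContext*; note that profiling
// may force synchronous execution on older TensorRT releases):
//   LayerProfiler profiler;
//   context->setProfiler(&profiler);
//   context->execute(batchSize, buffers);
//   profiler.print();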
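
And for point 3, a sketch of building the same network in both precisions so the timings can be compared fairly. This assumes a TensorRT 4/5-era builder API (setFp16Mode, buildCudaEngine); older releases use setHalf2Mode instead, and newer ones use config->setFlag(BuilderFlag::kFP16). The buildEngine helper and the workspace size are illustrative choices, not fixed values:

#include <NvInfer.h>
#include <iostream>

// Build the engine twice, once per precision, on the same network
// (builder and network creation omitted for brevity).
nvinfer1::ICudaEngine* buildEngine(nvinfer1::IBuilder* builder,
                                   nvinfer1::INetworkDefinition* network,
                                   bool useFp16)
{
    // Check the hardware actually has a fast native FP16 path (true on TX2).
    if (useFp16 && !builder->platformHasFastFp16())
        std::cout << "Warning: no fast native FP16 on this platform" << std::endl;

    builder->setFp16Mode(useFp16);          // TensorRT <= 3: setHalf2Mode(useFp16)
    builder->setMaxBatchSize(1);
    builder->setMaxWorkspaceSize(1 << 25);  // 32 MB scratch; tune for your model
    return builder->buildCudaEngine(*network);
}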
Thanks.