FP16 integration in custom API implementation

dvbr · June 19, 2018, 9:29am

Hello,

I would like to use the FP16/Half mode for my convolution, batchnorm and pooling layers.

I use the fp16.h file provided in the last samples of TensorRT 4.0 to transform convolutions and batchnorms parameters to half data type.

Inputs and outputs of my networks are FP32 data type.

I set the outputs tensor type of all my layers using :

layer_output_tensor->setType(DataType::kHALF);

However during the initialization phase i received these message for all the layers:

“Tensor DataType is determined at build time for tensors not marked as input or output.”

With FP16 DataType the network outputs are wrong (correct results with FP32 Datatype), there is not a lot of information in the TensorRT documentation to use the FP16 under API (without Caffe parser or others tensorflow abstraction layer). I would like to know if i have to transform my inputs under FP16 and my outputs too ?

Thanks

Topic		Replies	Views
TensorRT FP16 model creation TensorRT	1	752	June 1, 2018
Plugin to convert to and from half precision within the network TensorRT	2	775	October 12, 2021
TemsorRT Fp16 mode Jetson TX1	6	1279	October 18, 2021
Which layers of TensorRT will work in fp16 mode when enable the --half2 option? Jetson TX1	2	546	October 18, 2021
Different FP16 inference with tensorrt and pytorch TensorRT	5	4539	October 25, 2021
TensorRT Half2 Accuracy Issue Jetson TX1	5	890	October 18, 2021
Data type for TensorRT engine created from UFF model with DataType.HALF TensorRT	2	1286	May 2, 2018
Use FP16 regardless if it is slower or not TensorRT	4	984	May 16, 2022
the difference of DataType::kHALF in the `parse` function of `ICaffeParser* parser` and `builder->setHalf2Mode(true)`? TensorRT	3	658	October 12, 2021
Is DataType::kHALF deprecated? TensorRT	0	566	November 11, 2020

FP16 integration in custom API implementation

Related topics