TensorRT Half2 Accuracy Issue

Hi,

I am trying to run a VGG16 model on a TX1 with TensorRT 1.0. Starting from the built-in GIE tutorial, I modified the sample for my own use. It works well with DataType::kFLOAT, but when I switch to DataType::kHALF to speed up inference, it fails an assertion that looks like a dimension mismatch.

Error message below:

caffe_parser: cudnnReformatLayer.cpp:31: virtual void nvinfer1::cudnn::ReformatLayer::execute(const nvinfer1::cudnn::CommonContext&): Assertion `in.getDimensions() == out.getDimensions()' failed.
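
For context, my build code follows the GIE sampleGoogleNet pattern. Below is a minimal sketch of the kHALF path, not my exact code; the logger, function name, and file paths are placeholders:

#include <iostream>
#include "NvInfer.h"
#include "NvCaffeParser.h"

using namespace nvinfer1;
using namespace nvcaffeparser1;

// Minimal logger, as in the GIE samples.
class Logger : public ILogger
{
    void log(Severity severity, const char* msg) override
    {
        if (severity != Severity::kINFO)
            std::cout << msg << std::endl;
    }
} gLogger;

ICudaEngine* buildEngine(const char* deployFile, const char* modelFile, bool useHalf2)
{
    IBuilder* builder = createInferBuilder(gLogger);
    INetworkDefinition* network = builder->createNetwork();
    ICaffeParser* parser = createCaffeParser();

    // Parse the weights as fp16 when targeting Half2 mode.
    DataType modelDataType = useHalf2 ? DataType::kHALF : DataType::kFLOAT;
    const IBlobNameToTensor* blobNameToTensor =
        parser->parse(deployFile, modelFile, *network, modelDataType);

    // Mark both feature maps as network outputs.
    network->markOutput(*blobNameToTensor->find("Fea_P3"));
    network->markOutput(*blobNameToTensor->find("Fea_P4"));

    builder->setMaxBatchSize(1);
    builder->setMaxWorkspaceSize(16 << 20);
    if (useHalf2)
        builder->setHalf2Mode(true);  // paired-fp16 execution

    ICudaEngine* engine = builder->buildCudaEngine(*network);
    network->destroy();
    parser->destroy();
    builder->destroy();
    return engine;
}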

Can someone help?

Thanks.
Leo

An interesting finding!

I built a smaller model that is a subset of my original one, and it runs in Half2 mode as long as only one output is marked; marking two outputs reproduces the assertion. Is this a bug?
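
A hypothetical helper to show the difference (same network and blobNameToTensor as in the build sketch in my first post): with markBoth == false the Half2 engine builds and runs; with markBoth == true the ReformatLayer assertion fires.

#include "NvInfer.h"
#include "NvCaffeParser.h"

// Sketch only: toggle between the working single-output build and the
// failing two-output build in Half2 mode.
void markOutputs(nvinfer1::INetworkDefinition& network,
                 const nvcaffeparser1::IBlobNameToTensor& blobs,
                 bool markBoth)
{
    network.markOutput(*blobs.find("Fea_P4"));      // one output: works
    if (markBoth)
        network.markOutput(*blobs.find("Fea_P3"));  // second output: assertion
}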

Regards.
Leo

Hi,

It looks like your model uses something that is not supported in FP16 mode.
Could you paste your prototxt file so we can debug it?

Thanks.

name: "VGG_ILSVRC_16_layer_1"
input: "data"
input_shape {
  dim: 1
  dim: 3
  dim: 321
  dim: 321
}
layer {
  bottom: "data"
  top: "conv1_1"
  name: "conv1_1"
  type: "Convolution"
  convolution_param {
    num_output: 21
    pad: 1
    kernel_size: 3
  }
}
layer {
  bottom: "conv1_1"
  top: "conv1_1"
  name: "relu1_1"
  type: "ReLU"
}
layer {
  bottom: "conv1_1"
  top: "conv1_2"
  name: "conv1_2"
  type: "Convolution"
  convolution_param {
    num_output: 26
    pad: 1
    kernel_size: 3
  }
}
layer {
  bottom: "conv1_2"
  top: "conv1_2"
  name: "relu1_2"
  type: "ReLU"
}
layer {
  bottom: "conv1_2"
  top: "pool1"
  name: "pool1"
  type: "Pooling"
  pooling_param {
    pool: MAX
    kernel_size: 2
    stride: 2
  }
}
layer {
  bottom: "pool1"
  top: "conv2_1"
  name: "conv2_1"
  type: "Convolution"
  convolution_param {
    num_output: 22
    pad: 1
    kernel_size: 3
  }
}
layer {
  bottom: "conv2_1"
  top: "conv2_1"
  name: "relu2_1"
  type: "ReLU"
}
layer {
  bottom: "conv2_1"
  top: "conv2_2"
  name: "conv2_2"
  type: "Convolution"
  convolution_param {
    num_output: 28
    pad: 1
    kernel_size: 3
  }
}
layer {
  bottom: "conv2_2"
  top: "conv2_2"
  name: "relu2_2"
  type: "ReLU"
}
layer {
  bottom: "conv2_2"
  top: "pool2"
  name: "pool2"
  type: "Pooling"
  pooling_param {
    pool: MAX
    kernel_size: 2
    stride: 2
  }
}
layer {
  bottom: "pool2"
  top: "conv3_1"
  name: "conv3_1"
  type: "Convolution"
  convolution_param {
    num_output: 24
    pad: 1
    kernel_size: 3
  }
}
layer {
  bottom: "conv3_1"
  top: "conv3_1"
  name: "relu3_1"
  type: "ReLU"
}
layer {
  bottom: "conv3_1"
  top: "conv3_2"
  name: "conv3_2"
  type: "Convolution"
  convolution_param {
    num_output: 17
    pad: 1
    kernel_size: 3
  }
}
layer {
  bottom: "conv3_2"
  top: "conv3_2"
  name: "relu3_2"
  type: "ReLU"
}
layer {
  bottom: "conv3_2"
  top: "conv3_3"
  name: "conv3_3"
  type: "Convolution"
  convolution_param {
    num_output: 13
    pad: 1
    kernel_size: 3
  }
}
layer {
  bottom: "conv3_3"
  top: "conv3_3"
  name: "relu3_3"
  type: "ReLU"
}
layer {
  bottom: "conv3_3"
  top: "pool3"
  name: "pool3"
  type: "Pooling"
  pooling_param {
    pool: MAX
    kernel_size: 2
    stride: 2
  }
}
layer {
  bottom: "pool3"
  top: "Fea_P3"
  name: "Fea_P3"
  type: "Convolution"
  convolution_param {
    num_output: 18
    pad: 1
    kernel_size: 3
  }
}
layer {
  bottom: "Fea_P3"
  top: "Fea_P3"
  name: "relu_Fea_P3"
  type: "ReLU"
}
layer {
  bottom: "pool3"
  top: "conv4_1"
  name: "conv4_1"
  type: "Convolution"
  convolution_param {
    num_output: 33
    pad: 1
    kernel_size: 3
  }
}
layer {
  bottom: "conv4_1"
  top: "conv4_1"
  name: "relu4_1"
  type: "ReLU"
}
layer {
  bottom: "conv4_1"
  top: "conv4_2"
  name: "conv4_2"
  type: "Convolution"
  convolution_param {
    num_output: 20
    pad: 1
    kernel_size: 3
  }
}
layer {
  bottom: "conv4_2"
  top: "conv4_2"
  name: "relu4_2"
  type: "ReLU"
}
layer {
  bottom: "conv4_2"
  top: "conv4_3"
  name: "conv4_3"
  type: "Convolution"
  convolution_param {
    num_output: 18
    pad: 1
    kernel_size: 3
  }
}
layer {
  bottom: "conv4_3"
  top: "conv4_3"
  name: "relu4_3"
  type: "ReLU"
}
layer {
  bottom: "conv4_3"
  top: "pool4"
  name: "pool4"
  type: "Pooling"
  pooling_param {
    pool: MAX
    kernel_size: 2
    stride: 2
  }
}
layer {
  bottom: "pool4"
  top: "Fea_P4"
  name: "Fea_P4"
  type: "Convolution"
  convolution_param {
    num_output: 28
    pad: 1
    kernel_size: 3
  }
}
layer {
  bottom: "Fea_P4"
  top: "Fea_P4"
  name: "relu_Fea_P4"
  type: "ReLU"
}

Here I mark both Fea_P3 and Fea_P4 as outputs.

Hi,

Sorry to keep you waiting.

This issue is fixed in our next release. Please watch for the release announcement and update when it is available.

Thanks.