What is the difference between TF and TRT weight formats?

Hi, I create a network with the TRT API and load TF weights into the TRT network, but the results are different from TensorFlow's.
I think the weight format difference may be causing the problem, so I have a few questions:

(1) For the conv weights, TF uses RSCK ([filter_height, filter_width, input_depth, output_depth]) and TensorRT uses KCRS, so do I need to transpose the weights before feeding them to the conv layers? However, in sampleMNISTAPI there is no transpose of the TF weights for TRT.
For my example:
w1 = weightMap["tf_conv1"]  # RSCK
# transpose the TF weights here???
conv1 = network.add_convolution(data, 32, (11,41), trt.infer.Weights(w1), b1)

(2) For the IScaleLayer, I set tensorrt.ScaleMode.CHANNEL. The TRT docs say that the channel dimension is assumed to be the third-to-last dimension, but the TRT format is CHW, so do I need to transpose the input of the scale layer from CHW to HWC?

(3) For the RNN layer, the TRT docs say that the TF weight format is data_len x hidden_size, while TRT expects hidden_size x data_len, so do I need to transpose the weights of each gate of the RNN cell? (A small sketch of the transpose I mean is shown after question (4) below.)

(4) Recently, TF added support for setting the data format to channels_first (NCHW) or channels_last (NHWC), so if I train a TF model with data_format=channels_first, is the TF weight format the same as the TRT weight format? If so, does that mean I don't need to convert the TF weights for TRT?
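
For question (3), this is the kind of transpose I mean, as a minimal numpy sketch; the sizes and names are made up just for illustration:

import numpy as np

# Hypothetical single-gate weight matrix in the TF layout: [data_len, hidden_size]
data_len, hidden_size = 161, 1024
w_tf = np.random.randn(data_len, hidden_size).astype(np.float32)

# The layout the TRT docs describe: [hidden_size, data_len]
w_trt = np.ascontiguousarray(w_tf.T)

print(w_tf.shape)   # (161, 1024)
print(w_trt.shape)  # (1024, 161)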
Thanks.

Hello,

For TensorFlow models, the UFF parser automatically performs the required format handling.
Note: no matter whether the model is NCHW or NHWC, remember to register your input blob in NCHW format.

You can find detailed information in our documentation:
https://docs.nvidia.com/deeplearning/sdk/tensorrt-developer-guide/index.html#mnist_uff_keyconcepts
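
For example, a minimal sketch of registering the input in NCHW order when parsing a UFF model; the tensor names, shape, and file path here are placeholders, and this assumes the trt.UffParser Python bindings:

import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(TRT_LOGGER)
network = builder.create_network()
parser = trt.UffParser()

# Register the input blob in CHW order even if the TF model was trained NHWC;
# the parser handles the required format conversion internally.
parser.register_input("input_0", (3, 224, 224))
parser.register_output("output_0")
parser.parse("model.uff", network)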

Hi, what I mean is that I create a network definition from scratch using the Python TRT API and load the TF weights into it, like:

w1 = weightMap[CONV1_CONVW]
b1 = trt.infer.Weights.empty(trt.infer.DataType.FLOAT)
conv1 = network.add_convolution(data, 32, (11,41), trt.infer.Weights(w1), b1)

and the TF weight format is different from the TRT format.

So I wonder whether I need to convert the TF weights (w1) to the TRT format before feeding them to the network.add_convolution API?

Hi @jianxiangm,

did you test transposing the weight format yet? I am doing something similar here: coding the network definition with the C++ API and loading the convolutional and bias weights from a trained .h5 file from Keras. The C++ code gives me totally different results, even with only one conv layer.

I am also concerned that the weight format could be the issue and am about to test it out, but I am wondering whether you have already tested this?

Thank you very much!

The TF weight format is RSCK, but the TRT weight format is KCRS, so you need to transpose the weights for the conv layers.
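
For example, reusing the names from my snippet above (a rough sketch, assuming w1 comes in as a numpy array in RSCK order):

import numpy as np

w1 = weightMap["tf_conv1"]                                  # RSCK: (R, S, C, K), e.g. (11, 41, 1, 32)
w1_kcrs = np.ascontiguousarray(w1.transpose(3, 2, 0, 1))    # KCRS: (K, C, R, S), e.g. (32, 1, 11, 41)
conv1 = network.add_convolution(data, 32, (11, 41), trt.infer.Weights(w1_kcrs), b1)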

Hi @jianxiangm,

thank you very much for the reply! I transposed the weight format and now my C++ network is able to do inference.

Thanks again for your post in the first place! I didn't find anything related to weight formats in the latest documentation, but from your post I was able to find the documentation for TRT 2.1 or so and get a better handle on the issue.