TensorRT specify layer NCHW -> NHWC

michele.pratusevich · December 19, 2018, 2:27pm

TensorRT expects inputs to a network in NCHW format. Is there any way to specify a different format when you are constructing the network from a Caffe / UFF / ONNX parser? I can’t find anything in the API documentation (https://docs.nvidia.com/deeplearning/sdk/tensorrt-archived/tensorrt_401/tensorrt-api/c_api) that suggests that you can, but I wanted to ask if anyone has run into a similar problem.

I know that NCHW is better performance-wise, but I am willing to take the hit on one single layer (my input layer) to avoid having to reorder my memory manually (or using a CUDA helper).

AastaLLL · December 20, 2018, 5:15am

Hi,

It’s recommended to use NCHW format to get better performance with TensorRT.

Actually, we allow a user to input an NHWC model, but we automatically insert several format converters to make it compatible.
We chose NCHW for our implementation because of GPU acceleration.

If NHWC format is preferred, you can just let the UFF parser handle the compatibility for you.
If performance is more important, it’s recommended to use NCHW across the whole model.

Thanks.

michele.pratusevich · December 20, 2018, 1:35pm

I see that this is possible with the UFF parser - is there any way this function can be applied after a network has already been parsed, to the input layer? I am working with a Caffe (and not a UFF) model and didn’t realize the functionality for importing was different.

For example, if I have an ITensor (https://docs.nvidia.com/deeplearning/sdk/tensorrt-archived/tensorrt_401/tensorrt-api/c_api/classnvinfer1_1_1_i_tensor.html) there doesn’t seem to be a way to override the ordering for just that layer, as opposed to the entire model?

AastaLLL · December 21, 2018, 7:01am

Hi,

In general, Caffe uses NCHW format and doesn’t have the format compatibility issue.

Do you want to use the converter as part of your model?
May I know more about your use case?

Thanks.

michele.pratusevich · December 21, 2018, 1:59pm

I have a model in Caffe and I am importing it for use in TensorRT. I want to change the first layer (or add a layer) that takes an input in HWC (or frankly even CWH format) but have the rest of the network use CHW format.

AastaLLL · December 24, 2018, 3:57am

Hi,

You can add a permute layer between the input and the rest of the network:
https://github.com/intel/caffe/blob/master/src/caffe/layers/permute_layer.cpp

Thanks.

michele.pratusevich · December 27, 2018, 2:38pm

Sorry, am I missing something? That layer doesn’t have an equivalent in TensorRT, so I would need to implement it manually anyway? Is that right?
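
For reference, the manual reordering discussed in this thread (an HWC input fed to a network that expects CHW) comes down to an index permutation over a flat buffer. Below is a minimal sketch in plain Python, assuming the image is stored as one flat, interleaved HWC buffer. The function name `hwc_to_chw` is hypothetical and is not part of TensorRT or Caffe; it only illustrates what a permute step would compute on the host side.

```python
def hwc_to_chw(buf, height, width, channels):
    """Reorder a flat HWC (interleaved) buffer into flat CHW (planar) order.

    Hypothetical helper for illustration only; not a TensorRT API.
    """
    if len(buf) != height * width * channels:
        raise ValueError("buffer size does not match H * W * C")
    out = [None] * len(buf)
    for h in range(height):
        for w in range(width):
            for c in range(channels):
                src = (h * width + w) * channels + c   # offset in HWC layout
                dst = (c * height + h) * width + w     # offset in CHW layout
                out[dst] = buf[src]
    return out


# 2x2 image with 3 channels, stored as interleaved (R, G, B) triples.
pixels = ["r0", "g0", "b0", "r1", "g1", "b1",
          "r2", "g2", "b2", "r3", "g3", "b3"]
planar = hwc_to_chw(pixels, height=2, width=2, channels=3)
```

After the call, `planar` holds all red values first, then all green, then all blue, which is the planar layout the rest of the network would consume. In practice the same permutation would more likely run on the GPU (for example in a small CUDA kernel) rather than in a host-side loop like this one.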
