TensorRT NCHW vs cuDNN NHWC

Hi everybody, I have a question regarding tensor memory layout in TensorRT. Some frameworks use an NCHW layout, others use NHWC.
Reading the cuDNN documentation, it seems 2D convolutions on Tensor Cores achieve the best performance with the NHWC memory layout. This thread, though, explicitly states that the TensorRT implementation is NCHW.
Since I have Tensor Cores available, is there a way for my CNN to take advantage of the extra performance provided by the NHWC memory layout? Or is this automatically taken care of by TensorRT?

Hi @albecenz,

Yes, it is automatically taken care of by TensorRT.
TRT internally tries all kinds of tensor layouts while building the engine and selects the fastest kernel/layout combination.
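For example, a minimal build sketch (assuming the TensorRT 8.x Python API; `model.onnx` is a hypothetical path): enabling FP16 makes the Tensor Core kernels, which prefer NHWC-like layouts internally, eligible during the builder's auto-tuning, while the network inputs and outputs stay NCHW (LINEAR):

```python
# Minimal build sketch (TensorRT 8.x Python API; "model.onnx" is a
# hypothetical path). I/O layout stays NCHW (LINEAR); the builder is
# free to use NHWC-style Tensor Core kernels internally.
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(logger)
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
parser = trt.OnnxParser(network, logger)
with open("model.onnx", "rb") as f:
    parser.parse(f.read())

config = builder.create_builder_config()
# FP16 makes Tensor Core kernels eligible; TRT's auto-tuner then picks
# whichever layout/kernel combination times fastest, inserting reformat
# layers only where needed.
config.set_flag(trt.BuilderFlag.FP16)

engine_bytes = builder.build_serialized_network(network, config)
```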

Thank you.

Thanks


@spolisetty

I believe a Reformat (e.g., NCHW to NHWC) still has a cost for large inputs, so would it be better to convert the model to NHWC before feeding it to the TensorRT converter? To the best of my knowledge, though, I have not seen any ONNX model in NHWC format. Do you have any idea?
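One way to see whether the builder actually inserted reformat layers (a sketch, assuming TensorRT >= 8.2 and a hypothetical serialized engine `model.engine`) is to query the engine inspector and scan the layer list for the reformat nodes TRT adds:

```python
# Sketch: list the layers of a built engine and look for builder-inserted
# reformats. Assumes TensorRT >= 8.2; "model.engine" is a hypothetical path.
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
runtime = trt.Runtime(logger)
with open("model.engine", "rb") as f:
    engine = runtime.deserialize_cuda_engine(f.read())

inspector = engine.create_engine_inspector()
# One line per layer; layout conversions typically show up with names
# like "Reformatting CopyNode for Input Tensor 0 ...".
for line in inspector.get_engine_information(
        trt.LayerInformationFormat.ONELINE).splitlines():
    if "Reformat" in line:
        print(line)
```

If no reformat layers show up near the convolutions, the layout handling cost is already being absorbed by the fused kernels the builder chose.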