The inference of [ Deconvolution + Other Operations ], for example [ Deconvolution + Convolution ], in TensorRT is slower than MXNet

Hi All,
We want to convert some models trained with MXNet 1.5.0 to TensorRT 7.0 through onnx-tensorrt (7.0) to speed up inference, but we have some questions about deconvolution. If other operations follow a deconvolution, for example a convolution, the model's inference may be slower and may use more GPU memory. The comparisons are as follows.
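For context on why a deconvolution followed by another operation can be costly: a transposed convolution can be viewed as scattering each input element across a kernel-sized output window (equivalently, inserting stride-1 zeros and then running an ordinary convolution), so its output is larger than its input and the following layer operates on that enlarged tensor. The sketch below is a minimal, hypothetical 1-D illustration of that scatter-add view using only NumPy; it is not the model from this issue.

```python
import numpy as np

def conv_transpose_1d(x, w, stride=2):
    """Naive 1-D transposed convolution via scatter-add.

    Each input element x[i] contributes w scaled by x[i] to the
    output window starting at i * stride. The output is larger
    than the input, which is why whatever layer follows a
    deconvolution has more data to process.
    """
    k = len(w)
    out_len = (len(x) - 1) * stride + k
    out = np.zeros(out_len)
    for i, xi in enumerate(x):
        out[i * stride : i * stride + k] += xi * w
    return out

x = np.array([1.0, 2.0, 3.0])
w = np.array([1.0, 1.0])
y = conv_transpose_1d(x, w, stride=2)
print(y)  # length (3 - 1) * 2 + 2 = 6, twice the input length
```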

  1. deconv+conv
    mxnet speed: 21.4385s/100 gpu_memory: 1745M
    tensorrt speed: 39.9927s/100 gpu_memory: 1631M

  2. deconv+activation+conv
    mxnet speed: 22.1862s/100 gpu_memory: 1745M
    tensorrt speed: 41.0732s/100 gpu_memory: 2653M

  3. deconv+slice_axis
    mxnet speed: 20.6279s/100 gpu_memory: 1745M
    tensorrt speed: 38.6454s/100 gpu_memory: 1627M

  4. upsampling+conv
    mxnet speed: 67.3117s/1000 gpu_memory: 1741M
    tensorrt speed: 56.8737s/1000 gpu_memory: 1625M
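To make the s/100 figures above reproducible, a timing harness along these lines could be shared alongside the model. This is a generic sketch with a hypothetical stand-in workload (`dummy`), not the poster's actual script; for a real GPU benchmark the device must also be synchronized after each call, since `time.perf_counter` alone only measures host-side time.

```python
import time
import numpy as np

def benchmark(run_inference, n_runs=100, warmup=10):
    """Time n_runs forward passes, excluding warmup iterations.

    `run_inference` is a placeholder for one forward pass of the
    MXNet module or the TensorRT engine. Warmup runs are needed so
    that lazy initialization and autotuning do not skew the total.
    """
    for _ in range(warmup):
        run_inference()
    start = time.perf_counter()
    for _ in range(n_runs):
        run_inference()
    return time.perf_counter() - start

# Hypothetical CPU stand-in for a forward pass, for illustration only.
dummy = lambda: np.dot(np.ones((64, 64)), np.ones((64, 64)))
total = benchmark(dummy)
print(f"{total:.4f}s/100")
```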

We find it strange that the performance in TensorRT is not better than MXNet.

Any help is appreciated. Thank You!


Can you provide the following information so we can better help?

Provide details on the platforms you are using:
o Linux distro and version
o GPU type
o Nvidia driver version
o CUDA version
o CUDNN version
o Python version [if using python]
o Tensorflow and PyTorch version
o TensorRT version

Also, please share the scripts / model file to reproduce the issue.

The details are:

  1. Ubuntu 18.04
  2. GTX 1080
  3. driver version: 430.35
  4. CUDA version: 10.0
  5. cuDNN version: 7.6
  6. Python version: 3.6
  7. TensorFlow and PyTorch are not used. We use MXNet 1.5.0, onnx 1.2.1, onnx-tensorrt 7.0
  8. TensorRT version: 7.0

Looking forward to your reply!

I am hitting a similar problem: deconvolutionNd (3-D) in TRT is much slower than conv_transpose3d in PyTorch.