TensorRT vs TVM

emizhang · August 15, 2019, 6:18am

I have been training a YOLOv3 model in PyTorch and converting it to an ONNX file to run with TensorRT. I’ve noticed some cases where the PyTorch model and the TensorRT model perform differently, and I’m wondering what the pros and cons of TensorRT are compared to other compilers such as TVM.

scrin · August 15, 2019, 8:20am

Convolution: TensorRT implements many algorithms for both FP32 and INT8 convolution. TVM only implements direct and Winograd convolution, and it takes almost a day of tuning on a server to find a fast convolution config.
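To make the direct vs. Winograd distinction concrete, here is a minimal sketch of the 1D Winograd F(2,3) transform checked against a direct sliding-window convolution. It is plain Python for illustration only, not code from TVM or TensorRT; the function names `direct_conv_1d` and `winograd_f23` are made up for this example.

```python
def direct_conv_1d(d, g):
    """Direct (sliding-window) 1D correlation: one multiply per tap per output."""
    n_out = len(d) - len(g) + 1
    return [sum(d[i + k] * g[k] for k in range(len(g))) for i in range(n_out)]


def winograd_f23(d, g):
    """Winograd F(2,3): 2 outputs from a 4-element input tile and a 3-tap filter.

    Uses 4 multiplications between transformed input and transformed filter
    instead of the 6 needed by the direct method for the same two outputs.
    """
    assert len(d) == 4 and len(g) == 3
    # Filter transform (can be precomputed once per filter).
    u0 = g[0]
    u1 = (g[0] + g[1] + g[2]) / 2.0
    u2 = (g[0] - g[1] + g[2]) / 2.0
    u3 = g[2]
    # Input transform followed by the 4 element-wise products.
    m1 = (d[0] - d[2]) * u0
    m2 = (d[1] + d[2]) * u1
    m3 = (d[2] - d[1]) * u2
    m4 = (d[1] - d[3]) * u3
    # Output transform.
    return [m1 + m2 + m3, m2 - m3 - m4]


d = [1.0, 2.0, 3.0, 4.0]   # input tile
g = [0.5, -1.0, 2.0]       # filter taps
print(direct_conv_1d(d, g))
print(winograd_f23(d, g))
```

Both calls should print the same two values. Real implementations apply the 2D version of this transform per tile and per channel, which is part of why finding a fast configuration takes tuning.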

Deconvolution: TensorRT has full support. TVM’s deconvolution doesn’t support groups or INT8.

Quantization: TensorRT has full post-training quantization support; the open-source TVM quantization is incomplete.
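For readers unfamiliar with post-training quantization, the basic idea can be sketched as follows: pick a scale from calibration data, then map floats to INT8 and back. This sketch assumes a simple symmetric max-abs scale, which is an illustrative choice and not necessarily what TensorRT or TVM does internally; all names here are hypothetical.

```python
def calibrate_scale(samples):
    """Pick a symmetric per-tensor scale from calibration samples (max-abs rule)."""
    max_abs = max(abs(x) for x in samples)
    # Fall back to 1.0 if every sample is zero, to avoid dividing by zero later.
    return max_abs / 127.0 if max_abs > 0 else 1.0


def quantize(x, scale):
    """Map a float to a signed 8-bit integer in [-127, 127], clamping outliers."""
    q = round(x / scale)
    return max(-127, min(127, q))


def dequantize(q, scale):
    """Map the INT8 value back to an approximate float."""
    return q * scale


calibration = [-2.54, -0.3, 0.0, 0.7, 1.0]  # assumed calibration activations
scale = calibrate_scale(calibration)

for x in [0.7, 1.0, 10.0]:
    q = quantize(x, scale)
    print(x, q, dequantize(q, scale))
```

Values outside the calibrated range (such as 10.0 here) saturate at 127, which is why the choice of calibration data and scale rule affects accuracy after quantization.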

TVM pros:

Open source.
After about a day of tuning on a server, the tuned model may be a little faster than TensorRT.

TensorRT cons:

Still has some bugs.

I’m not familiar with other compilers.

user148674 · March 24, 2022, 5:04am

More than 2 years later, how does the comparison play out now?

ucekmez · December 1, 2023, 7:01pm

Yeah, it’d be nice to see a recent comparison.
