RTX3070 performance with TensorRT

gahang · December 9, 2020, 5:42am

Description

I ran ResNet18 inference on an RTX3070 and get a throughput of <300/s at batch size 50. In the same environment, an RTX2080Ti reaches 500/s.
Is this a normal result, or do the drivers still need to be optimized?

Environment

TensorRT Version: 7.2.1
GPU Type: RTX3070
Nvidia Driver Version: 455.45
CUDA Version: 11.1
CUDNN Version: 8.0.5
Operating System + Version: Ubuntu 16.04
Python Version (if applicable):
TensorFlow Version (if applicable):
PyTorch Version (if applicable):
Baremetal or Container (if container which image + tag):

Relevant Files

NA

Steps To Reproduce

NA

SunilJB · December 9, 2020, 9:00am

Hi @gahang,

Could you please share the model and script file to reproduce this issue?
Also if possible, could you please share the verbose logs and profiler output of both the test runs?

Thanks
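
For a comparison like the one requested above, the throughput figure is only meaningful if both GPUs are timed the same way. The following is a minimal sketch of such a timing harness, not the original poster's script. `measure_throughput` and `run_batch` are hypothetical names: `run_batch` stands for whatever callable executes one batch of inference, and it is assumed to block until the GPU has finished (for example by synchronizing the stream before returning).

```python
import time


def measure_throughput(run_batch, batch_size, iterations=100, warmup=10):
    """Return samples per second for a callable that processes one batch.

    run_batch is a hypothetical zero-argument callable; it is assumed to
    return only after the batch has fully completed on the device.
    """
    # Warm-up runs are excluded from timing so that one-time costs
    # (lazy initialization, clock ramp-up) do not skew the result.
    for _ in range(warmup):
        run_batch()

    start = time.perf_counter()
    for _ in range(iterations):
        run_batch()
    elapsed = time.perf_counter() - start

    # Throughput in samples per second, not batches per second.
    return batch_size * iterations / elapsed
```

Running the same harness with the same batch size (50 in the report above) on both cards, and stating whether the number is samples or batches per second, makes the two results directly comparable.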

