Performance Differences between RTX 3080 and Nvidia T4 GPUs in a DeepStream Application

madisi98 · January 24, 2023, 12:18pm

• Hardware Platform (Jetson / GPU) GPU
• DeepStream Version 6.1.1
• TensorRT Version Latest ngc cloud deepstream triton image
• NVIDIA GPU Driver Version (valid for GPU only) 515
• Issue Type( questions, new requirements, bugs) Questions

I currently have two computers where I’m running my software. One at home with an rtx 3080 which is where I develop and another one in a “production” server which has 2 Nvidia T4s.

I have been doing some benchmarks of a yolo 7 engine generated with trtexec for both computers and I’m seeing some odd numbers.

In the 3080 I’m able to get 581 fps with a batch of 8, while in a single T4 I’m only being able to get 172 fps with the same batch.

This is for me counter intuitive, since T4s are cards that are supposed to go in data centers and servers, and also way more expensive. Is there a reasoning behind this? Or does 3080 make more sense for my use case?

Forgot to mention the engine is FP16

mchi · January 26, 2023, 9:50am

Hi @madisi98
From Tesla T4 vs GeForce RTX 3080 [in 3 benchmarks] , RTX 3080 is Ampere GPU arch, while T4 is Turing GPU arch. RTX 3080 has more CUDA Cores/Tensor Cores, and RTX 3080 has higher GPU clock.
So, I think it’s expected that RTX 3080 has higher tops than T4.
But T4 is data center card, it has longer lifetime and ECC feature.

madisi98 · January 26, 2023, 1:43pm

Right, I see also the 3080 can only do 3 h264 encodings at a given time while the T4 has an unrestricted amount of encodings that can be done.

Why is it this way if the 3080 is a more capable card?

mchi · January 26, 2023, 2:51pm

Sorry, I don’t get your question

madisi98 · January 26, 2023, 5:40pm

I don’t get why the T4 can do more video encodings at once than the 3080, since the later is capped at 3

mchi · January 27, 2023, 9:30am

There is no update from you for a period, assuming this is not an issue anymore.
Hence we are closing this topic. If need further support, please open a new one.
Thanks

Here is the info - NVENC Application Note :: NVIDIA Video Codec SDK Documentation

system · February 28, 2023, 5:36am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Performance differences DeepStream SDK tensorrt , camera , ubuntu , gstreamer	3	443	June 21, 2023
Performance on T4 is over 3x slower than that on 2080Ti on DeepStream5 DeepStream SDK gstreamer	7	1881	October 12, 2021
H264 1080p transcoding: RTX 5000 or Tesla T4 Video Processing & Optical Flow	1	1629	July 1, 2020
Tesla T4 has significantly higher GPU utilization than RTX 2080 Ti Frameworks (archived) tensorflow	0	929	July 29, 2020
Best GPU for Video transrate (H264/VP8/VP9) DeepStream SDK	5	3738	October 12, 2021
Best GPU for AI workloads (not DL training) CUDA Programming and Performance	16	6745	April 1, 2021
Performance differences DeepStream SDK	2	366	June 21, 2023
GTX 1080 is fatser than Tesla P4 with INT8 accelerating ??? DeepStream SDK	2	2040	April 25, 2019
GPU with Maximum number of video trans coding with single session DeepStream SDK	4	1063	October 12, 2021
Understanding When/Why DeepStream 5.0 caps the performance DeepStream SDK performance	7	1347	October 12, 2021

Performance Differences between RTX 3080 and Nvidia T4 GPUs in a DeepStream Application

Related topics