GTX 1080 faster than Tesla P4 with INT8 acceleration?

Hello,

According to NVIDIA's official website, blogs, and forums, Tesla GPUs (such as the P4) are a good choice for INT8-accelerated inference, while GTX GPUs (such as the 1080) are not recommended for it.

But I have tried several times, and it seems that the GTX 1080 is faster than the P4. Below is my test environment:

  1. DeepStream 3.0
  2. TensorRT 5.0
  3. one GTX 1080 GPU and one Tesla P4 GPU
  4. Ubuntu 18.04
  5. 1080p H.264 video
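For reference, INT8 mode is enabled through the nvinfer config file, which builds the TensorRT engine underneath. As a minimal sketch, the equivalent TensorRT 5.0 Python API calls look roughly like this (the network parsing step and `my_calibrator` are placeholders, not my actual test code):

```python
import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)

builder = trt.Builder(TRT_LOGGER)
network = builder.create_network()
# ... parse the SSD / YOLOv3 model into `network` here (omitted) ...

# Both cards are compute capability 6.1, so fast INT8 (dp4a) is available:
assert builder.platform_has_fast_int8

builder.max_workspace_size = 1 << 30      # 1 GiB build workspace
builder.int8_mode = True                  # enable INT8 kernels
builder.int8_calibrator = my_calibrator   # placeholder: an IInt8EntropyCalibrator2

engine = builder.build_cuda_engine(network)
```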

Results:

  1. With INT8 acceleration running SSD object-detection inference, the GTX 1080 can run 24 streams in real time, but the Tesla P4 can only run 20 streams in real time. GPU utilization on both is between 60% and 70%;

  2. With INT8 acceleration running YOLOv3 object-detection inference, the GTX 1080 can run 8 streams in real time, but the Tesla P4 can only run 4 streams in real time. GPU utilization on both is between 80% and 90%;
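(The GPU-Util figures above are from nvidia-smi; here is a small pynvml sketch that samples the same counter, assuming the pynvml package is installed:)

```python
import time
import pynvml

pynvml.nvmlInit()
handle = pynvml.nvmlDeviceGetHandleByIndex(0)  # 0 = first GPU; use 1 for the other card

for _ in range(10):
    # Same counter that nvidia-smi reports as GPU-Util:
    util = pynvml.nvmlDeviceGetUtilizationRates(handle)
    sm_clock = pynvml.nvmlDeviceGetClockInfo(handle, pynvml.NVML_CLOCK_SM)
    print(f"GPU util: {util.gpu}%  SM clock: {sm_clock} MHz")
    time.sleep(1)

pynvml.nvmlShutdown()
```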

I used the nvinfer plugin in both test pipelines. The SSD inference follows the demo from the DeepStream 3.0 release package, and the YOLOv3 inference follows the GitHub sample (https://github.com/NVIDIA-AI-IOT/deepstream_reference_apps/tree/master/yolo/samples/objectDetector_YoloV3).

So, can anyone tell me why the 1080 is faster than the P4 with INT8 acceleration? Thanks!

Actually, the GTX 1080 and the Tesla P4 are both GP104 parts with the same number of CUDA cores.

Both have 2560 CUDA cores.

The difference is clock speed and power budget: the GTX 1080 boosts to about 1733 MHz at 180 W, while the Tesla P4 boosts to only around 1.1 GHz within its 50-75 W envelope, so the 1080's peak INT8 throughput is higher.
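As a rough sanity check, here is a back-of-envelope estimate of peak INT8 throughput via the dp4a instruction (one dp4a per core per clock = 4 multiplies + 4 adds = 8 integer ops); the boost clocks are the commonly quoted figures, not values measured on these specific cards:

```python
def peak_int8_tops(cuda_cores, boost_clock_mhz, ops_per_core_per_clock=8):
    """Peak INT8 (dp4a) throughput in tera-operations per second."""
    return cuda_cores * boost_clock_mhz * 1e6 * ops_per_core_per_clock / 1e12

# Assumed boost clocks: ~1733 MHz (GTX 1080), ~1063 MHz (Tesla P4)
print(f"GTX 1080: {peak_int8_tops(2560, 1733):.1f} TOPS")  # ~35.5 TOPS
print(f"Tesla P4: {peak_int8_tops(2560, 1063):.1f} TOPS")  # ~21.8 TOPS (datasheet: 22 TOPS)
```

That gap lines up with the 1080 coming out ahead in your stream counts.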

Also, per NVIDIA's policy, the 1080 isn't licensed for inference deployment (the GeForce driver EULA restricts datacenter use).

OK, Thanks!

So the result is reasonable.