This could be due to several factors, including system setup and storage configuration. Also make sure the TensorRT version used to optimize the model is the same as the TensorRT version used for inference.
Can you provide details on the platform you are using?
Linux distro and version
GPU type
NVIDIA driver version
CUDA version
cuDNN version
Python version [if using Python]
TensorFlow version
TensorRT version
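If it helps, most of these details can be collected with a short script. This is only a sketch: it assumes a Linux system, and the TensorFlow build-info keys (`cuda_version`, `cudnn_version`) are only present in TensorFlow 2.x builds with GPU support.

```python
import subprocess

def run(cmd):
    """Run a shell command, returning its output or a 'not found' note."""
    try:
        return subprocess.check_output(
            cmd, shell=True, stderr=subprocess.DEVNULL, text=True
        ).strip()
    except (subprocess.CalledProcessError, FileNotFoundError):
        return "(not found)"

# Linux distro and version
print("OS:", run("head -n 2 /etc/os-release"))
# GPU type and NVIDIA driver version
print("GPU/driver:", run(
    "nvidia-smi --query-gpu=name,driver_version --format=csv,noheader"))
# CUDA toolkit version
print("CUDA:", run("nvcc --version | tail -n 1"))
# Python version
print("Python:", run("python3 --version"))

# TensorFlow / cuDNN / TensorRT versions via their Python packages,
# if they are installed in this environment.
try:
    import tensorflow as tf
    info = tf.sysconfig.get_build_info()  # build-time CUDA/cuDNN versions
    print("TensorFlow:", tf.__version__,
          "| CUDA:", info.get("cuda_version"),
          "| cuDNN:", info.get("cudnn_version"))
except ImportError:
    print("TensorFlow: (not installed)")
try:
    import tensorrt
    print("TensorRT:", tensorrt.__version__)
except ImportError:
    print("TensorRT: (not installed)")
```

Paste the output of that into your reply and we can rule out version mismatches quickly.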
To help us debug, can you share a small repro containing the model, inference code, and sample input data that demonstrates the performance difference?
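For the repro, a simple timing harness like the one below is usually enough. This is a generic sketch, not TensorRT-specific: `benchmark` and the `np.tanh` stand-in are placeholders for your actual TensorRT and TensorFlow inference calls, so we can compare the two paths on the same input.

```python
import time
import numpy as np

def benchmark(infer_fn, sample_input, warmup=10, iters=100):
    """Return mean latency of infer_fn(sample_input) in milliseconds."""
    for _ in range(warmup):            # warm-up runs, excluded from timing
        infer_fn(sample_input)
    start = time.perf_counter()
    for _ in range(iters):
        infer_fn(sample_input)
    return (time.perf_counter() - start) / iters * 1e3

# Stand-in input and inference function; substitute your model's
# input shape and your TensorRT / TensorFlow inference calls here.
x = np.random.rand(1, 3, 224, 224).astype(np.float32)
print(f"mean latency: {benchmark(np.tanh, x):.3f} ms")
```

Running the same harness against both engines makes the comparison reproducible on our side.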