CUDA performance running at fp16 precision in Linux 35-40% lower than in Windows with RTX 2080TI

zz4032 · November 24, 2018, 4:50pm

Hello,

I saw a similar topic just below mine about lower performance on Kepler hardware, but decided to open a new thread.
In my case it’s an RTX 2080 Ti running Ubuntu 18.04; the same problem occurs on Ubuntu 16.04.

I get about 35-40% lower performance on Linux than on Windows when running a chess neural network engine with the cuDNN backend at fp16 precision, using driver version 410.72. With fp32 precision, performance is as expected (and the same as on Windows).
With fp16 I’m also seeing lower GPU utilization: 40-50%, where it should be 95-100%.
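
For anyone who wants to compare utilization figures like these on their own machine, here is a minimal sketch that polls `nvidia-smi` and averages the readings. The helper names (`parse_utilization`, `query_gpu_utilization`, `is_underutilized`) and the 90% threshold are my own assumptions for illustration, not part of the engine or any NVIDIA tool. It also assumes `nvidia-smi` is on the PATH and reports a plain integer for each GPU.

```python
import subprocess


def parse_utilization(csv_text):
    """Parse nvidia-smi CSV output: one integer percentage per GPU, one per line."""
    return [int(line.strip()) for line in csv_text.splitlines() if line.strip()]


def query_gpu_utilization():
    """Ask nvidia-smi for the current utilization of every GPU, in percent."""
    result = subprocess.run(
        ["nvidia-smi", "--query-gpu=utilization.gpu",
         "--format=csv,noheader,nounits"],
        capture_output=True, text=True, check=True,
    )
    return parse_utilization(result.stdout)


def is_underutilized(samples, threshold=90):
    """Return True if the mean of the sampled percentages is below threshold.

    The threshold is a hypothetical cut-off chosen for this sketch.
    """
    return sum(samples) / len(samples) < threshold
```

Calling `query_gpu_utilization()` once a second while the engine is running, then passing the collected readings for one GPU to `is_underutilized`, gives a rough check that can be repeated in both fp16 and fp32 mode.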

An NVIDIA bug report is attached; it was generated while the GPU was under load.

cuDNN version: v7.4.1 (Nov 8, 2018)
CUDA version: 10.0

The same problem occurs with the previous cuDNN version, 7.4.0.

It looks to me as if the GPU simply isn’t being used at full power in this mode.

Edit:
Solved by correcting the compile options. Performance on Linux is OK now.
