PGI CUDA FORTRAN performance issue

mbpl · May 5, 2010, 1:14pm

Hello,

When I use pgi 10.3 to compile my cuda fortran programs I got a slow down of about 2.5 comparatively to a compilation with the version 10.2
Did someone else have the same issue ?

Matt

MatColgrove · May 5, 2010, 6:06pm

Hi Matt,

Does your program perform many divides? In 10.2 we were getting reports of wrong answers when using divides. The default hardware divide is simply not precise enough for many customer. Hence, in 10.3 we started using a more precise divide. Unfortunately, this precise divide is much slower then the default and began causing slow-downs. In 10.4, we updated the ‘-Mcuda=fastmath’ flag to revert back to the less precise but faster divide.

If you do use many divides, then I would recommend upgrading to 10.4 (or 10.5 in a day or two) and use the “-Mcuda=fastmath” flag.

If you don’t use many divides, please send a report to PGI Customer Service (trs@pgroup.com) including a reproducing example, since we will need to investigate the problem.

Thanks,
Mat

mbpl · May 6, 2010, 7:56am

Hi Mat,

Thanks for your answer, it has been driving me nuts.
About one tenth of my floating point operations are divisions (according to the ptx). I did not remark accuracy problem using the version 10.2.
So I will keep going with the 10.2 until the admin install the newest versions.

Matt

Topic		Replies	Views
division in CUDA Fortran Legacy PGI Compilers (archived)	2	3402	December 4, 2010
Compilation speed of different compiler versions. Legacy PGI Compilers (archived)	2	3300	October 22, 2010
CUDA Fortran slower? Legacy PGI Compilers (archived)	9	4971	March 7, 2011
cuda fortran does not work on my computer now Legacy PGI Compilers (archived)	6	1774	December 3, 2019
Compilation time Legacy PGI Compilers (archived)	2	2650	October 26, 2010
Strange performance across CUDA versions CUDA Programming and Performance	2	510	December 28, 2020
Survey for PGI FORTRAN compiler ï¼Thanks~ CUDA Programming and Performance	7	12617	July 27, 2010
program does not work with PGI194+cuda10.1 under Ubuntu 18.04 Legacy PGI Compilers (archived)	4	1101	September 28, 2019
Performance decrease with PGI 12.1 Legacy PGI Compilers (archived)	11	6457	May 10, 2012
compiling and executing Legacy PGI Compilers (archived)	1	1936	February 1, 2010

PGI CUDA FORTRAN performance issue

Related topics