Slow performance Runtime 4.1

pemolux · February 7, 2012, 7:18am

Hello.
I have a problem. I wrote a CUDA program with MS VS 2008 and CUDA Runtime 3.2. The DLL is written in C++, and its functions are called from C#. The program works correctly. Then I ported the solution to VS 2010, CUDA Runtime 4.2, and NVIDIA Nsight 2.1: I created a new solution, added new C# and C++ (Nsight) projects, and pasted my code from the VS 2008 projects into them. The program still works correctly, but the CUDA functions became 3x slower. Has anybody seen a problem like this? Do you have any ideas about compiler settings or something else?
GPU - Nvidia M540 (mobile)
CPU - Core i7 (2 GHz)
OS - Windows 7 (x64)

PatricioVidal · February 9, 2012, 10:44pm

I am having a similar issue. I upgraded the CUDA runtime from 3.2 to 4.1 and some of my kernels are 20% to 40% slower.

njuffa · February 9, 2012, 11:18pm

For significant slowdowns in CUDA 4.1 due to code generation, such as those reported here, I would suggest filing bugs against the compiler. Please attach a self-contained repro case to the bug report. A link to a bug reporting form can be found on the start page of the registered developer website, partners.nvidia.com.
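As a rough illustration of what a self-contained repro case might look like, here is a minimal sketch that times one kernel with CUDA events and prints the runtime and driver versions. The `saxpy` kernel, the problem size, and the repetition count are placeholders assumed for illustration only; they are not taken from the posters' code and should be swapped for the kernel that actually regressed.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Abort with a readable message if a CUDA runtime call fails.
#define CHECK(call)                                                   \
    do {                                                              \
        cudaError_t err_ = (call);                                    \
        if (err_ != cudaSuccess) {                                    \
            fprintf(stderr, "%s:%d: %s\n", __FILE__, __LINE__,        \
                    cudaGetErrorString(err_));                        \
            exit(EXIT_FAILURE);                                       \
        }                                                             \
    } while (0)

// Hypothetical stand-in kernel: replace with the kernel that got slower.
__global__ void saxpy(int n, float a, const float *x, float *y)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) y[i] = a * x[i] + y[i];
}

int main()
{
    const int n = 1 << 20;     // assumed problem size
    const int reps = 100;      // assumed number of timed launches
    const size_t bytes = n * sizeof(float);

    // Report which toolkit/driver this binary is actually running against.
    int rt = 0, drv = 0;
    CHECK(cudaRuntimeGetVersion(&rt));
    CHECK(cudaDriverGetVersion(&drv));
    printf("runtime %d, driver %d\n", rt, drv);

    float *hx = (float *)malloc(bytes);
    float *hy = (float *)malloc(bytes);
    for (int i = 0; i < n; ++i) { hx[i] = 1.0f; hy[i] = 0.0f; }

    float *dx = 0, *dy = 0;
    CHECK(cudaMalloc((void **)&dx, bytes));
    CHECK(cudaMalloc((void **)&dy, bytes));
    CHECK(cudaMemcpy(dx, hx, bytes, cudaMemcpyHostToDevice));
    CHECK(cudaMemcpy(dy, hy, bytes, cudaMemcpyHostToDevice));

    const int threads = 256;
    const int blocks = (n + threads - 1) / threads;

    // Warm-up launch so one-time initialization is not included in the timing.
    saxpy<<<blocks, threads>>>(n, 2.0f, dx, dy);
    CHECK(cudaGetLastError());
    CHECK(cudaDeviceSynchronize());

    cudaEvent_t start, stop;
    CHECK(cudaEventCreate(&start));
    CHECK(cudaEventCreate(&stop));

    CHECK(cudaEventRecord(start, 0));
    for (int r = 0; r < reps; ++r)
        saxpy<<<blocks, threads>>>(n, 2.0f, dx, dy);
    CHECK(cudaEventRecord(stop, 0));
    CHECK(cudaEventSynchronize(stop));
    CHECK(cudaGetLastError());

    float ms = 0.0f;
    CHECK(cudaEventElapsedTime(&ms, start, stop));
    printf("average kernel time: %.4f ms\n", ms / reps);

    // Sanity check: after reps + 1 launches each element should be 2 * (reps + 1).
    CHECK(cudaMemcpy(hy, dy, bytes, cudaMemcpyDeviceToHost));
    const float expected = 2.0f * (reps + 1);
    printf("result %s\n", hy[0] == expected && hy[n - 1] == expected ? "OK" : "MISMATCH");

    CHECK(cudaEventDestroy(start));
    CHECK(cudaEventDestroy(stop));
    CHECK(cudaFree(dx));
    CHECK(cudaFree(dy));
    free(hx);
    free(hy);
    return 0;
}
```

Building this single file with each toolkit using identical options (for example `nvcc -O3 repro.cu -o repro`, plus the same `-arch` value for your GPU) and comparing the printed averages should show whether the slowdown follows the compiler rather than the project settings. Generating PTX with `nvcc -ptx` under both versions may also help narrow down where the generated code differs.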

CUDA 4.1 contains significant changes to the compiler infrastructure. While much effort has gone into avoiding performance regressions with the revamped compiler, compilers are complex pieces of software containing numerous heuristics, and it is impossible to cover every possible permutation in testing. Thus regressions can occur but are expected to be fairly rare. Filing bugs for any functional issues or significant performance regressions will help with eliminating the remaining kinks. Thank you for your help.
