faster at small loop counts, slower at large loop counts

(CUDA C newbie here…)

So, my code is actually a kernel, which uses two other device functions, launched in a for loop that contains only the kernel call and a memory transfer (roughly like the sketch below). When I run this for 500 loops, it is faster than the conventional CPU code, but when I run it for 50000 loops, it is slower.
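To make the structure concrete, here is a simplified sketch of the host side; names like myKernel, d_data and N are placeholders, not my real code:

#include <cuda_runtime.h>

__global__ void myKernel(float *data, int n)   // stand-in for my real kernel (which calls two __device__ helpers)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n)
        data[i] *= 2.0f;                       // placeholder work
}

int main()
{
    const int N = 1024;
    const int nLoops = 500;                    // 500 or 50000
    float h_data[N] = { 0 };
    float *d_data = 0;

    cudaMalloc((void**)&d_data, N * sizeof(float));
    cudaMemcpy(d_data, h_data, N * sizeof(float), cudaMemcpyHostToDevice);

    for (int loop = 0; loop < nLoops; ++loop)
    {
        myKernel<<<(N + 255) / 256, 256>>>(d_data, N);
        cudaMemcpy(h_data, d_data, N * sizeof(float), cudaMemcpyDeviceToHost);   // the memory transfer inside the loop
    }

    cudaFree(d_data);
    return 0;
}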

I am using this CPU timer:

#include <time.h>
#include <iostream>
using std::cout;

clock_t start = clock();

...   // the loop being timed goes here

cout << "elapsed time: " << ( (double)clock() - start ) / CLOCKS_PER_SEC << " s\n";

I don’t know how accurate it is (and I don’t really care, since I only want it for comparisons), but I have checked it against a stopwatch and it is acceptable over a two-minute interval. I can’t verify the under-a-second timings, though.
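If the clock() numbers turn out to be too coarse, I assume I could switch to CUDA event timing, something like this sketch (startEvt and stopEvt are just placeholder names, and the fragment assumes cuda_runtime.h and cstdio are included):

cudaEvent_t startEvt, stopEvt;
cudaEventCreate(&startEvt);
cudaEventCreate(&stopEvt);

cudaEventRecord(startEvt, 0);
...   // the kernel/memcpy loop being timed
cudaEventRecord(stopEvt, 0);
cudaEventSynchronize(stopEvt);        // wait until the stop event has actually completed

float ms = 0.0f;
cudaEventElapsedTime(&ms, startEvt, stopEvt);
printf("elapsed time: %f ms\n", ms);

cudaEventDestroy(startEvt);
cudaEventDestroy(stopEvt);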

For 500 loops the timing results (in seconds) are:

CPU: 0.33    GPU: 0.01

For 50000 loops I get:

CPU: 27.43    GPU: 96.82

I thought it might be a memory leak slowing things down, but I can’t find anywhere in the programming guide any code that frees the memory of the variables allocated inside the global or the device functions (see the sketch below for what I mean).
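To be clear about the kind of allocation I’m talking about, here is an illustrative device function; helper is a placeholder name, not my real code:

__device__ float helper(float x)        // one of the two device functions the kernel calls
{
    float tmp[4];                       // local array declared inside the device function
    for (int i = 0; i < 4; ++i)
        tmp[i] = x * (float)i;
    return tmp[0] + tmp[3];             // tmp is never explicitly freed anywhere
}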

Any advice? What should I be looking for?

Thanks in advance

I know that slow execution times are a common issue that can usually be dealt with through better memory management, using the occupancy calculator to improve the kernel’s launch configuration, etc. The odd thing in my case is that when the application is run for 500 loops it is much faster, as it is supposed to be…

(this is a bump to attract some attention… thanks for reading, I hope you answer too :-) )