I am a beginner in GPU computing. Recently, while running a very simple vector-add program, I found that when I call the same CUDA program twice, the first run takes much longer than the second. Is this because of start-up time for the GPU hardware? Thanks!
it may be many things
it may be a race condition - the first run essentially 'sets up' state that happens to let the second run succeed
it may be memory
if it were the other way around, it might have been power related
it may be that the kernel(s) do not even run the 2nd time around, for a number of reasons
do the standard checks - cuda-memcheck and its racecheck tool
and make sure to do proper error checking on/after API calls
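A common sketch of that error-checking pattern (one idiom among several; the macro name is mine) looks like this:

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Wrap every CUDA runtime API call; report file/line on failure.
#define CUDA_CHECK(call)                                                   \
    do {                                                                   \
        cudaError_t err_ = (call);                                         \
        if (err_ != cudaSuccess) {                                         \
            fprintf(stderr, "CUDA error: %s at %s:%d\n",                   \
                    cudaGetErrorString(err_), __FILE__, __LINE__);         \
            exit(EXIT_FAILURE);                                            \
        }                                                                  \
    } while (0)

// Kernel launches do not return an error themselves; check afterwards:
//   myKernel<<<grid, block>>>(...);
//   CUDA_CHECK(cudaGetLastError());        // launch/configuration errors
//   CUDA_CHECK(cudaDeviceSynchronize());   // errors during execution
```

The memory and race checks mentioned above are typically run from the command line as `cuda-memcheck ./your_app` and `cuda-memcheck --tool racecheck ./your_app`.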
Thanks Jimmy. I have done error checking after every API call, and the kernel must have run both times, since the input vectors are different and I have checked that the output vectors are correct. Thank you for your answer!
that is indeed another possibility - execution time may very well depend on the (input) data
certainly in cases where the data determines the number of iterations or the termination point
Could well be the just-in-time compilation of PTX code to your target architecture, or various other driver initialization.
The former can be prevented by embedding binary code for your target architecture when you compile your program.
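As a sketch, assuming a GPU of compute capability 6.1 (substitute your own `sm_XX` value and file names), the compile line would look something like:

```shell
# Embed SASS for sm_61 in the binary, so the driver does not need to
# JIT-compile PTX on the first kernel launch.
nvcc -gencode arch=compute_61,code=sm_61 vector_add.cu -o vector_add
```

Without such a `-gencode` option matching your device, the driver compiles the embedded PTX at first use, which shows up as extra time on the first run.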
One thing you did not mention in your post is whether you copy the data to GPU memory again for the second run.
Having said that, if you take a look at the samples provided by Nvidia, they typically invoke the kernel once as a "warm-up", then invoke it multiple times and take the average of those runs.
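A minimal sketch of that warm-up-then-average timing pattern, using CUDA events (kernel and variable names here are my own, not from any particular sample):

```cuda
#include <cstdio>
#include <cuda_runtime.h>

__global__ void vecAdd(const float* a, const float* b, float* c, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) c[i] = a[i] + b[i];
}

int main() {
    const int n = 1 << 20;
    float *dA, *dB, *dC;
    cudaMalloc(&dA, n * sizeof(float));
    cudaMalloc(&dB, n * sizeof(float));
    cudaMalloc(&dC, n * sizeof(float));

    dim3 block(256), grid((n + 255) / 256);

    // Warm-up launch: absorbs one-time costs (PTX JIT, lazy context setup).
    vecAdd<<<grid, block>>>(dA, dB, dC, n);
    cudaDeviceSynchronize();

    // Timed region: average over many launches, measured with CUDA events.
    cudaEvent_t start, stop;
    cudaEventCreate(&start);
    cudaEventCreate(&stop);
    const int reps = 100;
    cudaEventRecord(start);
    for (int r = 0; r < reps; ++r)
        vecAdd<<<grid, block>>>(dA, dB, dC, n);
    cudaEventRecord(stop);
    cudaEventSynchronize(stop);

    float ms = 0.0f;
    cudaEventElapsedTime(&ms, start, stop);
    printf("avg kernel time: %f ms\n", ms / reps);

    cudaFree(dA); cudaFree(dB); cudaFree(dC);
    return 0;
}
```

If you only time the very first launch, you are measuring the one-time setup costs together with the kernel itself.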
Thank you guys! Your suggestions are all very helpful and have taught me more about CUDA and GPUs. Since I am rather new to this field, it will still take me some time to thoroughly understand your advice. Thanks!