How to compute time in CUDA?

dingshuai1985 · October 12, 2007, 7:31pm

I'm wondering how to measure execution time in CUDA.

I have a small test; the code looks like this:

t1=clock();
for(i=0;i<iteration_time;i++)
{
…call kernel_function…
}
t2=clock();

But t2-t1 does not give me the correct time: I always get a tiny number no matter how I change iteration_time. The same thing happens when I use cutStartTimer and so forth.

I think it is because after the CPU calls the GPU function, it returns without waiting for the result from the GPU. Has anyone else seen this?

Thanks!

Shuai

paulius · October 12, 2007, 8:05pm

Call cudaThreadSynchronize() before you start timing (to make sure that all previous CUDA tasks have completed). Call cudaThreadSynchronize() again right before the second timing call (to make sure that all the tasks you’re timing have completed).

Paulius
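
The advice above can be sketched roughly as follows. This is only an illustration under a few assumptions: busyKernel, timeKernelMs, and the problem size are hypothetical names and values that do not come from the thread. It uses cudaDeviceSynchronize(), which replaced the now-deprecated cudaThreadSynchronize() in later CUDA releases, and std::chrono::steady_clock in place of clock(), since clock() reports CPU time rather than wall-clock time on many systems.

```cuda
#include <chrono>
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical stand-in for the kernel being timed.
__global__ void busyKernel(float *data, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] = data[i] * 2.0f + 1.0f;
}

// Returns wall-clock milliseconds for `iterations` launches, or -1.0 on error.
double timeKernelMs(int iterations)
{
    const int n = 1 << 20;  // arbitrary problem size for the sketch
    float *d_data = nullptr;
    if (cudaMalloc(&d_data, n * sizeof(float)) != cudaSuccess) return -1.0;
    cudaMemset(d_data, 0, n * sizeof(float));

    // Make sure all previously queued GPU work has finished before timing.
    cudaDeviceSynchronize();
    auto t1 = std::chrono::steady_clock::now();

    for (int i = 0; i < iterations; ++i)
        busyKernel<<<(n + 255) / 256, 256>>>(d_data, n);  // returns immediately

    // Wait until every launch above has actually completed on the device.
    cudaError_t err = cudaDeviceSynchronize();
    auto t2 = std::chrono::steady_clock::now();

    cudaFree(d_data);
    if (err != cudaSuccess) return -1.0;
    return std::chrono::duration<double, std::milli>(t2 - t1).count();
}

int main()
{
    double ms = timeKernelMs(100);
    printf("100 launches took %f ms\n", ms);
    return ms < 0.0 ? 1 : 0;
}
```

Without the second synchronization, the timer would mostly measure how long it takes to queue the launches, which is why the measured time barely changes with the iteration count.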

timtimac · October 13, 2007, 1:02am

If I call two kernel functions one after the other without using cudaThreadSynchronize(), will they run on the device one after the other, or will they run concurrently?

For example:

__global__ void myKernel1() {…}
__global__ void myKernel2() {…}

myKernel1<<<dimGrid,dimBlock>>>();
myKernel2<<<dimGrid,dimBlock>>>();
// NB: no cudaThreadSynchronize()

Will myKernel1 and myKernel2 run concurrently on the device (suppose they both take long enough)? Or is there some kind of queue that holds myKernel2 until myKernel1 has finished and then launches myKernel2?

Timtimac.

AndreiB · October 13, 2007, 6:04am

"these two kernel functions run in the device one after one, or will they run concurrently on the device"
This has been discussed many times; why don’t you search for an answer?
In short: they will NOT run concurrently. myKernel2 will be launched only after myKernel1 has completed.
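
A small sketch of this ordering, under the assumption that both kernels are launched into the same (default) stream as in the question. The kernel bodies, the data buffer, and countOrderingMismatches are hypothetical and only there for illustration. myKernel2 depends on what myKernel1 wrote, and there is no explicit synchronization between the two launches; the blocking cudaMemcpy afterwards waits for both kernels before copying.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Hypothetical kernel bodies: the second depends on the first's output.
__global__ void myKernel1(int *data, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] = i;
}

__global__ void myKernel2(int *data, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] += 1;
}

// Returns the number of elements that do not hold i + 1, or -1 on error.
int countOrderingMismatches(int n)
{
    int *d_data = nullptr;
    if (cudaMalloc(&d_data, n * sizeof(int)) != cudaSuccess) return -1;

    dim3 dimBlock(256);
    dim3 dimGrid((n + 255) / 256);

    // Both launches go to the default stream, so they execute in issue order.
    myKernel1<<<dimGrid, dimBlock>>>(d_data, n);
    myKernel2<<<dimGrid, dimBlock>>>(d_data, n);

    // Blocking copy: runs only after both kernels above have completed.
    std::vector<int> h_data(n);
    cudaError_t err = cudaMemcpy(h_data.data(), d_data, n * sizeof(int),
                                 cudaMemcpyDeviceToHost);
    cudaFree(d_data);
    if (err != cudaSuccess) return -1;

    int mismatches = 0;
    for (int i = 0; i < n; ++i)
        if (h_data[i] != i + 1) ++mismatches;
    return mismatches;
}

int main()
{
    int mismatches = countOrderingMismatches(1 << 16);
    printf("mismatches: %d\n", mismatches);
    return mismatches == 0 ? 0 : 1;
}
```

If the two kernels could overlap or run out of order, some elements would not end up equal to i + 1.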
