Sequential call of kernels

In_Kyu_Park · November 27, 2007, 1:48am

Hi,

Another simple question:

In sequential call of different kernels (Grid dim and block dim are different), I observed there is serious slow-down.

For example,

Kernel 1 only => 0.001 sec
Kernel 2 only => 0.130 sec

But kernel1 + kernel2 would take 0.450 sec.

Can I have some advice on this?

Thanks.

seb · November 27, 2007, 2:45am

You probably make a mistake when timing your kernels. Kernel calls are asynchronous so they return immediately after you called them. Use cudaThreadSynchronize() before starting and before stopping the timer to get accurate results.

AndreiB · November 27, 2007, 5:28am

You should do it in following way:

cudaThreadSyncronize();

t1 = clock();

kernel1<<>>();

cudaThreadSYncronize();

t2=clock();

kernel2<<>>();

cudaThreadSyncronize();

t3=clock();

Replace clock() with appropriate timing function (such as QueryPerformanceCounter() on WIndows). kernel1 execution time will be (t2-t1) and kernel2 execution time will be (t3-t2).

Topic		Replies	Views
Odd Slowdown Problem Same function slows down in loop CUDA Programming and Performance	3	9946	February 8, 2008
the same thing, different time consuming asking for help CUDA Programming and Performance	5	6312	May 26, 2009
very slow function next to kernel CUDA Programming and Performance	3	3990	August 10, 2008
how to compute time in cuda? CUDA Programming and Performance	3	3810	October 13, 2007
Kernels and For Loops CUDA Programming and Performance	2	4127	April 4, 2008
Kernel Timing and cudaThreadSynchronize() CUDA Programming and Performance	6	2093	July 30, 2010
Inconsistent CUDA Kernel Execution Times in Sequential Execution CUDA Programming and Performance cuda	6	387	June 11, 2024
kernel in loop (time explodes) CUDA Programming and Performance	4	3548	June 29, 2009
Kernel execution is async? CUDA Programming and Performance	1	4592	May 23, 2008
Speed reduces 17 -> 20 times after the kernel is called 9th times! T_T! CUDA Programming and Performance	4	2537	November 18, 2008

Sequential call of kernels

Related topics