Hello,
I ported a Gaussian Mixture Model algorithm for video foreground segmentation to CUDA. The CUDA kernel is executed once per frame, so it is launched in rapid succession. Here is a code example:
...
cudaEvent_t start, stop;
float time;

while( NewFrame() )
{
    ...
    cudaEventCreate(&start);
    cudaEventCreate(&stop);

    cudaEventRecord(start, 0);
    UpdateBackgroundModel<<<grid, block, size>>>(...);
    cudaEventRecord(stop, 0);
    cudaEventSynchronize(stop);
    cudaEventElapsedTime(&time, start, stop);

    // destroy the events each frame so they are not leaked across launches
    cudaEventDestroy(start);
    cudaEventDestroy(stop);
    ...
}
...
For the first ~300 launches the kernel takes about 1.2 ms per launch; the time then increases to about 2.4 ms, and after a few more seconds it finally settles at around 6.9 ms.
These measurements were taken with a release build (no debug information, etc.).
Here is some system information:
- Windows 7 32-bit
- GeForce GTX 295 (multi-GPU, only one GPU used for the CUDA kernel)
- Nsight Runtime API 3.1
I suspect this might be a power-saving problem of the GPU. The Windows power plan is already set to highest performance, but that didn't solve the issue.
I hope someone has a solution for this behavior, because I need it to be as fast as possible.
Best regards
chrizh