Performance measurement

Sanix · April 21, 2011, 2:13pm

Hi,

I’ve created several kernel functions and measured the time. I’m interested in the overhead CUDA generates.

I analysed a kernel function with the built in nsight analyzer. It says the duration of my kernel is 3.2 microseconds. When I measure with the QueryPerformanceCounter around the kernel function and call cudaThreadSynchronize() to make sure, that the kernel execution has finished, I get like 230 microseconds.
Does this mean that there’s an overhead of approximately 200 microseconds? This would render CUDA inefficient for small junks of calculation to be done. Has anyone else tried to use CUDA for a small amounts of “work”.

Sanix · April 28, 2011, 8:57am

push

cudahacker · April 28, 2011, 11:17am

Hi I have no idea whether your measurements are correct and what the actual times are, however, 200us overhead sounds absolutely reasonable to me. It a well known fact that GPGPU computing is not suited for small jobs.

seibert · April 29, 2011, 12:42am

What operating system are you using? The launch overhead varies between operating systems quite a bit.

Topic		Replies	Views
Kernel Overhead/Profiler Accuracy CUDA Programming and Performance	4	6421	May 25, 2008
Visual Profiler: CPU Time? CUDA Programming and Performance	5	3441	March 21, 2008
Kernel execution overhead CUDA Programming and Performance	2	1174	July 6, 2009
Kernel enqueue overhead Bringing kernel overhead down? CUDA Programming and Performance	9	13783	March 12, 2010
kernel launch overhead for GTX 280 CUDA Programming and Performance	17	3673	November 5, 2009
kernel call overhead: timing results overhead is large for small # of calls CUDA Programming and Performance	16	7859	March 8, 2013
Viability for CUDA to improve millisecond calcs? Is there too much overhead? CUDA Programming and Performance	3	3511	June 12, 2008
overhead between two successive kernel calls CUDA Programming and Performance	6	1778	July 7, 2013
Slow loading kernel to GPU CUDA Programming and Performance	11	12968	April 18, 2008
System Time Too much time spent on system while running a kernel CUDA Programming and Performance	0	1188	July 17, 2009

Performance measurement

Related topics