Concurrent CPU and GPU execution on TESLA

Good day,

Noob here with a simple question: can we call a device kernel and carry on with CPU computation without waiting for the GPU to return? Something like the right part of the attached image.

Is this possible on TESLA? FERMI?

Any simple examples?

Thank you,

Fadhel

This is actually the default behavior. Kernel launches are asynchronous, i.e. they return before their work is done. You can simply carry on with normal CPU computations afterwards and they will overlap with the GPU computation. Synchronization happens only either explicitly, with functions like cudaDeviceSynchronize(), or implicitly, through blocking functions like cudaMemcpy() to/from the host. (Chapter 3.2.5 of the CUDA C Programming Guide has further details and also discusses things like concurrent kernels and asynchronous memcpys.)
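For illustration, here is a minimal sketch (not from the original post) showing the pattern: the kernel launch returns immediately, CPU work runs while the GPU is busy, and the device-to-host cudaMemcpy() implicitly waits for the kernel to finish. The kernel and variable names are just placeholders for this example.

```cpp
#include <cstdio>
#include <cuda_runtime.h>

// Trivial example kernel: each thread doubles one element.
__global__ void doubleElements(float *data, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n)
        data[i] *= 2.0f;
}

int main()
{
    const int n = 1 << 20;
    size_t bytes = n * sizeof(float);

    float *h_data = (float *)malloc(bytes);
    for (int i = 0; i < n; ++i)
        h_data[i] = 1.0f;

    float *d_data;
    cudaMalloc(&d_data, bytes);
    cudaMemcpy(d_data, h_data, bytes, cudaMemcpyHostToDevice);

    // The kernel launch returns immediately; the GPU works in the background.
    doubleElements<<<(n + 255) / 256, 256>>>(d_data, n);

    // This CPU work runs concurrently with the kernel.
    double cpuSum = 0.0;
    for (int i = 0; i < n; ++i)
        cpuSum += h_data[i];
    printf("CPU sum computed while the GPU is busy: %f\n", cpuSum);

    // The blocking copy back to the host waits for the kernel to finish,
    // so no explicit cudaDeviceSynchronize() is needed here.
    cudaMemcpy(h_data, d_data, bytes, cudaMemcpyDeviceToHost);
    printf("First element after the kernel: %f\n", h_data[0]);

    cudaFree(d_data);
    free(h_data);
    return 0;
}
```

The same pattern works on Tesla- and Fermi-generation hardware, since asynchronous kernel launches have been part of the CUDA runtime from the start.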

Thank you!