A question about kernel execution

mcleary · August 17, 2009, 3:01am

Hi,

My question is quite simple, but I don’t have the answer. The NVIDIA CUDA Programming Guide says that a kernel launch is asynchronous, meaning that control returns to the host immediately after the kernel is called. I want to know whether there is any guarantee that one kernel does not overlap the execution of another, or whether I need to call cudaThreadSynchronize() to synchronize them.

Thanks.

mcleary · August 24, 2009, 5:42pm

I have another question on this matter. Typically, after a kernel executes, we need to copy data back from the GPU to the CPU. So is there no need to call cudaThreadSynchronize() before using cudaMemcpy()?

Thanks in advance.
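
For reference, here is a minimal sketch of the situation both posts describe. It assumes both kernels are launched into the default stream and that no other streams are in use. In that case, work issued to the same stream runs in issue order, so the second kernel starts only after the first has finished, and a plain (blocking) cudaMemcpy() waits for the preceding kernels before copying. The kernel names `addOne` and `timesTwo` and the helper `runTwoKernels` are hypothetical, made up for this illustration.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Hypothetical kernels, for illustration only.
__global__ void addOne(int *data, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] += 1;
}

__global__ void timesTwo(int *data, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= 2;
}

// Returns true if every element equals (i + 1) * 2, which is the result
// expected when addOne completes before timesTwo starts.
bool runTwoKernels(int n) {
    std::vector<int> host(n);
    for (int i = 0; i < n; ++i) host[i] = i;

    int *dev = nullptr;
    size_t bytes = n * sizeof(int);
    if (cudaMalloc(&dev, bytes) != cudaSuccess) return false;
    cudaMemcpy(dev, host.data(), bytes, cudaMemcpyHostToDevice);

    int threads = 256;
    int blocks = (n + threads - 1) / threads;

    // Both launches return to the host immediately, but they are queued
    // in the same (default) stream, so they execute one after the other.
    addOne<<<blocks, threads>>>(dev, n);
    timesTwo<<<blocks, threads>>>(dev, n);

    // No explicit synchronization call here: the blocking cudaMemcpy in the
    // default stream waits for the kernels above and returns after the copy.
    cudaError_t err = cudaMemcpy(host.data(), dev, bytes, cudaMemcpyDeviceToHost);
    cudaFree(dev);
    if (err != cudaSuccess) return false;

    for (int i = 0; i < n; ++i) {
        if (host[i] != (i + 1) * 2) return false;
    }
    return true;
}

int main() {
    bool ok = runTwoKernels(1 << 20);
    printf(ok ? "results match\n" : "mismatch or CUDA error\n");
    return ok ? 0 : 1;
}
```

The picture changes if the kernels go into different non-default streams or if the copy uses cudaMemcpyAsync(); then an explicit synchronization is needed before the host reads the data. Note also that later CUDA releases deprecate cudaThreadSynchronize() in favor of cudaDeviceSynchronize().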
