problem about cudaThreadSynchronize()

dingshuai1985 · November 20, 2007, 10:41pm

Here is my problem:

I need to call several different kernels. Are the two cases below equivalent?

case1:

kernel1<<<grid, block>>>(…);
cudaThreadSynchronize();
kernel2<<<grid, block>>>(…);
cudaThreadSynchronize();

case2:

kernel1<<<grid, block>>>(…);
kernel2<<<grid, block>>>(…);
cudaThreadSynchronize();

Will I get any benefit from omitting one of the cudaThreadSynchronize() calls?

Thanks :)

seb · November 20, 2007, 11:06pm

I can only answer this for CUDA 1.0:
Yes, the two cases give the same result. Kernel calls are queued by the driver and executed sequentially; no kernels run in parallel. There is no benefit from omitting one of the calls.

paulius · November 21, 2007, 12:53am

The same is true for CUDA 1.1. If you do use the stream API in 1.1, kernel calls and memcopies issued to the same stream are queued up and run in order. The non-streamed API in 1.1 looks and behaves just like 1.0, so it’s completely backward compatible.

Paulius
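
For anyone who wants to try the stream behaviour described above, here is a minimal sketch. The kernels fill and add_one, the helper run_in_stream, and the sizes are made up for illustration and are not from this thread. It assumes a current toolkit, and it only shows that two kernels and a copy issued to one stream run in issue order with a single synchronize at the end.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernels for illustration only (not from the thread).
__global__ void fill(float *x, int n, float v) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) x[i] = v;
}

__global__ void add_one(float *x, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) x[i] += 1.0f;
}

// Queues fill, add_one, and a device-to-host copy in one stream, then
// waits once. Returns true if every element ends up equal to 3.0f.
bool run_in_stream(int n) {
    cudaStream_t s;
    float *d = nullptr;
    float *h = nullptr;
    if (cudaStreamCreate(&s) != cudaSuccess) return false;
    if (cudaMalloc(&d, n * sizeof(float)) != cudaSuccess) {
        cudaStreamDestroy(s);
        return false;
    }
    // Pinned host memory, so the copy below can be truly asynchronous.
    if (cudaMallocHost(&h, n * sizeof(float)) != cudaSuccess) {
        cudaFree(d);
        cudaStreamDestroy(s);
        return false;
    }

    int block = 256;
    int grid = (n + block - 1) / block;

    // All three operations go to stream s; no synchronize in between.
    fill<<<grid, block, 0, s>>>(d, n, 2.0f);
    add_one<<<grid, block, 0, s>>>(d, n);
    cudaMemcpyAsync(h, d, n * sizeof(float), cudaMemcpyDeviceToHost, s);

    // One wait for everything queued in the stream.
    bool ok = (cudaStreamSynchronize(s) == cudaSuccess);
    for (int i = 0; ok && i < n; ++i) ok = (h[i] == 3.0f);

    cudaFreeHost(h);
    cudaFree(d);
    cudaStreamDestroy(s);
    return ok;
}

int main() {
    printf("%s\n", run_in_stream(1 << 20) ? "ok" : "mismatch");
    return 0;
}
```

add_one reads what fill wrote, so the check would fail if the two kernels were reordered or overlapped within the stream.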

nwilt · November 25, 2007, 3:50am

Setting up a kernel invocation isn’t free for the CPU, so case1 loses some benefit of concurrent CPU/GPU execution. If the kernels are doing a small enough amount of work that the driver overhead of the kernel invocations is noticeable, case2 is preferable because the CPU can set up the call to kernel2 while kernel1 is executing.
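
A rough way to see this effect is to time many tiny launches on the host, once with a synchronize after every launch (as in case1) and once with a single synchronize at the end (as in case2). The sketch below is only an illustration: the kernel tiny, the helper time_launches, and the repetition counts are hypothetical. It calls cudaDeviceSynchronize(), which replaced the now-deprecated cudaThreadSynchronize() in later toolkits. The numbers depend on the GPU and driver, so no particular result is claimed.

```cuda
#include <chrono>
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical, deliberately tiny kernel: one thread bumps a counter,
// so launch overhead dominates the run time.
__global__ void tiny(int *x) {
    if (blockIdx.x == 0 && threadIdx.x == 0) *x += 1;
}

// Launches `tiny` reps times and returns the elapsed host time in ms.
// sync_each = true  -> synchronize after every launch (like case1)
// sync_each = false -> synchronize once at the end   (like case2)
double time_launches(int *d, int reps, bool sync_each) {
    auto t0 = std::chrono::steady_clock::now();
    for (int i = 0; i < reps; ++i) {
        tiny<<<1, 32>>>(d);
        if (sync_each) cudaDeviceSynchronize();
    }
    cudaDeviceSynchronize();  // make sure all queued work has finished
    auto t1 = std::chrono::steady_clock::now();
    return std::chrono::duration<double, std::milli>(t1 - t0).count();
}

int main() {
    int *d = nullptr;
    if (cudaMalloc(&d, sizeof(int)) != cudaSuccess) {
        printf("cudaMalloc failed\n");
        return 1;
    }
    cudaMemset(d, 0, sizeof(int));

    time_launches(d, 10, false);  // warm-up, excludes one-time setup cost

    double case1_ms = time_launches(d, 1000, true);
    double case2_ms = time_launches(d, 1000, false);
    printf("sync after each launch: %.3f ms\n", case1_ms);
    printf("single sync at the end: %.3f ms\n", case2_ms);

    cudaFree(d);
    return 0;
}
```

With kernels that do a lot of work per launch, the difference between the two timings should shrink, since the launch setup is then a small fraction of the total.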
