cudaThreadSynchronize() after kernel call?

Peronet · April 23, 2008, 12:38pm

When you run a timer under the duration of a kernel, do you need to use cudaThreadSynchronize() before you stop the timer to get the accurate timing? I do use the __syncthreads() command in the end of the kernel to make sure all threads are finished.

What is the difference between these 2 commands? There is a significant timing-difference when using the outer sync and when not.

DenisR · April 23, 2008, 12:57pm

synthreads works on the GPU
cudathreadsynchronize will block until all kernels are finished. For timing purposes it is necessary to use cudathreadsynchronize

Peronet · April 23, 2008, 1:02pm

Thanks!

MisterAnderson42 · April 23, 2008, 1:19pm

__syncthreads() only acts as a barrier for threads within a block. You do not need it to “make sure that all threads are finished”. __syncthreads() is only needed to prevent race conditions when multiple threads access the same region of shared memory.

platinor · November 29, 2010, 3:38am

In my opinion, if you use the GPU timer, you need to use the cudathreadsynchronize, but with CPU timer, it is not needed, right?

avidday · November 29, 2010, 11:54am

No, wrong. Kernel launches are non-blocking on the host. If you don’t use cudaThreadSynchronize before stopping a timer after a kernel launch, the timer will only measure the kernel launch time, no the time it took the kernel to finish executing.

Topic		Replies	Views
No need to check cudaThreadSynchronize() in release mode? CUDA Programming and Performance	9	6338	April 21, 2009
Question regarding cudaThreadSynchronize() Does it act like a barrier? CUDA Programming and Performance	1	1142	September 16, 2008
Waiting for kernel CUDA Programming and Performance	6	1459	September 8, 2010
"cudaThreadSynchronize()" and "__syncthreads()" CUDA Programming and Performance	1	9703	March 22, 2008
Using GPU and CPU at the same time CUDA Programming and Performance	5	6955	March 4, 2009
cudaThreadSynchronize() and multiple kernels when is it necessary to sync? CUDA Programming and Performance	2	8336	June 20, 2008
Kernel Timing and cudaThreadSynchronize() CUDA Programming and Performance	6	2002	July 30, 2010
cudaThreadSyncronize doubt CUDA Programming and Performance	2	1644	September 1, 2008
About Synchronize CUDA Programming and Performance	4	1447	March 26, 2009
Behaviour of Multithreaded programs with cudaThreadSynchronize() The semantics of cudaThreadSynchron CUDA Programming and Performance	1	7206	January 9, 2012

cudaThreadSynchronize() after kernel call?

Related topics