How to synchronize threads in different blocks inside a kernel

lichunxue987 · June 16, 2019, 2:08pm

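I ran into a problem in my project. I wrote a kernel, and inside that kernel I launch another kernel. The second kernel is launched like this:
gpu1<<<4,16>>>();
I find that after gpu1 has run, usually the threads of one block have finished while the other threads have not finished yet. How can I synchronize threads that belong to different blocks? In the actual project the kernel launch parameters are 2D dim3 values.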

Robert_Crovella · June 16, 2019, 3:55pm

The only methods provided by CUDA to synchronize threads in separate blocks are kernel launch boundaries (all threads are synchronized at the beginning and at the end of your kernel code) and CUDA cooperative groups.

https://devblogs.nvidia.com/cooperative-groups/
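
To make the cooperative groups option concrete, here is a minimal sketch of a grid-wide barrier. It is not the original poster's code: the kernel `two_phase`, the host helper `run_grid_sync_demo`, and the buffers `buf` and `out` are hypothetical names made up for illustration, and the kernel is launched from the host rather than from inside another kernel. It assumes a GPU and driver that report support for cooperative launch.

```cuda
#include <cooperative_groups.h>
#include <cuda_runtime.h>
#include <vector>

namespace cg = cooperative_groups;

// Hypothetical two-phase kernel: phase 2 reads a value that was written
// in phase 1 by a thread that may live in a different block.
__global__ void two_phase(int *buf, int *out, int n)
{
    cg::grid_group grid = cg::this_grid();
    int i = blockIdx.x * blockDim.x + threadIdx.x;

    if (i < n) buf[i] = i + 1;           // phase 1: every thread writes

    grid.sync();                         // barrier across ALL blocks of the grid

    if (i < n) out[i] = buf[n - 1 - i];  // phase 2: read another thread's value
}

// Hypothetical host helper. Returns 0 if the results are correct,
// 1 if they are wrong, -1 if the device cannot run the demo.
int run_grid_sync_demo()
{
    int blocks = 4, threads = 16, n = blocks * threads;

    int dev = 0, supported = 0;
    if (cudaGetDevice(&dev) != cudaSuccess) return -1;
    if (cudaDeviceGetAttribute(&supported, cudaDevAttrCooperativeLaunch, dev)
            != cudaSuccess || !supported)
        return -1;

    int *buf = nullptr, *out = nullptr;
    if (cudaMalloc(&buf, n * sizeof(int)) != cudaSuccess) return -1;
    if (cudaMalloc(&out, n * sizeof(int)) != cudaSuccess) {
        cudaFree(buf);
        return -1;
    }

    // grid.sync() is only valid when the kernel is started with a
    // cooperative launch; a plain <<<...>>> launch is not sufficient.
    void *args[] = { &buf, &out, &n };
    cudaError_t err = cudaLaunchCooperativeKernel(
        (void *)two_phase, dim3(blocks), dim3(threads), args, 0, 0);
    if (err == cudaSuccess) err = cudaDeviceSynchronize();

    std::vector<int> host(n, 0);
    if (err == cudaSuccess)
        err = cudaMemcpy(host.data(), out, n * sizeof(int),
                         cudaMemcpyDeviceToHost);
    cudaFree(buf);
    cudaFree(out);
    if (err != cudaSuccess) return -1;

    // buf[j] == j + 1, so out[i] == buf[n - 1 - i] == n - i.
    for (int i = 0; i < n; ++i)
        if (host[i] != n - i) return 1;
    return 0;
}
```

To try it, call `run_grid_sync_demo()` from a `main` function and build with `nvcc`. As far as I know, grid-wide sync needs compute capability 6.0 or higher, and older toolkits may also require `-rdc=true`. All blocks of the grid must be resident on the GPU at the same time, which is easy to satisfy with 4 blocks of 16 threads. The launch call takes `dim3` grid and block sizes, so a 2D configuration should work the same way.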
