I think blocks get launched in order, but they don’t necessarily execute or complete in order. There’s no way to synchronize data between blocks except through atomic operations. Generally, you have to let all threads of all blocks finish, write the results to global memory, and then launch a new kernel to use the new data (unless you can do it via atomics).
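A minimal sketch of that “split it into two kernels” pattern (my own illustration, not code from this thread — the kernel names and the doubling/adding work are placeholders). Kernel launches issued to the same stream run in order, so the second kernel only starts after every block of the first has written its results to global memory:

```cuda
__global__ void pass1(float *data, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] = data[i] * 2.0f;   // placeholder work, written to global memory
}

__global__ void pass2(const float *data, float *out, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) out[i] = data[i] + 1.0f;    // safely sees every block's pass1 result
}

// Host side: launches on the default stream are implicitly ordered,
// so no explicit synchronization is needed between the two kernels.
//   pass1<<<blocks, threads>>>(d_data, n);
//   pass2<<<blocks, threads>>>(d_data, d_out, n);
```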
In parallel, up to the number of blocks that can run at once (which depends on the GPU model and on how you code your solution); after that, the remaining blocks have to wait until one of the running blocks finishes.
So if your application has 10000 blocks and your GPU can run, say, 36 of them at once, then 36 will be launched and 9964 will wait. As some of the first 36 finish, a similar number of the waiting blocks will start up to replace them. The original 36 will probably not all finish at the same time or in order. So: “make no assumptions about the order of block execution.”
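You can see this for yourself with a small experiment (a sketch I wrote for illustration, not code from this thread): each block grabs a “finish ticket” via `atomicAdd` as its last action. Dumping the tickets afterwards typically shows that block 0 did not get ticket 0, and the order changes from run to run:

```cuda
#include <cstdio>

__global__ void finishOrder(int *tickets, int *counter)
{
    // ... real work would go here ...
    if (threadIdx.x == 0) {
        // One thread per block records when its block reached the end.
        tickets[blockIdx.x] = atomicAdd(counter, 1);
    }
}

int main()
{
    const int numBlocks = 64;
    int *tickets, *counter;
    cudaMalloc(&tickets, numBlocks * sizeof(int));
    cudaMalloc(&counter, sizeof(int));
    cudaMemset(counter, 0, sizeof(int));

    finishOrder<<<numBlocks, 128>>>(tickets, counter);
    cudaDeviceSynchronize();

    int host[numBlocks];
    cudaMemcpy(host, tickets, sizeof(host), cudaMemcpyDeviceToHost);
    for (int i = 0; i < numBlocks; ++i)
        printf("block %2d got finish ticket %2d\n", i, host[i]);

    cudaFree(tickets);
    cudaFree(counter);
    return 0;
}
```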