Warp formation of small multidimensional blocks

strikosn · June 23, 2010, 9:20am

Hi all,

I have a small question… If I create, say, a two-dimensional block of size <16, 2>, then the 16 threads for y=0 and the 16 for y=1 form one warp or two independent? I am aware of the fact that block sizes that are a multiple of warp size make better utilization of the SMs, but in my application I have decided that it might be better to not follow this approach for some problem sizes. Additionally, for even smaller blocks, let’s say <8, 2>, do the coalescing rules for global and shared memory accesses apply for the 16 threads as a half-warp or for each 8-tuple of threads independently?

TIA,
strikosn

avidday · June 23, 2010, 9:40am

That is one warp. Block dimensions are just a language level device - threads are sequentially ordered in column major order within a block, and warps are formed in order from the sequence of threads.

The warp and half-warp “rules” of the execution model are invariant. If you have less than 32 threads per block, the hardware just adds dummy threads which are masked out and runs a single 32 thread warp. If you choose less than 32 threads per block, all you are doing is wasting cycles.

Topic		Replies	Views
Warp scheduler and dimensionality CUDA Programming and Performance	6	1058	January 20, 2015
Quick warp/thread question CUDA Programming and Performance	1	917	August 31, 2009
Warp in two dimension CUDA Programming and Performance	2	3567	March 11, 2010
Grouping of threads into warps CUDA Programming and Performance	1	3331	February 25, 2009
Block Size.. CUDA Programming and Performance	2	1780	July 11, 2008
Thread to warp assignement How block's threads get mapped to warps? CUDA Programming and Performance	4	7896	January 28, 2008
Warp wrap-around for odd numbered block sizes How are threads assigned to warps in this case? CUDA Programming and Performance	0	773	July 24, 2009
Blocks and Warps CUDA Programming and Performance	2	8064	January 7, 2009
threads in one block CUDA Programming and Performance	7	1748	March 6, 2010
question about warp, block and threads CUDA Programming and Performance	4	2002	February 3, 2009

Warp formation of small multidimensional blocks

Related topics