How to use blocks

Marsxema · November 26, 2007, 6:46pm

I know that the maximum number of blocks that can run concurrently on a multiprocessor is 8.
So on an 8800 GTX, we can run 64 blocks concurrently.
I just want to know the limit on the number of blocks we can create (I suppose there is a limit).

And I have a question.
Suppose we have two cases (on an 8800 GTX):
1 - we run an application on 128 blocks with 1 thread per block
2 - we run the same application on 4 blocks with 32 threads per block

In my mind, the second case will run faster because it is using warps. But I am not sure and I need an explanation. Maybe it depends on something else ...

Thanks for your future answers.

hab · November 26, 2007, 7:08pm

You need to read chapters 2 and 3 and appendix A of the CUDA Programming Guide, although it may take more than one reading to fully grasp what it tells you. One thread per block is a non-starter if you want performance: it would only use 8 of the 128 processors and would probably leave those processors idle at least 75% of the time.

A warp is 32 threads, the minimum number needed to keep a multiprocessor fully utilized when there are no memory accesses. So to fully utilize an 8800 GTX you need a minimum of 256 threads, organized as 8 blocks, or one block per multiprocessor. To hide memory accesses you need many more; see the Performance Guidelines in Chapter 5.
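
A rough way to see the warp argument is to count how many warps each of the two launch configurations in the question would issue, and how many of the 32 lanes in each warp do real work. The sketch below is plain host-side arithmetic in Python, not CUDA code. The helper name `warp_usage` is made up for illustration, and the model assumes only that a warp holds 32 threads and that warps never span blocks. It ignores memory latency, scheduling and how blocks are spread over multiprocessors.

```python
import math

WARP_SIZE = 32  # threads per warp, as stated in the reply above


def warp_usage(num_blocks, threads_per_block, warp_size=WARP_SIZE):
    """Return (warps_issued, active_threads, lane_utilization) for a launch.

    Hypothetical helper for illustration only; not part of any CUDA API.
    """
    # Warps never span blocks, so each block rounds up to whole warps.
    warps_per_block = math.ceil(threads_per_block / warp_size)
    warps_issued = num_blocks * warps_per_block
    active_threads = num_blocks * threads_per_block
    # Fraction of warp lanes that carry a real thread.
    lane_utilization = active_threads / (warps_issued * warp_size)
    return warps_issued, active_threads, lane_utilization


if __name__ == "__main__":
    # The two cases from the question: same total thread count, different shape.
    for blocks, threads in [(128, 1), (4, 32)]:
        warps, active, util = warp_usage(blocks, threads)
        print(f"{blocks} blocks x {threads} threads: "
              f"{warps} warps, {active} threads, {util:.1%} of lanes used")
```

Under this model both cases launch 128 threads, but case 1 issues 128 warps with a single active lane each, while case 2 issues 4 fully populated warps. Note that case 2 still has only 4 blocks, and a block runs on one multiprocessor, so it cannot give work to every multiprocessor either.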
