Maximum Number of Threads

mkashifhanif · June 3, 2010, 2:00pm

what is limit of maximum number of threads per block nVIDIA Geforce 8800 GT can have? I am running a program and it is accepting and running 1024 threads per block? Can anyone explain me this behavior.

avidday · June 3, 2010, 2:21pm

You think you are running a kernel with 1024 threads per block, but you actually are not, either because the kernel is never running and you have no error checking to tell you the launch failed, or because you have the block size and grid size arguments in the kernel launch reversed, and you are actually running 1024 blocks with a small number of threads per block, rather than the other way.

mkashifhanif · June 3, 2010, 5:08pm

Here is my code:

BLOCK_SIZE = 32

dim3 threads(BLOCK_SIZE*BLOCK_SIZE);

dim3 blocks(z+1,1);

mykernel<<<blocks,threads>>>(parameters);

and 32*32=1024 threads. and it is working with no error but if I increase BLOCK_SIZE to 64 then it gives error that shared memory is less.

mkashifhanif · June 3, 2010, 5:09pm

Here is my code:

BLOCK_SIZE = 32

dim3 threads(BLOCK_SIZE*BLOCK_SIZE);

dim3 blocks(z+1,1);

mykernel<<<blocks,threads>>>(parameters);

and 32*32=1024 threads. and it is working with no error but if I increase BLOCK_SIZE to 64 then it gives error that shared memory is less.

Quoc_Vinh · June 4, 2010, 1:52am

The total number thread per block is 1024 but it still works? That is strange.

Did you check the result (parameters) after kernel function execution?

AlexanderAgathos · June 4, 2010, 7:40pm

Indeed it is impossible, the number of threads should be 512, in my GTX-275 the kernel simply does not execute. The number of threads should be carefully chosen per card in order for your SMs to work at their peak.

I recommend also you use this notation:

dim3 dimGrid

dim3 dimBlock

i.e. using the notation Grid and Block which is standard to all programmers in CUDA.

Best,

Alex.

Topic		Replies	Views
I wonder maximum number of threads per block really limits the number of threads in each block. CUDA Programming and Performance	5	3976	February 9, 2024
Maximum of threads On 8600GT CUDA Programming and Performance	6	3569	April 9, 2008
What is the maximum number of threads per block? CUDA Programming and Performance	4	21240	April 8, 2010
Max no. of threads in a multiprocessor. CUDA Programming and Performance	4	1693	September 29, 2009
Question about grid/block/thread sizes CUDA Programming and Performance	3	12266	November 13, 2012
Thread Number Limitation CUDA Programming and Performance	3	3889	December 22, 2008
Block/threads and stuff... CUDA Programming and Performance	5	4901	September 12, 2008
Questions about Block and Grid CUDA Programming and Performance	4	3545	February 26, 2008
Confusion about thread per block CUDA Programming and Performance	1	792	July 24, 2009
maximum thread numbers CUDA Programming and Performance	5	12047	October 4, 2011

Maximum Number of Threads

Related topics