problem for kernel function threads in one block must run the same algorithm?

xueb · May 15, 2009, 2:57pm

Hi,
Sorry for this newbie question.
Why the kernel function can not work?
threads/block = 32, blcoks/grid = 12,
kernel_function:
{
gIndex = blockDim.x * blockIdx.x + threadIdx.x;
if( gIndex > 10 )
algorithm_1 …
else
algorithm_2 …
}
The error information is: the launch timed out and was terminated.
but if changed to “( gIndex >= 32 )”, the code could work normally,
Is that the threads in one block must run with the same algorithm?

Thanks in advance for the help.

Jamie_K · May 15, 2009, 3:20pm

Most likely, you have a race condition or other interdependence of threads that is causing the problem.

Generally speaking, what you are trying to do should “work” in the sense it should produce correct results. But if any warp (i.e. 32 sequentially numbered threads within a block, regardless of block size) diverges (meaning some of the warp does one thing while the rest does something else), then the processor executes the two cases sequentially.

If there is no interdependence of threads, then the fact that divergent threads are run sequentially instead of in parallel should have no effect (besides performance). But if they do depend on each other, for example if you have __syncthreads() or access to shared memory, then it will create a problem.

xueb · May 15, 2009, 3:41pm

aha,
Thanks for your help~~ Jamie K
I put the function __syncthreads() in a wrong position. :D

Topic		Replies	Views
Problem about launch kernel functions several times CUDA Programming and Performance	3	5941	August 21, 2009
Bad performance problems and discussion CUDA Programming and Performance	1	580	May 17, 2016
Here are my timing results, not impressive. Help. CUDA Programming and Performance	5	7020	January 30, 2008
Inconsitent output in Release Mode CUDA Programming and Performance	4	4392	October 17, 2008
Kernel function doesn't launch with block size >16 Block size of 4, 8, and 16 launch fine CUDA Programming and Performance	2	2878	July 28, 2008
Understanding number of threads Problems with program working CUDA Programming and Performance	3	1042	August 17, 2009
kernel only works when one block of threads launched CUDA Programming and Performance	1	595	June 23, 2015
kernel execution fail - because of memory ? function memory CUDA Programming and Performance	1	4414	December 30, 2009
Kernel Launch: number of blocks CUDA Programming and Performance	1	1704	May 21, 2009
blocks vs threads and bad CUDA performance CUDA Programming and Performance	3	3558	January 23, 2015

problem for kernel function threads in one block must run the same algorithm?

Related topics