Kill execution of a block

Hello,

I am new to CUDA.

I have a kernel where each thread in a block does an atomic add. If the sum exceeds a certain value, there is no point in running the remaining threads in the block.
So is there a way to kill a block on a certain condition, to free the multiprocessor? :teehee: :confused:
Using return; will only kill the given thread, right? :confused:

Thanks in advance.

To kill only a certain block, you can have a shared bool (or something like that) that all threads can flag for exit and can read to decide whether they should continue or exit.

eyal
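
For illustration, a minimal sketch of the shared-flag pattern eyal describes, assuming each block loops over several work items (the kernel name, expensiveWork(), gSum and LIMIT are made-up placeholders, and the float atomicAdd needs compute capability 2.0):

[codebox]#define LIMIT 345.666f

// stand-in for the real (expensive) per-item calculation
__device__ float expensiveWork(int item, int tid)
{
    return 0.001f * (item + tid);
}

__global__ void blockFlagSketch(float *gSum, int itemsPerBlock)
{
    // volatile so the compiler re-reads the flag from shared memory every time
    __shared__ volatile bool stillRunning;

    if (threadIdx.x == 0)
        stillRunning = true;          // initialise once per block
    __syncthreads();

    for (int item = 0; item < itemsPerBlock; ++item)
    {
        // no __syncthreads() inside the loop, so each thread may exit on its own;
        // the flag only ever goes from true to false, so a slightly stale read
        // just costs one extra item of work
        if (!stillRunning)
            return;

        float value = expensiveWork(item, threadIdx.x);

        if (atomicAdd(gSum, value) + value >= LIMIT)
            stillRunning = false;     // any thread may flag the whole block to stop
    }
}[/codebox]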

I can’t see that any such thing would be needed. In your code every thread would do the test anyway; afaik, letting just one thread do the test would only make the other threads in the block wait.

Further, atomic functions return the old value, so it would be easy to detect the stop condition without performing an additional read, and let all threads return through their own code.

e.g.

#define LIMIT 345.666f // or whatever

float value = 3.3f; // substitute your calculation

if (atomicAdd( &myglobalfloatvar, value ) + value >= LIMIT) return;

will do just fine afaik (assuming compute capability 2.0 for the float atomicAdd; otherwise do something similar with ints).
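
For context, here is that line inside a minimal complete kernel (the variable names and the dummy value are made up; as noted, the float atomicAdd needs compute capability 2.0):

[codebox]#define LIMIT 345.666f

__device__ float myglobalfloatvar = 0.0f;   // the running total, in global memory

__global__ void addUntilLimit()
{
    float value = 3.3f;                      // substitute your calculation

    // atomicAdd returns the old value, so old + value is the new total;
    // every thread that pushes the total past LIMIT (or arrives after that)
    // simply returns through its own code
    if (atomicAdd(&myglobalfloatvar, value) + value >= LIMIT)
        return;

    // ... the rest of the per-thread work goes here ...
}[/codebox]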

Thanks eyalhir74, jan.heckman for your replies.

Actually the calculation is somewhat expensive, which is why I want to cancel the rest of the threads.

I did something like this:

[codebox]__shared__ bool stillrunning;

if (threadIdx.x == 0)
    stillrunning = true;

__syncthreads();

if (!stillrunning)
    return;

...

if (atomicAdd( &myglobalfloatvar, value ) + value >= LIMIT)
    stillrunning = false;
[/codebox]

The kernel gives a timeout error. I think it is because of some bad memory access.

But if someone sees that this code may be causing the problem, please tell me.

Thanks in advance.

You should post the whole kernel… with the code you’ve posted there’s no way of knowing what happens after the stillrunning = false line.

Are you looping back to the code above? If so, you reset it to true again…

eyal
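
For what it’s worth, the structure eyal is hinting at would look roughly like this; the outer loop and the value are assumed, since the full kernel wasn’t posted:

[codebox]#define LIMIT 345.666f

__device__ float myglobalfloatvar = 0.0f;

__global__ void sketchWithLoop(int iterations)
{
    __shared__ volatile bool stillrunning;

    if (threadIdx.x == 0)
        stillrunning = true;          // set the flag ONCE, before the loop;
    __syncthreads();                  // inside the loop it would be re-armed every pass

    for (int i = 0; i < iterations; ++i)
    {
        if (!stillrunning)            // no __syncthreads() below, so returning here is safe
            return;

        float value = 3.3f;           // stand-in for the real calculation

        if (atomicAdd(&myglobalfloatvar, value) + value >= LIMIT)
            stillrunning = false;     // never set back to true, so the exit sticks
    }
}[/codebox]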

There’s no better way to do this than the shared variable.
