write to global memory from multiple threads and racing conditions

FangQ · April 24, 2009, 10:31pm

just want to make sure, I have a kernel running over many threads (1024 typically), for each thread, it calculate a number and add to a stream in the global memory, something like

__global__ void mykern(float *data){

   ... 

  data[idx]+=...;

}

this is basically a scatter operation. I am wondering if there is any issue with the above implementation? is CUDA going to take care of the racing conditions when writing from multiple threads?

tmurray · April 24, 2009, 10:49pm

If you have multiple threads writing to the same location without any sort of prevention, yes, that’s a race condition. There’s no transactional memory or anything like that.

gatoatigrado · April 25, 2009, 8:13am

Unfortunately there’s no race condition detection offered by nvidia tools. There is a paper about this and related project info at [url=“http://www.cs.virginia.edu/~mwb7w/cuda/”]http://www.cs.virginia.edu/~mwb7w/cuda/[/url].

FangQ · April 26, 2009, 11:27pm

thanks for the reply. basically you are saying if two threads modify the same global memory location, and the results will be unpredictable, is this correct?

notice that in my kernel, I used data[idx]+=…, which is an accumulation. Is this going to get around the issue? Also, if multiple threads modify a single address in the global memory, is there a lock present when performing this transaction to force a sequential modification?

The reason I ask because I used this in my program, and found the results are fine. There are 1024 threads to write (accumulate) randomly to a global memory of size 606060. I want to understand if it did it by accident or by design.

Topic		Replies	Views
Writing to several global memory locations from the same kernel CUDA Programming and Performance	1	1386	June 13, 2008
thread writing into global memory (thread sync) CUDA Programming and Performance	2	1624	August 23, 2009
Question regarding global memory write protection CUDA Programming and Performance	1	781	October 1, 2009
Multiple writes to global memory CUDA Programming and Performance	2	2198	May 6, 2008
Any locking mechanism? CUDA Programming and Performance	9	3852	July 25, 2007
How to deal with multiple threads writing to the same GPU memory location? CUDA Programming and Performance	3	6028	October 31, 2008
Read-After-Write for a single cuda thread ? (and vice versa) Potential race conditions/issues for gl CUDA Programming and Performance	1	15068	June 21, 2011
Writes in same memory location Cant add numbers from different threads? CUDA Programming and Performance	46	25990	July 5, 2007
measure conflict when performing non-atomic write to the global memory CUDA Programming and Performance	0	3067	June 2, 2009
device global memory update questions CUDA Programming and Performance	7	5961	April 20, 2009

write to global memory from multiple threads and racing conditions

Related topics