Q: reading/writing data from multiple threads

Hi, I have a question: I have a data structure (containing float values) that numerous threads read from and write to.

Sometimes this data structure will have to be read and written by multiple threads at once. The threads will, of course, interfere with each other, since they have no idea when the other threads will be reading from or writing to it.

How can I go about preventing these race conditions? I know that any lock will be slow because, of course, it potentially blocks a large number of threads. However, the order in which each thread reads/writes the data structure is highly structured, so these kinds of conflicts should actually happen fairly rarely.

Thanks!

Maybe one of the atomic functions in section B.10 of the programming guide can help.

N.

Atomic operations (like addition) are available in CUDA. Unfortunately, you mention that the data structure you are operating on contains floats, and there are no atomic operations on floats in CUDA. (My suspicion is that atomic operations are implemented in the memory controller, and putting a full FPU there was too much.)
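The usual workaround for the missing float atomics is a compare-and-swap loop on the float's raw 32-bit pattern: read the current bits, compute the new float, and retry the swap until no other thread got in between. Here is a host-side C++ sketch of that technique using `std::atomic`; on the device the same loop would be written with `atomicCAS` plus `__float_as_int` / `__int_as_float` (the names below are illustrative, not taken from the linked thread):

```cpp
#include <atomic>
#include <cstdint>
#include <cstring>

// Atomically add 'val' to a float stored behind a 32-bit atomic integer,
// using a compare-and-swap retry loop. Returns the previous value, which
// matches the convention of CUDA's atomicCAS-based idiom.
float atomic_add_float(std::atomic<std::uint32_t>& target, float val) {
    std::uint32_t old_bits = target.load();
    std::uint32_t new_bits;
    float old_val;
    do {
        std::memcpy(&old_val, &old_bits, sizeof(float));  // reinterpret bits as float
        float new_val = old_val + val;
        std::memcpy(&new_bits, &new_val, sizeof(float));  // back to raw bits
        // On failure, compare_exchange_weak reloads old_bits, so the loop retries
        // with the value some other thread just wrote.
    } while (!target.compare_exchange_weak(old_bits, new_bits));
    return old_val;
}
```

The loop only ever commits a result computed from the value it actually observed, so concurrent additions are never lost; the cost is that a thread may retry under heavy contention.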

Many people have used the atomic operations to construct mutexes and semaphores (which is where it sounds like you are going), but those tend to be complicated and error-prone if you are not careful.
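For a rough idea of what such a mutex looks like, here is a minimal test-and-set spinlock sketched in host-side C++ with `std::atomic` (my own illustration, not code from any linked thread); a GPU version would use `atomicExch` or `atomicCAS` on an int in global memory, and is exactly where the "error-prone" part comes in, since threads in the same warp spinning on a lock held by a warp-mate can hang:

```cpp
#include <atomic>

// A minimal test-and-set spinlock built from atomic exchange.
struct SpinLock {
    std::atomic<int> flag{0};  // 0 = free, 1 = held

    void lock() {
        // Spin until we swap 0 -> 1, i.e. until we own the lock.
        while (flag.exchange(1) != 0) { /* busy-wait */ }
    }

    void unlock() {
        flag.store(0);  // release the lock
    }
};
```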

The best approach is to see if you can design the algorithm to avoid the need to have multiple threads write to the same location (this might take multiple passes). This isn’t always possible, of course, but it is worth thinking about.
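One common way to restructure things along these lines is privatization: in a first pass each thread accumulates into its own private slot (so there are no conflicting writes at all), and a second pass combines the partials in a reduction. A sequential C++ sketch of the idea, with the "threads" simulated by a loop (names are mine, purely illustrative):

```cpp
#include <cstddef>
#include <numeric>
#include <vector>

// Pass 1: thread t writes only to partials[t], touching elements
// t, t + num_threads, t + 2*num_threads, ... so no two "threads"
// ever write the same location.
// Pass 2: combine the per-thread partials (on a GPU this would be
// a parallel reduction kernel).
float sum_without_conflicts(const std::vector<float>& inputs, int num_threads) {
    std::vector<float> partials(num_threads, 0.0f);
    for (int t = 0; t < num_threads; ++t)
        for (std::size_t i = t; i < inputs.size(); i += num_threads)
            partials[t] += inputs[i];
    return std::accumulate(partials.begin(), partials.end(), 0.0f);
}
```

The extra pass costs some memory for the partials, but it removes the need for locks or atomics entirely.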

Here is my implementation of atomic float operations:

http://forums.nvidia.com/index.php?showtopic=101968

I'd like to hear from other people whether this approach is safe to use.

FYI: I’ve posted a link to a working atomic float addition in that thread.