NVIDIA Developer Forums

atomicAdd looking for an explanation of the mysterious atomic functions

Accelerated Computing CUDA CUDA Programming and Performance

DELUXEnized November 4, 2010, 10:42pm 1

Hello,

my dev-env is:
vs2010
nsight 1.5
cuda sdk 3.2

Today was my first time working with the atomic functions.
I thought it was as easy as using every other function, but it wasn’t.
After reading some blogs/posts, I found out that I have to add -arch=sm_12 to the nvcc arguments in order to use the functions.

So now I’ve got some questions.
Why is this not working with a normal #include of the device_function.h, respectively the sm_11_atomic_functions.h?
Why is it not working with sm_11 even though the function is defined in the sm_11_atomic_functions.h file??

I would appreciate every single explanation ;)

DELUXEnized November 4, 2010, 10:42pm 2

Hello,

my dev-env is:
vs2010
nsight 1.5
cuda sdk 3.2

Today was my first time working with the atomic functions.
I thought it was as easy as using every other function, but it wasn’t.
After reading some blogs/posts, I found out that I have to add -arch=sm_12 to the nvcc arguments in order to use the functions.

So now I’ve got some questions.
Why is this not working with a normal #include of the device_function.h, respectively the sm_11_atomic_functions.h?
Why is it not working with sm_11 even though the function is defined in the sm_11_atomic_functions.h file??

I would appreciate every single explanation ;)

seibert November 5, 2010, 11:27pm 3

The issue is that atomic functions require the compiler to emit special PTX instructions that are only supported on certain CUDA devices. This is why you need to pass an extra flag to the compiler to tell it that you are only going to run on devices at a certain compute capability (or greater).

seibert November 5, 2010, 11:27pm 4

The issue is that atomic functions require the compiler to emit special PTX instructions that are only supported on certain CUDA devices. This is why you need to pass an extra flag to the compiler to tell it that you are only going to run on devices at a certain compute capability (or greater).

Topic		Replies	Views	Activity
How to I use atomic functions? any library? CUDA Programming and Performance	3	7950	February 9, 2008
where to find the definition of those function ? CUDA Programming and Performance	5	3193	September 26, 2007
error: identifier "atomicAdd" is undefinede CUDA Programming and Performance	6	4166	May 18, 2010
CUDA_NO_SM_11_ATOMIC_INTRINSICS defined even for compute capability 1.1 CUDA Programming and Performance	2	11527	March 22, 2011
Error using Atomic functions CUDA Programming and Performance	1	853	August 13, 2014
atomic functions CUDA Programming and Performance	17	14407	April 10, 2011
Easy Question, what compile flag for atomicAdd ? CUDA Programming and Performance	7	8080	March 1, 2011
AtomicAdd CUDA Programming and Performance	5	7871	April 4, 2008
where -arch sm_11 in VS2008? error: identifier "atomicAdd" is undefined CUDA Programming and Performance	3	2128	May 18, 2010
atomic operations atomicExch is undefined CUDA Programming and Performance	8	17483	November 22, 2008