My dev environment is:
CUDA SDK 3.2
Today was my first time working with the atomic functions.
I thought it would be as easy as using any other function, but it wasn’t.
After reading some blogs/posts, I found out that I have to add -arch=sm_12 to the nvcc arguments in order to use the functions.
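
A minimal sketch of the kind of code in question, assuming a plain 32-bit atomicAdd on a counter in global memory. The kernel name countKernel, the helper countThreads, and the file name atomic_test.cu are hypothetical placeholders, not taken from a real project:

```cuda
// Build with the flag mentioned above (toolkit of the CUDA 3.2 era assumed):
//   nvcc -arch=sm_12 atomic_test.cu -o atomic_test
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel: every thread adds 1 to one counter in global memory.
__global__ void countKernel(int *counter)
{
    atomicAdd(counter, 1);  // 32-bit atomic add on a global-memory address
}

// Hypothetical helper: launches the kernel and returns the final counter
// value, or -1 if any CUDA call reports an error.
int countThreads(int blocks, int threadsPerBlock)
{
    int *dCounter = NULL;
    int hCounter = 0;

    if (cudaMalloc((void **)&dCounter, sizeof(int)) != cudaSuccess)
        return -1;
    cudaMemcpy(dCounter, &hCounter, sizeof(int), cudaMemcpyHostToDevice);

    countKernel<<<blocks, threadsPerBlock>>>(dCounter);

    // This blocking copy also waits for the kernel to finish.
    cudaError_t err = cudaMemcpy(&hCounter, dCounter, sizeof(int),
                                 cudaMemcpyDeviceToHost);
    cudaFree(dCounter);
    return (err == cudaSuccess) ? hCounter : -1;
}

int main()
{
    // One increment per thread, so 4 * 64 = 256 if the launch succeeds.
    printf("counter = %d\n", countThreads(4, 64));
    return 0;
}
```

Without the -arch flag, nvcc of that era targets its default architecture, which is where the atomicAdd call fails to compile.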
So now I’ve got some questions.
Why does this not work with a normal #include of device_functions.h or sm_11_atomic_functions.h?
Why does it not work with sm_11, even though the function is defined in sm_11_atomic_functions.h?
I would appreciate any explanation ;)