I have been running some tests on a discrete set of values that can be converted to float, int, or unsigned int. Regardless of the cast, the values are always integers in the range 0 to 65535.
I wanted to see whether the choice among these three types affects the performance of atomicAdd() on global memory in my limited application.
Overall, it seems that atomicAdd() on float may be slightly faster than atomicAdd() on signed int.
Should there be any performance difference based on the type of the 32-bit word updated by atomicAdd()?
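
A comparison like this could be set up roughly as follows. This is only an illustrative sketch, not the actual test code: the names `accumulate`, `time_atomic_add`, `make_values`, and `compare_types`, the 256-counter layout (`kBins`), and the launch configuration are all assumptions made for the example.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

constexpr int kBins = 256;  // assumed number of global counters (hypothetical)

// Each thread atomically adds one input value to a global counter of type T.
template <typename T>
__global__ void accumulate(const unsigned short* in, T* out, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) atomicAdd(&out[i % kBins], static_cast<T>(in[i]));
}

// Runs the kernel once for type T, returns the kernel time in milliseconds,
// and stores the sum of all counters in *total.
template <typename T>
float time_atomic_add(const std::vector<unsigned short>& values, double* total) {
    int n = static_cast<int>(values.size());
    unsigned short* d_in = nullptr;
    T* d_out = nullptr;
    cudaMalloc(&d_in, n * sizeof(unsigned short));
    cudaMalloc(&d_out, kBins * sizeof(T));
    cudaMemcpy(d_in, values.data(), n * sizeof(unsigned short),
               cudaMemcpyHostToDevice);
    // All-zero bytes represent 0 for int, unsigned int, and float alike.
    cudaMemset(d_out, 0, kBins * sizeof(T));

    cudaEvent_t start, stop;
    cudaEventCreate(&start);
    cudaEventCreate(&stop);
    int threads = 256;
    int blocks = (n + threads - 1) / threads;
    cudaEventRecord(start);
    accumulate<T><<<blocks, threads>>>(d_in, d_out, n);
    cudaEventRecord(stop);
    cudaEventSynchronize(stop);
    float ms = 0.0f;
    cudaEventElapsedTime(&ms, start, stop);

    std::vector<T> counters(kBins);
    cudaMemcpy(counters.data(), d_out, kBins * sizeof(T),
               cudaMemcpyDeviceToHost);
    *total = 0.0;
    for (T c : counters) *total += static_cast<double>(c);

    cudaEventDestroy(start);
    cudaEventDestroy(stop);
    cudaFree(d_in);
    cudaFree(d_out);
    return ms;
}

// Generates n deterministic test values in the range 0..65535.
std::vector<unsigned short> make_values(int n) {
    std::vector<unsigned short> v(n);
    for (int i = 0; i < n; ++i)
        v[i] = static_cast<unsigned short>((i * 7919u) % 65536u);
    return v;
}

// Times the same workload with float, int, and unsigned int counters.
void compare_types(int n) {
    std::vector<unsigned short> values = make_values(n);
    double total = 0.0;
    time_atomic_add<int>(values, &total);  // warm-up run, timing discarded
    printf("float:        %.3f ms\n", time_atomic_add<float>(values, &total));
    printf("int:          %.3f ms\n", time_atomic_add<int>(values, &total));
    printf("unsigned int: %.3f ms\n", time_atomic_add<unsigned int>(values, &total));
}
```

Calling something like `compare_types(1 << 22)` from `main` would print one timing per type. Two caveats apply to any test of this shape: a single launch is noisy, so averaging over many runs is safer, and a float counter only holds integer sums exactly while it stays below 2^24 (16,777,216), so for large inputs the float results may be rounded even if the timing looks better.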