Atomic Adding to a Clamped Value

stolk · January 11, 2021, 9:09pm

I would like to atomically add, but in such a way that the result is clamped.

I am adding values to addresses holding uint32_t values.

But I want to do this in such a way that the end result never exceeds 0xffffffff.

Theoretically, I could add to uint64_t values instead, but by doing so, I would halve my bandwidth, so I would like to avoid this.

Are there any special instructions inside CUDA that will let me do this? Maybe control the overflow behaviour, somehow?

Robert_Crovella · January 11, 2021, 9:55pm

There aren’t any native clamped atomic add operations.

You can implement your own “custom” atomic with an atomicCAS loop, following the pattern shown in the programming guide, but that is going to have a performance impact compared to a native atomicAdd.
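For illustration, here is a minimal sketch of what such a custom atomic might look like, built on the atomicCAS retry loop. The names `atomicAddSaturate`, `saturateKernel` and `runSaturateDemo` are hypothetical and not part of CUDA; only `atomicCAS` and the runtime calls are real API. Error checking is omitted for brevity.

```cuda
#include <cuda_runtime.h>

// Hypothetical helper (not a CUDA built-in): adds val to *addr and clamps
// the stored result at 0xffffffff. Returns the value seen before the update,
// mirroring the convention of atomicAdd.
__device__ unsigned int atomicAddSaturate(unsigned int *addr, unsigned int val)
{
    unsigned int old = *addr;
    unsigned int assumed;
    do {
        assumed = old;
        if (assumed == 0xffffffffu) {
            break;                      // already saturated, nothing to store
        }
        unsigned int sum = assumed + val;
        if (sum < assumed) {
            sum = 0xffffffffu;          // unsigned wrap-around detected: clamp
        }
        // Store sum only if *addr still holds assumed; otherwise retry
        // with the value another thread wrote in the meantime.
        old = atomicCAS(addr, assumed, sum);
    } while (old != assumed);
    return old;
}

// Demo kernel: the first n threads each add val to a single counter.
__global__ void saturateKernel(unsigned int *counter, unsigned int val, int n)
{
    int tid = blockIdx.x * blockDim.x + threadIdx.x;
    if (tid < n) {
        atomicAddSaturate(counter, val);
    }
}

// Host-side driver (assumed for this sketch): runs n adds of val on a
// zero-initialised counter and returns the final counter value.
unsigned int runSaturateDemo(unsigned int val, int n)
{
    unsigned int *dCounter = nullptr;
    unsigned int hCounter = 0;
    cudaMalloc(&dCounter, sizeof(unsigned int));
    cudaMemcpy(dCounter, &hCounter, sizeof(hCounter), cudaMemcpyHostToDevice);
    saturateKernel<<<(n + 255) / 256, 256>>>(dCounter, val, n);
    // This blocking copy also waits for the kernel to finish.
    cudaMemcpy(&hCounter, dCounter, sizeof(hCounter), cudaMemcpyDeviceToHost);
    cudaFree(dCounter);
    return hCounter;
}
```

With this sketch, `runSaturateDemo(0x01000000u, 1024)` would add 2^24 a total of 1024 times, which exceeds the 32-bit range, so the counter should end up pinned at 0xffffffff. Under contention every failed compare-and-swap costs another round trip, which is where the performance impact comes from.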

slaine1 · January 11, 2021, 10:14pm

Perhaps you could use ordinary atomic adds and use an auxiliary bit to indicate overflow. If an add causes the result to overflow, i.e., the sum is less than the original value under an unsigned comparison, the thread can set the overflow bit in another memory location. The downside is that when you want to use the value, you need to check both the overflow bit and the main variable. The upside is that the atomic addition runs at full speed except for the (presumably rare) case where an overflow occurs and a second memory access is needed to set the overflow bit.
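A rough sketch of the overflow-flag idea, under a few assumptions: one `unsigned int` flag word per counter (rather than a packed bit), and the clamped value is only read after the kernel that performs the adds has finished. The names `atomicAddWithOverflowFlag`, `readClamped`, `flagKernel` and `runOverflowFlagDemo` are made up for this example; error checking is omitted.

```cuda
#include <cuda_runtime.h>

// Hypothetical helper: a plain atomicAdd plus a flag write on wrap-around.
// atomicAdd returns the previous value, so exactly the thread whose add
// wrapped sees old + val < old and sets the flag.
__device__ void atomicAddWithOverflowFlag(unsigned int *addr,
                                          unsigned int val,
                                          unsigned int *overflowFlag)
{
    unsigned int old = atomicAdd(addr, val);
    if (old + val < old) {              // unsigned wrap-around
        atomicOr(overflowFlag, 1u);     // rare second memory access
    }
}

// Consumer side: the flag takes priority over the (wrapped) raw value.
__host__ __device__ unsigned int readClamped(unsigned int value,
                                             unsigned int overflowFlag)
{
    return overflowFlag ? 0xffffffffu : value;
}

// Demo kernel: the first n threads each add val to a single counter.
__global__ void flagKernel(unsigned int *counter, unsigned int *flag,
                           unsigned int val, int n)
{
    int tid = blockIdx.x * blockDim.x + threadIdx.x;
    if (tid < n) {
        atomicAddWithOverflowFlag(counter, val, flag);
    }
}

// Host-side driver (assumed for this sketch): runs n adds of val, then
// combines counter and flag into the clamped result.
unsigned int runOverflowFlagDemo(unsigned int val, int n)
{
    unsigned int *dBuf = nullptr;       // dBuf[0] = counter, dBuf[1] = flag
    unsigned int hBuf[2] = {0u, 0u};
    cudaMalloc(&dBuf, sizeof(hBuf));
    cudaMemcpy(dBuf, hBuf, sizeof(hBuf), cudaMemcpyHostToDevice);
    flagKernel<<<(n + 255) / 256, 256>>>(dBuf, dBuf + 1, val, n);
    // This blocking copy also waits for the kernel to finish.
    cudaMemcpy(hBuf, dBuf, sizeof(hBuf), cudaMemcpyDeviceToHost);
    cudaFree(dBuf);
    return readClamped(hBuf[0], hBuf[1]);
}
```

Note that the counter itself keeps wrapping after the first overflow; only the combination of counter and flag gives the clamped result, so every reader has to go through something like `readClamped`.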
