atomicMax for float

mfarrell · March 8, 2009, 8:14pm

I have a heavily contended float that I want to ensure doesn’t screw up.

long story short, I have a 3D block executing where the z dimension corresponds to the specific row, the x dimension corresponds to row elements, and lastly the y direction shares each element. I want to keep a running max variable (float) in each element but I’m concerned about race conditions.

any smart way to implement this. i’m compute capability 1.1 on quadro fx 1700

uint ri = __umul24(blockIdx.x, blockDim.x) + threadIdx.x;

  uint di = __umul24(blockIdx.y, blockDim.y) + threadIdx.y;

  uint z = __umul24(blockIdx.z, blockDim.z) + threadIdx.z + z_off;

  uint dim = __umul24(gridDim.y, blockDim.y);

/*............*/

	  RAY_2D *row = (RAY_2D *)((char*)d_output.ptr + z * d_output.pitch);

	  //make sure these are set, but only once per ray

	  if(row[ri].r < 0.0f)

	  {

		row[ri].x = x;

		row[ri].y = y;

		row[ri].r = r;

		//row[ri].d = d;

	  }

	  //what to do here, d depends on di (y thread)

	  row[ri].max_d = help_me_atomic_max(row[ri].max_d, d);

seibert · March 8, 2009, 9:04pm

This page describes a cute trick for transforming a float into a sortable integer:

[url=“stereopsis : graphics : radix tricks”]http://www.stereopsis.com/radix.html[/url]

It is a one-to-one function between unsigned ints and single precision floats, so you can:

Transform float to this int representation.
Use the unsigned int version of atomicMax.
When you are done, you can read out the max value from global memory and convert it back to a float.

(NB: this might not behave properly with inf, nan, and possibly denormal floats.)

mfarrell · March 8, 2009, 9:09pm

thanks, man

Jamie_K · March 9, 2009, 1:36pm

Cool trick.

From my understanding of the floating point representation, i think this should also work for denormal numbers and inf. NaN has unusual behavior with relational operators so the integerized version might be not quite the same for NaN.

Topic		Replies	Views
Cuda atomicMax for float CUDA Programming and Performance cuda , ubuntu	9	5426	May 12, 2024
AtomicMax with floats CUDA Programming and Performance	12	34343	July 11, 2015
AtomicMax Im getting error while compiling my code due to atomicMax CUDA Programming and Performance	3	1164	November 15, 2010
Why doesn't runtime library provide atomicMax nor atomicMin for float? CUDA Programming and Performance	9	1497	October 12, 2021
How to get the min/max value of float type in a kernel? CUDA Programming and Performance	2	3134	August 14, 2021
Implementation of atomicMax for float CUDA Programming and Performance cuda	5	1783	July 25, 2022
atomicMin with float CUDA Programming and Performance	8	20595	April 16, 2019
Custom atomicMax for int2 type CUDA Programming and Performance	9	3954	January 26, 2014
problems with AtomicMax! CUDA Programming and Performance	0	572	February 20, 2013
Maximum absolute value Fastest method for floating point numbers CUDA Programming and Performance	10	12665	March 8, 2010

atomicMax for float

Related topics