AtomicXor...How do does it work?

baffa · April 3, 2008, 9:44pm

Hi all,

I have read the description of the AtomicXor function however im not quite sure on the parameters that the function requires? Could someone explain how it works or point me to an example of it in use please? That would be great.

All i am trying to do is Xor two unsigned chars together, however I do understand the the function works only with integers, but this is not a problem for the application in which im using the function.

Thanks

baffa

wumpus · April 4, 2008, 6:50pm

Well, if you can atomicXor an integer, you can also do a character by using shifts (atomic on 4-byte level is atomic on 1-byte level by definition)

The parameters and usage of the atomic functions is described in the SDK documentation. Basically you pass an address and a value, in that order, that’s all :)

baffa · April 6, 2008, 5:53pm

I had read the SDK documentation however im still not understanding the address and the the value parameters?

For example if i was trying to do this for example

int i, x, y;

i = x ^ y;

How would that work with the atmoic function?

MisterAnderson42 · April 6, 2008, 8:45pm

i would need to be in device memory (initialized to x) and you would pass the &i in the address parameter and y in the value parameter.

Are you sure you need the atomic xor? The atomic ops are only needed if you have many threads all trying to xor the value at the same time.

baffa · April 6, 2008, 10:58pm

Well to be honest im not sure i need it now. Can i use the regular bitwise operators as usual if i only want to use the value once then?

MisterAnderson42 · April 6, 2008, 11:15pm

Yep, you can use all regular operations on integers just declared as “int i” etc… These operations are then performed in each thread individually. For that matter, you can do practically anything C syntax allows to any variable declared like that: CUDA is a full fledged compiler. On the device you are really only limited by resources, and you of course can’t make calls to the standard library (or others). Math library functions are limited to those listed in the CUDA programming manual (which includes most everything you can think of: sin/cos/exp, etc…).

Atomic operations are for when you have multiple threads trying to modify the same integer and you need to read, modify, then write all in one atomic (unbreakable) operation. It’s usually best to avoid such situations at all possible, since they incur a large performance penalty.

baffa · April 7, 2008, 8:06am

Yep, you can use all regular operations on integers just declared as “int i” etc… These operations are then performed in each thread individually. For that matter, you can do practically anything C syntax allows to any variable declared like that: CUDA is a full fledged compiler. On the device you are really only limited by resources, and you of course can’t make calls to the standard library (or others). Math library functions are limited to those listed in the CUDA programming manual (which includes most everything you can think of: sin/cos/exp, etc…).

Atomic operations are for when you have multiple threads trying to modify the same integer and you need to read, modify, then write all in one atomic (unbreakable) operation. It’s usually best to avoid such situations at all possible, since they incur a large performance penalty.

[snapback]357763[/snapback]

Ahhh, ok thanks very much Mister! That makes sense External Media

but im trying to Xor elements in two different arrays in a device function. If i do this int the global kernel function this seems to work fine. However if i call a subsequent device function and perform the same Xor, it appears to have no effect? whys that?

eg working:

device int array1[16];

device int array2[16];

global void functionA(){

for (int i =0; i<=15; i++){

        array1[i] = array1[i]^array2[i];

  }

}

eg not working:

device int array1[16];

device int array2[16];

device void functionB(int array1, int array2){

int temp [16];

for (int l = 0; l<= 15; l++){

	temp[l] = array1[l];

}



for (int k = 0; k<=15; k++){

	array1[k] = temp[k]^array2[k];

}

}

global void functionA(){

functionB(array1,array2);

}

Topic		Replies	Views
atomicAnd function for an unsigned short value CUDA Programming and Performance	7	1808	February 22, 2012
atomicXor misaligned memory access?! CUDA Programming and Performance	1	304	March 26, 2020
atomicMin on Char? Is there a way to compare char to in to use atomicMin? CUDA Programming and Performance	5	12269	May 11, 2011
Problem with Tesla D870 compute capability CUDA Programming and Performance	4	1796	December 30, 2008
Useful Arbitrary Atomic Operation Hack CUDA Programming and Performance	0	10056	July 20, 2008
Atomic operations for multi-GPU Is it possible to do that? CUDA Programming and Performance	9	8131	August 27, 2009
bitwise atomic operator atomicOr data type size_t CUDA Programming and Performance	9	2066	July 16, 2019
question on atomics CUDA Programming and Performance	2	916	May 4, 2011
Are long integer assignments atomic? Atomicity of assignment operator CUDA Programming and Performance	3	5237	May 9, 2011
atomics function , what is it ? CUDA Programming and Performance	6	4043	October 12, 2010

AtomicXor...How do does it work?

Related topics