Bitwise atomic operator atomicOr for the size_t data type

Hello.

I need an atomicOr for a 64-bit unsigned integer data type, specifically size_t.

But CUDA supports only the unsigned long long int data type.

I can't get size_t and the unsigned long long type to be compatible.

So I want to use code like this:

__device__ double atomicAdd(double* address, double val)
{
    // Reinterpret the double's bits as a 64-bit integer so atomicCAS can be used.
    unsigned long long int* address_as_ull =
                                          (unsigned long long int*)address;
    unsigned long long int old = *address_as_ull, assumed;
    do {
        assumed = old;
        old = atomicCAS(address_as_ull, assumed, 
                        __double_as_longlong(val + 
                        __longlong_as_double(assumed)));
    // Retry if another thread changed the value between the read and the CAS.
    } while (assumed != old);
    return __longlong_as_double(old);
}

The CUDA API doesn't have an atomicAdd for the double data type, so this workaround exists.
Can I implement a size_t atomicOr the same way?
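That is, something like this? (Untested; atomicOr64 is just a name I made up, following the atomicCAS pattern above. I believe 64-bit atomicCAS needs compute capability 1.2 or higher.)

```cpp
#include <cstddef>

#ifdef __CUDACC__  // device code only compiles under nvcc
// Hypothetical wrapper: emulate a 64-bit atomicOr for size_t with atomicCAS,
// mirroring the double atomicAdd pattern above.
__device__ size_t atomicOr64(size_t* address, size_t val)
{
    unsigned long long int* address_as_ull =
                                          (unsigned long long int*)address;
    unsigned long long int old = *address_as_ull, assumed;
    do {
        assumed = old;
        // OR the desired bits into the last value we observed.
        old = atomicCAS(address_as_ull, assumed,
                        assumed | (unsigned long long int)val);
    } while (assumed != old);
    return (size_t)old;  // previous value, matching atomicOr semantics
}
#endif
```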

Add)

And how do I use atomicOr with unsigned long long int? Could you show an example?
unsigned int (32-bit) and int (32-bit) compile, but unsigned long long int (64-bit) causes an error… I don't know the reason.

What is this supposed to mean? On all platforms supported by CUDA, both ‘size_t’ and ‘unsigned long long int’ are unsigned 64-bit integer types, and the former might even just be a typedef of the latter. So casting in either direction should be perfectly fine.
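For instance, a thin casting wrapper should be all that's needed (a sketch; atomicOrSize is a name I'm making up here, and note the 64-bit atomicOr overload itself requires compute capability 3.5 or higher):

```cpp
#include <cstddef>

// The cast is only safe if the two types really are the same size.
static_assert(sizeof(size_t) == sizeof(unsigned long long int),
              "size_t and unsigned long long int must be the same size");

#ifdef __CUDACC__  // device code only compiles under nvcc
__device__ size_t atomicOrSize(size_t* address, size_t val)
{
    // Forward to the native 64-bit overload (compute capability 3.5+).
    return (size_t)atomicOr((unsigned long long int*)address,
                            (unsigned long long int)val);
}
#endif
```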

But when I use size_t in atomicOr, I get this error message:
error: no instance of overloaded function “atomicOr” matches the argument list argument types are: (size_t *, size_t)

And when I use unsigned long long in atomicOr, I get this error message:
error: no instance of overloaded function “atomicOr” matches the argument list argument types are: (unsigned long long *, unsigned long long)

But when I use int or unsigned int in atomicOr, there is no compile error.

That’s a different issue than the one I understood you were raising in your earlier post. Not all functions are necessarily overloaded for all data types. There may be limitations due to GPU architecture. From the CUDA Programming Guide:

https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#atomicor

You would want to specify the architecture of your GPU when building the code. The compiler likely defaults to a build target of compute capability 3.0.
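If the code must also build for older targets, you can branch on __CUDA_ARCH__ at compile time and fall back to a CAS loop (a sketch; orBits is a hypothetical name):

```cpp
#ifdef __CUDACC__  // device code only compiles under nvcc
__device__ unsigned long long int orBits(unsigned long long int* p,
                                         unsigned long long int v)
{
#if !defined(__CUDA_ARCH__) || __CUDA_ARCH__ >= 350
    // Native 64-bit atomicOr exists from compute capability 3.5 onward.
    return atomicOr(p, v);
#else
    // Older targets: emulate with a 64-bit atomicCAS loop.
    unsigned long long int old = *p, assumed;
    do {
        assumed = old;
        old = atomicCAS(p, assumed, assumed | v);
    } while (assumed != old);
    return old;
#endif
}
#endif
```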

I already read that.
My GPU's compute capability is over 6.0 (Titan Xp).

I can understand your reply a little bit.
So I should give the compiler a parameter, but I'm confused about which one.
sm_60, something like that? Right?

A simple build command setting the target architecture explicitly would be

nvcc -arch=sm_61 -o app.exe app.cu

Please consult the compiler documentation on how to specify target architectures.

thank you so much!!!

have an awesome day!

I am really sorry…

I found this page:
https://stackoverflow.com/questions/35656294/cuda-how-to-use-arch-and-code-and-sm-vs-compute

but I can't understand how to use -gencode and -arch…

And I checked the NVIDIA CUDA Programming Guide, but I couldn't find the section on arch.

Could you give me the information?

Thank you for your kind help.

Latest NVCC documentation is here:

https://docs.nvidia.com/cuda/cuda-compiler-driver-nvcc/index.html
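In short: -arch=sm_61 is the convenient shorthand for a Titan Xp (compute capability 6.1), while the -gencode form names the virtual (compute_XX, PTX) and real (sm_XX, SASS) architectures separately. For example:

```
# shorthand: generate code for compute capability 6.1
nvcc -arch=sm_61 -o app.exe app.cu

# explicit form: arch= picks the virtual (PTX) architecture,
# code= the real (SASS) architecture(s) to embed
nvcc -gencode arch=compute_61,code=sm_61 -o app.exe app.cu
```

Please consult the nvcc manual linked above for the full details.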