__atomic_compare_exchange_n bug in release mode, when building with nvc++ 22.3

ishkhan23 · May 12, 2022, 8:37am

Consider two functions below.

void lock_n() {
        int raw;
    relock:
        raw = 0;
        if (!__atomic_compare_exchange_n(&ref_count, &raw, -1, true, __ATOMIC_ACQUIRE, __ATOMIC_RELAXED)) {
            thrd_yield();
            goto relock;
        }
}

void lock() {
        int raw;
        int state = -1;
    relock:
        raw = 0;
        if (!__atomic_compare_exchange(&ref_count, &raw, &state, true, __ATOMIC_ACQUIRE, __ATOMIC_RELAXED)) {
            thrd_yield();
            goto relock;
        }
}

And here is the __atomic_compare_exchange_n implementation in cuda-11.5

template<class _Type>
bool __atomic_compare_exchange_n(_Type volatile *__ptr, _Type *__expected, _Type __desired, bool __weak, int __success_memorder, int __failure_memorder) {
    return __atomic_compare_exchange(__ptr, __expected, &__desired, __weak, __success_memorder, __failure_memorder);
}

So lock_n just calls lock under the hood. This works as expected with gcc and clang.
It also works as expected when building in debug mode using nvc++, but it gives a warning for the lock_n for the -1 argument.

warning: integer conversion resulted in a change of sign

In release mode with nvc++ lock_n goes into an infinite loop, while lock works as expected.
This looks like a compiler bug.

nvc++ flags: -fast -Mvect forces -O3
Note: Bug persists both in single and multi threaded runs, so problem is not a race.

ishkhan23 · May 12, 2022, 12:14pm

Also it seems that __atomic_sub_fetch does not work either, in release mode.

__atomic_sub_fetch(&remaining, 1, __ATOMIC_RELEASE);

remaining is 1 both before and after the call.
In debug mode it is 0 after the call.

MatColgrove · May 12, 2022, 4:12pm

Thanks ishkahan for the report. Though do you have a complete reproducible example you can share?

-Mat

ishkhan23 · May 13, 2022, 1:48pm

Just put the snippets somewhere and run them. I’ve found this bug in a large closed source project.
This is a cmake project with c++20.
I do not have a complete reproducible example.

Topic		Replies	Views
Nvc++ error with <atomic> and C++20 flag nvc, nvc++ and nvfortran	1	579	June 17, 2021
__nv_bool mixup with bool CUDA NVCC Compiler cuda	6	1562	April 16, 2022
OpenACC atomic fetch construct fails for atomic-fetch-shift operations nvc, nvc++ and nvfortran	6	547	March 13, 2024
Release mode error debug CUDA Programming and Performance	11	5195	February 4, 2015
A bug related to shared variables and label in nvcc v11.7 CUDA Programming and Performance cuda , nvbugs , nvcc	3	598	May 31, 2023
Compilling with nvc++ nvc, nvc++ and nvfortran cuda	8	929	November 28, 2023
Compilation/linking error CUDA NVCC Compiler	0	554	April 6, 2022
ATOMIC operations on pointers arrays (OpenACC Fortran) nvc, nvc++ and nvfortran	3	899	December 17, 2021
Incorrect CPU results with #pragma acc atomic capture nvc, nvc++ and nvfortran	4	631	September 20, 2021
NVCC silently compiles std::swap to incorrect code (with no error or warning) in certain scenarios CUDA NVCC Compiler nvbugs	2	44	February 15, 2025

__atomic_compare_exchange_n bug in release mode, when building with nvc++ 22.3

Related topics