Why does a kernel which contains atomic functions return correct result unless I insert a printf() to check it?

ingnaryk · March 3, 2023, 11:44am

I wrote a kernel implementing the propagation of neural network which calls atomicAdd function several times. I preset the network as zero while the input layer has the value 2.0; the weights between neurons are all 1.0. I’m sure I’ve done the cudaMalloc and cudaMemcpy work. However, the result of the output layer was always plain-zero, unless I inserted a printf() function to see what kernel calculated – nonzero was printed but still parts of the final output layer equal to zero.

__global__ void goForwardGPU(double* single_piece, double* xs, double* vals, bool* availables)
{ // GPU go forward
    // input layer -> hidden layer1
    for (int i = blockIdx.x * blockDim.x + threadIdx.x; i < layerInput; i += gridDim.x * blockDim.x) {
        for (int j = blockIdx.y * blockDim.y + threadIdx.y; j < layer2; j += gridDim.y * blockDim.y) {
            atomicAdd(&xs[layerInput + j], availables[j] * single_piece[i] * vals[i * layer2 + j]);
            // printf("%lf\n", xs[layerInput + j]);
        }
    }
    // hidden layer1 -> hidden layer2...
    // ...
}

What’s wrong with atomicAdd, or is there other problem occurred?

Topic		Replies	Views
Atomic functions problem CUDA Programming and Performance	8	1829	May 30, 2009
AtomicAdd() functions CUDA Programming and Performance	1	753	December 9, 2016
The atomic functions do not provide correct results CUDA Programming and Performance cuda	4	384	March 26, 2021
atomicAdd() during loop not work well but at end work well CUDA Programming and Performance	3	1186	May 20, 2010
atomicAdd problems. CUDA Programming and Performance	3	2346	April 13, 2011
AtomicAdd result incorrect CUDA Programming and Performance	3	1595	December 29, 2018
printf inside a kernel is not working nVIDIA Quadro 4000 CUDA Programming and Performance	2	3707	November 7, 2011
Problems after atomicAdd CUDA Programming and Performance	0	9245	June 28, 2011
Atomic add fails to update the pointer CUDA NVCC Compiler	1	324	October 26, 2023
How can I make sure atomicAdd() was successful? CUDA Programming and Performance	4	3363	March 12, 2017

Why does a kernel which contains atomic functions return correct result unless I insert a printf() to check it?

Related topics