So if I have a float* array and I want multiple threads in a kernel to add to it, can they do this just by writing something like array[i] = array[i] + newValue? I don't care what order the values are added in, as long as they have all been added by the end of execution.
const int ix = blockDim.x * blockIdx.x + threadIdx.x;
array[columnID] = array[columnID] + gpuA[ix];
columnID is defined separately, so just assume that part is working.
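
For context on why this matters: if two threads share the same columnID, a plain read-modify-write like the line above can lose updates, because both threads may read the old value before either one writes. The usual tool for this is atomicAdd, which supports float on devices of compute capability 2.0 and later. Below is a minimal sketch, not the actual kernel. The kernel name accumulate, the parameters n and numColumns, and the mapping columnID = ix % numColumns are all assumptions made up so the example is self-contained.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel: every thread adds its gpuA element into array[columnID].
// The columnID mapping (ix % numColumns) is an assumption for illustration only.
__global__ void accumulate(float* array, const float* gpuA, int n, int numColumns)
{
    const int ix = blockDim.x * blockIdx.x + threadIdx.x;
    if (ix >= n) return;  // guard threads past the end of gpuA

    const int columnID = ix % numColumns;

    // Atomic read-modify-write: concurrent adds to the same slot are not lost.
    atomicAdd(&array[columnID], gpuA[ix]);
}

int main()
{
    const int n = 1024;
    const int numColumns = 4;

    float* array = nullptr;
    float* gpuA = nullptr;
    // Managed memory keeps the sketch short; cudaMalloc + cudaMemcpy works too.
    cudaMallocManaged(&array, numColumns * sizeof(float));
    cudaMallocManaged(&gpuA, n * sizeof(float));

    for (int i = 0; i < numColumns; ++i) array[i] = 0.0f;
    for (int i = 0; i < n; ++i) gpuA[i] = 1.0f;

    accumulate<<<(n + 255) / 256, 256>>>(array, gpuA, n, numColumns);
    cudaDeviceSynchronize();  // wait for the kernel before reading results on the host

    // With n = 1024 ones spread over 4 columns, each slot should hold 256.0.
    for (int i = 0; i < numColumns; ++i)
        printf("array[%d] = %.1f\n", i, array[i]);

    cudaFree(array);
    cudaFree(gpuA);
    return 0;
}
```

Since the order of the adds is not fixed, float rounding can make the result differ slightly from run to run on general data. If many threads hit the same few slots, the atomics serialize, so a per-block reduction in shared memory followed by one atomicAdd per block is often faster.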