Metric(s) for measuring atomic operations' hit/miss

Is there a metric for measuring how many times an atomic operation was blocked. I created a kernel that purposefully generated many blocks on atomicAdd of global memory. Can nv-nsight-cu count the number of atomic blocks/hits when executing this kernel.

global void run_atomic(float *C, size_t s)
{
int index = (blockDim.x * blockIdx.x + threadIdx.x);
if (index>=s-1) return;
atomicAdd(C+s-1, C[index]);
}

I assume you are asking for the additional cycles caused by contention for the memory address referenced by atomicAdd? In this case, no, there are no metrics that measure this.