Is there a way to record the execution sequence of the threads in a block?As like this in the xxx_kernel.cu:
int tid = threadIdx.y*blockDim.x+threadIdx.x;
Since the threads executed independently,when a thread arrived the __syncthreads()?How can i record the order? I have trid ,but no result.Maybe it is impossible to do that ?
Any reply is appreciated!