I've been working with two-dimensional blocks, and I noticed a large number of divergent branches, so I suspect I don't understand how warps are formed in two dimensions.
In one dimension, every 32 consecutive threads form one warp. What about in two dimensions? In what order are the threads assigned to a warp?
For example:
s_data[0][0] s_data[0][1] … s_data[0][15]
s_data[1][0] s_data[1][1] … s_data[1][15]
Are these in the same warp if each element is accessed by its own thread? (Assume a block size of 16x16.)
Thanks in advance.
Loleeta
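
For reference, here is a minimal host-side sketch of how threads in a 2D block appear to be grouped into warps, based on the rule in the CUDA programming guide that a thread at index (x, y) in a block of size (Dx, Dy) has linear ID x + y * Dx, and that warps are made of consecutive linear IDs. This is plain C++, not device code: `Dim2`, `linear_thread_id`, `warp_id`, and `lane_id` are hypothetical names standing in for `threadIdx`/`blockDim` arithmetic, and the warp size of 32 is an assumption that holds on current NVIDIA GPUs. It also assumes thread (x, y) touches `s_data[y][x]`.

```cpp
// Host-side model only: Dim2 stands in for CUDA's threadIdx / blockDim (x and y parts).
struct Dim2 {
    int x;
    int y;
};

// Assumed warp size; 32 on current NVIDIA hardware.
constexpr int kWarpSize = 32;

// Linear thread ID inside a block: x varies fastest, then y.
constexpr int linear_thread_id(Dim2 thread_idx, Dim2 block_dim) {
    return thread_idx.y * block_dim.x + thread_idx.x;
}

// Warp index inside the block: consecutive groups of kWarpSize linear IDs.
constexpr int warp_id(Dim2 thread_idx, Dim2 block_dim) {
    return linear_thread_id(thread_idx, block_dim) / kWarpSize;
}

// Position of the thread inside its warp (0..kWarpSize-1).
constexpr int lane_id(Dim2 thread_idx, Dim2 block_dim) {
    return linear_thread_id(thread_idx, block_dim) % kWarpSize;
}

// 16x16 block from the question: rows y = 0 and y = 1 fill warp 0.
constexpr Dim2 kBlock16x16{16, 16};
static_assert(warp_id(Dim2{0, 0}, kBlock16x16) == 0, "first thread of row 0 is in warp 0");
static_assert(warp_id(Dim2{15, 1}, kBlock16x16) == 0, "last thread of row 1 is still in warp 0");
static_assert(warp_id(Dim2{0, 2}, kBlock16x16) == 1, "row 2 starts warp 1");
```

Under these assumptions, each warp of a 16x16 block covers two full rows, so the threads behind `s_data[0][0..15]` and `s_data[1][0..15]` would share a warp. A branch that depends only on `threadIdx.y / 2` would then stay uniform within a warp, while a branch on `threadIdx.x`, or on whether `threadIdx.y` is odd or even, would split it.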