Hey guys, I have a question about the dimensions and size of the thread block, i.e. Db in <<<Dg, Db, Ns>>>.
In my program, when I allocate more threads in the x and y directions, I get better performance than when I give more to z.
For example, (16,16,2) is much faster than (2,2,16).
Could somebody explain the reasoning behind this?
Thanks in advance.