I still get confused by the [grid, block, thread] configuration of a kernel launch (amazing!).
The maximum ‘x’ dimension values are [2^31 - 1, 1024, 1024] respectively for compute capability 3.0 (e.g. a GeForce GTX 680).
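These limits can be read from the device itself rather than from a table. Below is a minimal sketch using `cudaGetDeviceProperties`; it assumes the card of interest is device 0, and the printed values depend on whatever GPU is installed.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

int main()
{
    cudaDeviceProp prop;
    // Assumption: the GPU of interest is device 0.
    cudaError_t err = cudaGetDeviceProperties(&prop, 0);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaGetDeviceProperties failed: %s\n", cudaGetErrorString(err));
        return 1;
    }

    printf("%s (compute capability %d.%d)\n", prop.name, prop.major, prop.minor);
    // Maximum number of blocks per grid in x, y, z.
    printf("max grid size:         [%d, %d, %d]\n",
           prop.maxGridSize[0], prop.maxGridSize[1], prop.maxGridSize[2]);
    // Maximum block extent in x, y, z.
    printf("max block dimensions:  [%d, %d, %d]\n",
           prop.maxThreadsDim[0], prop.maxThreadsDim[1], prop.maxThreadsDim[2]);
    // Upper bound on x * y * z for a single block.
    printf("max threads per block: %d\n", prop.maxThreadsPerBlock);
    return 0;
}
```

Note that the per-block thread limit applies to the product of the three block dimensions, not to each dimension separately.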
It seems to me that the kernel needs 3 parameters, but only 2 are ever used, right?
I’m actually launching a kernel with this configuration:
How is this read? 1 grid with 1250000 blocks, each block with 1024 threads? Isn’t 1024 the maximum ‘x’ dimension for blocks?
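To make that reading concrete, here is a minimal sketch that assumes a launch of the form `<<<1250000, 1024>>>`. The kernel name `countThreads`, the helper `totalThreads`, and the counter are made up for illustration. The first launch argument is the number of blocks in the (single) grid and the second is the number of threads in each block.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel: one thread per block adds that block's thread count
// to a global counter, so the host can see how many threads were launched.
__global__ void countThreads(unsigned long long *total)
{
    if (threadIdx.x == 0) {
        atomicAdd(total, (unsigned long long)blockDim.x);
    }
}

// Hypothetical host helper: threads in one grid = blocks * threads per block.
unsigned long long totalThreads(unsigned int blocks, unsigned int threadsPerBlock)
{
    return (unsigned long long)blocks * threadsPerBlock;
}

int main()
{
    const unsigned int blocks = 1250000;        // grid size in x (blocks per grid)
    const unsigned int threadsPerBlock = 1024;  // block size in x (threads per block)

    unsigned long long hTotal = 0;
    unsigned long long *dTotal = nullptr;
    if (cudaMalloc(&dTotal, sizeof(hTotal)) != cudaSuccess) return 1;
    cudaMemcpy(dTotal, &hTotal, sizeof(hTotal), cudaMemcpyHostToDevice);

    // One launch creates exactly one grid.
    countThreads<<<blocks, threadsPerBlock>>>(dTotal);
    cudaError_t err = cudaGetLastError();            // catches invalid launch configurations
    if (err == cudaSuccess) err = cudaDeviceSynchronize();
    if (err != cudaSuccess) {
        fprintf(stderr, "launch failed: %s\n", cudaGetErrorString(err));
        cudaFree(dTotal);
        return 1;
    }

    cudaMemcpy(&hTotal, dTotal, sizeof(hTotal), cudaMemcpyDeviceToHost);
    printf("expected %llu threads, counted %llu\n",
           totalThreads(blocks, threadsPerBlock), hTotal);
    cudaFree(dTotal);
    return 0;
}
```

A grid of 1250000 blocks in x is above the 65535 limit of devices older than compute capability 3.0, so this sketch assumes the code is built for and run on a 3.0 or newer device.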
What if I want 3 grids with 16 blocks, each block with 32 threads… what is the proper configuration for these parameters?
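One possible reading, sketched under the assumption that a single launch always creates exactly one grid: "3 grids" would then mean three separate launches of `<<<16, 32>>>`. The kernel name `tagSlice` and the output layout are hypothetical and only serve to show which launch each thread came from.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel: every thread writes the id of the launch ("grid")
// it belongs to into its own slot of the slice it was given.
__global__ void tagSlice(int *slice, int gridId)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;  // 0..511 within one launch
    slice[i] = gridId;
}

int main()
{
    const int grids = 3, blocks = 16, threads = 32;
    const int perGrid = blocks * threads;  // 512 threads per launch
    const int total = grids * perGrid;     // 1536 threads overall

    int *dOut = nullptr;
    if (cudaMalloc(&dOut, total * sizeof(int)) != cudaSuccess) return 1;

    // Three launches, i.e. three grids, each writing to its own slice.
    for (int g = 0; g < grids; ++g) {
        tagSlice<<<blocks, threads>>>(dOut + g * perGrid, g);
    }

    int hOut[total];
    // cudaMemcpy waits for the preceding launches on the default stream.
    cudaError_t err = cudaMemcpy(hOut, dOut, sizeof(hOut), cudaMemcpyDeviceToHost);
    cudaFree(dOut);
    if (err != cudaSuccess) {
        fprintf(stderr, "CUDA error: %s\n", cudaGetErrorString(err));
        return 1;
    }

    // First element written by each of the three launches.
    printf("%d %d %d\n", hOut[0], hOut[perGrid], hOut[2 * perGrid]);
    return 0;
}
```

If the three pieces of work are really one problem, a single launch with 48 blocks of 32 threads would cover the same 1536 threads; which form fits better depends on what the kernel does.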
Thanks in advance for the noob question :)