I have a question :
How many times can I change the execution in one program ? I am using couple of kernels and each of them needs different sizes of threads and blocks
for instance : Can I do it like that ?
BLOCK_SIZE = 16, BLOCK_DIM = 32.
// FIRST setup execution parameters dim3 threads(BLOCK_SIZE, BLOCK_SIZE); dim3 grid(WC / threads.x, HC / threads.y); // execute the FIRST kernel multiplication l<<< grid, threads >>>(d_C, d_A, d_X, WA, WX, HA, 0);
// SECOND setup execution parameters
dim3 threads(BLOCK_DIM, BLOCK_DIM, 1);
dim3 grid(size_x / BLOCK_DIM, size_y / BLOCK_DIM, 1);
// execute the SECOND kernel
transpose<<< grid, threads >>>(d_Xt, h_C, WXt, HXt);//half_data_size)
I am confused because in this case I muliply declaration of :
Thank you for your help :)