I can get the grid and block sizes via Nsight Compute, but not the sizes of the input matrices passed to the kernel. Could you please show how to obtain them with an example, or share a link that covers this?