I just wondered why my code doesn’t do what I expect it to do :)
I’m working with the driver API for the first time and am trying to launch a kernel.
But the results of the following two statements differ:
CUresult res = cuLaunchGrid(function, m_numBlocksX, m_numBlocksY);
CU_SAFE_CALL(cuLaunchGrid(function, m_numBlocksX, m_numBlocksY));
If I just check the result, everything works correctly. But if I use the CU_SAFE_CALL macro, my image looks weird and some blocks are at the wrong position (it looks a bit like a wrong parallelization or wrong block IDs).
The kernel code is the same in both cases.
Does anybody have an idea why this could be the case?