Hi all,
I’m noob at Cuda Parallel Programming.I’m using prefix scan algorithm for integral image but that should be transformed does not change. I think I do wrong when I call kernel function. When I call kernel function my dimblock is 16 my dimgrid is
dim3 dimGrid((int)ceil(height / dimBlock.x), (int)ceil(width / dimBlock.y))
Thanks in advance.