I am using CUDA to do 3D volume data processing.
The behavior is different and independent along X, Y, Z direction.
So I want to have three different threads running through the data.
Dose CUDA support this?
Or, Let me put in this way: I write three different thread; EdgeDetectX, EdgeDetectY, EdgeDetectZ
if I call the kernel function in such a sequence,
EdgeDetectX<<< grid, threads >>>(d_A, d_B, WIDTH);
EdgeDetectY<<< grid, threads >>>(d_A, d_B, WIDTH);
EdgeDetectZ<<< grid, threads >>>(d_A, d_B, WIDTH);
dose it mean this runs sequentially?
Is there any way I can schedule these three keneral to run at the same time?
I am new to CUDA. really appreciate you guys’ help. :D