Hi everyone. I’m another person new to CUDA (and to the forum) looking for some information from more experienced users and this is my first post, so go easy
I’m trying to convert a lengthy C++ video processing algorithm to CUDA. The current algorithm is an MFC application that uses OpenCV functions quite a bit. As I’ve been go through the algorithm, the parts that seem best for parallelization are for loops that initialize buffers or extract RGB values for calculations.
Firstly, for clarification: I’ll be using a GTX480 card - does anyone know if GPUCV supports this card and will be of any use to me here? Their website isn’t clear to me although I realize that this forum isn’t the best place to ask this question.
Secondly, if I have a for loop that goes through every pixel of each frame and uses OpenCV functions such as cvQueryFrame() and cvGet2D() to extract RGB values how should I go about converting such a loop to a kernel? I can’t use these host functions within the global kernel. I’m asking this question because I feel like I’m not the only person who has run into this situation and there’s likely an answer out there that I have not been able to find.