CUDA OpenCV questions

Adam27X · November 18, 2010, 5:41am

Hi everyone. I’m another person new to CUDA (and to the forum) looking for some information from more experienced users and this is my first post, so go easy External Image

I’m trying to convert a lengthy C++ video processing algorithm to CUDA. The current algorithm is an MFC application that uses OpenCV functions quite a bit. As I’ve been go through the algorithm, the parts that seem best for parallelization are for loops that initialize buffers or extract RGB values for calculations.

Firstly, for clarification: I’ll be using a GTX480 card - does anyone know if GPUCV supports this card and will be of any use to me here? Their website isn’t clear to me although I realize that this forum isn’t the best place to ask this question.

Secondly, if I have a for loop that goes through every pixel of each frame and uses OpenCV functions such as cvQueryFrame() and cvGet2D() to extract RGB values how should I go about converting such a loop to a kernel? I can’t use these host functions within the global kernel. I’m asking this question because I feel like I’m not the only person who has run into this situation and there’s likely an answer out there that I have not been able to find.

Thanks!

Adam27X · November 18, 2010, 5:41am

Hi everyone. I’m another person new to CUDA (and to the forum) looking for some information from more experienced users and this is my first post, so go easy External Image

I’m trying to convert a lengthy C++ video processing algorithm to CUDA. The current algorithm is an MFC application that uses OpenCV functions quite a bit. As I’ve been go through the algorithm, the parts that seem best for parallelization are for loops that initialize buffers or extract RGB values for calculations.

Firstly, for clarification: I’ll be using a GTX480 card - does anyone know if GPUCV supports this card and will be of any use to me here? Their website isn’t clear to me although I realize that this forum isn’t the best place to ask this question.

Secondly, if I have a for loop that goes through every pixel of each frame and uses OpenCV functions such as cvQueryFrame() and cvGet2D() to extract RGB values how should I go about converting such a loop to a kernel? I can’t use these host functions within the global kernel. I’m asking this question because I feel like I’m not the only person who has run into this situation and there’s likely an answer out there that I have not been able to find.

Thanks!

Crankie · November 19, 2010, 5:45am

Hi everyone. I’m another person new to CUDA (and to the forum) looking for some information from more experienced users and this is my first post, so go easy External Image

I’m trying to convert a lengthy C++ video processing algorithm to CUDA. The current algorithm is an MFC application that uses OpenCV functions quite a bit. As I’ve been go through the algorithm, the parts that seem best for parallelization are for loops that initialize buffers or extract RGB values for calculations.

Firstly, for clarification: I’ll be using a GTX480 card - does anyone know if GPUCV supports this card and will be of any use to me here? Their website isn’t clear to me although I realize that this forum isn’t the best place to ask this question.

Secondly, if I have a for loop that goes through every pixel of each frame and uses OpenCV functions such as cvQueryFrame() and cvGet2D() to extract RGB values how should I go about converting such a loop to a kernel? I can’t use these host functions within the global kernel. I’m asking this question because I feel like I’m not the only person who has run into this situation and there’s likely an answer out there that I have not been able to find.

Thanks!

You may visit http://cuvilib.com/ . They have implemented some of the OpenCV stuff on GPU.

Crankie · November 19, 2010, 5:45am

Hi everyone. I’m another person new to CUDA (and to the forum) looking for some information from more experienced users and this is my first post, so go easy External Image

I’m trying to convert a lengthy C++ video processing algorithm to CUDA. The current algorithm is an MFC application that uses OpenCV functions quite a bit. As I’ve been go through the algorithm, the parts that seem best for parallelization are for loops that initialize buffers or extract RGB values for calculations.

Firstly, for clarification: I’ll be using a GTX480 card - does anyone know if GPUCV supports this card and will be of any use to me here? Their website isn’t clear to me although I realize that this forum isn’t the best place to ask this question.

Secondly, if I have a for loop that goes through every pixel of each frame and uses OpenCV functions such as cvQueryFrame() and cvGet2D() to extract RGB values how should I go about converting such a loop to a kernel? I can’t use these host functions within the global kernel. I’m asking this question because I feel like I’m not the only person who has run into this situation and there’s likely an answer out there that I have not been able to find.

Thanks!

You may visit http://cuvilib.com/ . They have implemented some of the OpenCV stuff on GPU.

Adam27X · November 19, 2010, 6:59pm

Thank you very much Crankie. I’ll definitely try to integrate this with my project soon and hopefully I won’t have to ask more questions about it (a lot more, at least :) ).

Adam27X · November 19, 2010, 6:59pm

Thank you very much Crankie. I’ll definitely try to integrate this with my project soon and hopefully I won’t have to ask more questions about it (a lot more, at least :) ).

salmanulhaq · November 30, 2010, 2:28am

What exact algorithm/filtering are you looking for? If you have any CUVI related questions then do leave them on the CUVI Forums for quick replies.

Adam27X · November 30, 2010, 6:21pm

Well right now I’m working on something a bit more basic (but I’m new to CUDA so I wouldn’t consider it easy) and in a few months I’ll be working with algorithms that are more focused on actual video processing.

Basically I’m trying to take a loop like this:

for(int num = 0; num < numFrame; ++num)

			{

				int count = 0;

				while(1)

				{

					frame = cvQueryFrame(capture);

					if(count == 0)

					{

						for(int i = 0; i < frame->width; ++i)

						{

							for(int j = 0; j < frame->height; ++j)

							{

								temp = cvGet2D(frame, j, i); //Gets RGB values for pixel(j,i)

								model[i][j][0] += temp.val[0]/numFrame;

								model[i][j][1] += temp.val[1]/numFrame;

								model[i][j][2] += temp.val[2]/numFrame;

							}

						}

					}

					count++;

					if(count == 1)

						break;

					cvWaitKey(37);

				}

			}

temp is declared as: CvScalar temp = {0};

model is declared as:

double ***model;

			model = new double**[frame->width];

			for(int i = 0; i < frame->width; ++i)

			{

				model[i] = new double*[frame->height];

				for(int j = 0; j < frame->height; ++j)

				{

					model[i][j] = new double[3];

				}

			}

…and convert it for use on a GPU like this:

for(int num = 0; num < numFrame; ++num)

			{

				int count = 0;

				while(1)

				{

					frame = cvQueryFrame(capture);

					if(count == 0)

					{

						GPU_Wrapper();

					}

					count++;

					if(count == 1)

						break;

					cvWaitKey(37);

				}

			}

So basically, the kernel would ideally call cvGet2D and do the matrix addition but I can’t call cvGet2D from within a kernel. I currently have 480 threads and 640 blocks for the use of this program on a 640x480 video such that each pixel’s RGB values can be extracted from cvGet2D and added to the model in parallel. I’m not sure if CUVI will be of particular help here, though.

Topic		Replies	Views
CUDA & OpenCV CUDA Programming and Performance	10	31552	December 2, 2010
Beginner To CUDA CUDA Programming and Performance	7	5142	September 17, 2008
Any example on real time video processing CUDA Programming and Performance	12	3835	January 6, 2012
What can't you do in CUDA that you'd like? Requests for the future CUDA Programming and Performance	407	134551	May 26, 2010
CUDA very slow performance CUDA Programming and Performance	21	16467	March 6, 2020
Neural network on GPU, physics on CPU? CUDA Programming and Performance	13	5648	October 6, 2013
Real Time image Processing CUDA CUDA Programming and Performance	6	7161	May 8, 2012
Wishlist Place your considered suggestions here CUDA Programming and Performance	201	204313	April 13, 2009
CUDA to run a virtual machine? CUDA Programming and Performance	17	31125	April 15, 2010
New to Tesla/CUDA questions Just a few questions. CUDA Programming and Performance	7	7915	October 24, 2007

CUDA OpenCV questions

Related topics