How to make GPU and CPU work at the same time

JasonJuang · December 28, 2010, 8:55pm

Hi everyone,
I am doing some image processing stuff to images captured by webcam. What I want to achieve is

CPU: capture frame t0~t4—>capture frame t5~t9–>display frame t0~t4,capture frame t10~t14–>display frame t5~t9,capture frame t15~t19…

GPU: Idle—> compute frame t0~t4–>compute frame t5~t9 -->compute frame t10~t14…

I am able to do this sequentially,
CPU capture frame t0~t4–> GPU compute frame t0~t4–> CPU display t0~t4–> CPU capture frame t5~t9–> GPU compute frame t5~t9–> CPU display t5~t9–>…

There will be relatively large time difference between t4 and t5, and frame t0~t4 is very identical to each other. I think CPU and GPU should be able to work together technically, but I somehow can’t figure it out. Any tips?

Lev · December 28, 2010, 9:01pm

They should. Just launch kernell and use async functions. Also, you need to check OS.

JasonJuang · December 28, 2010, 9:17pm

Can you give me a more detailed explanation or an example?

Lets say I have main.cpp and compute.cu,

main.cpp looks like this,

for(i=0;;i+=5){

Cam capture(t(i)~t(i+4));

compute(t(i)~t(i+4));

Cam display(t(i)~t(i+4));

}

how do you make the loop go on after compute(); is called?

Lev · December 28, 2010, 9:20pm

kernell launches should be async

kernell<<<>>>

capture next frame

cudasyncthreads();

display current frame

kernell and capture would work in parallel.

JasonJuang · December 28, 2010, 9:29pm

Thank you for your response.

I have a few kernel calls in my compute.cu, and they have to be launched in order. Does that matters?

Do you mean the frame capture code also has to be written in a cu file? Because I am using OpenCV functions to capture images from webcam, and it won’t compile on nvcc. If so, how do you solve this problem?

Thanks

Lev · December 28, 2010, 9:42pm

kernell launch function returns control to cpu code just after call, so cpu and gpu work in parallel. But it is somehow OS dependent. What is your OS?

JasonJuang · December 28, 2010, 9:45pm

My OS is Windows 7 64-bit. When you say cpu code, do you mean the code in .cpp file or the code in .cu file?

Lev · December 28, 2010, 10:15pm

It does not matter. kernell <<<>>> returns just after launch while gpu is still working. However, on win7 gpu calls are batched, so need additional tricks.

JasonJuang · December 28, 2010, 10:18pm

What are the tricks?

Lev · December 28, 2010, 10:31pm

run a lot dummy kernells or other async calls to ensure that actuall call to gpu had been made.

tmurray · December 28, 2010, 11:27pm

cuStreamQuery(0)

Lev · December 28, 2010, 11:49pm

Is it work in run time API?

tmurray · December 28, 2010, 11:56pm

cudaStreamQuery(0) :P

Lev · December 29, 2010, 12:05am

“Returns cudaSuccess if all operations in stream have completed, or cudaErrorNotReady if not.”

It is hard to figure out that this function sends the batch to gpu.

tmurray · December 29, 2010, 12:06am

Which is intentional. Most of the time, you shouldn’t try to manage batching yourself.

Dittoaway · December 29, 2010, 1:33pm

This came up in another thread. If the documentation is going to claim that kernel calls are asynchronous when they aren’t in Windows because of batching, that should be explained and the workaround given.

Topic		Replies	Views
How to Launch Cuda kernel in different processes CUDA Programming and Performance	8	3775	November 6, 2018
Concurrent CPU and GPU processing Jetson TX1	12	1658	October 18, 2021
Overlapping GPU and CPU computation? CUDA Programming and Performance	9	1246	November 19, 2010
How to effectively parallelize cuda kernel launches on CPU CUDA Programming and Performance	9	3110	January 19, 2018
Asynchronous performance between CPU and GPU CUDA Programming and Performance	3	2384	June 18, 2012
multi task parallelization with cuda streams ? CUDA Programming and Performance	7	1471	September 14, 2017
Concurrent kernel execution without stream CUDA Programming and Performance	7	2460	December 28, 2016
Concurrent Kernels Bug / Undocumented Behavior (Urgent) need info on "simple" problem with c CUDA Programming and Performance	2	910	June 18, 2010
Heterogenour programming CUDA Programming and Performance	4	1825	November 24, 2008
GPU and CPU don't run in (pure) parallel ? CUDA Programming and Performance	24	20157	May 4, 2007

How to make GPU and CPU work at the same time

Related topics