How to implement a calculation pipeline with streams?

I would appreciate general advice, or a suggested scheme, for a better implementation of the following task:

  1. All the source data is copied to the GPU before any kernel call, as the data is not too big.
  2. Each kernel call processes a portion of the source data and stores its results in GPU memory, which are subsequently copied to main system RAM.
  3. Kernels are called one by one: as soon as the previous kernel finishes, the next one is launched.
  4. The CPU waits in a separate thread for each kernel to complete; as soon as a kernel finishes, the CPU processes the data it produced and then waits for the next kernel.
  5. Streams seem like a suitable solution, but it is not clear how to use them inside a loop {run kernel - acquire results - run kernel …}: are stream objects reusable, or do they have to be recreated before each kernel call, etc.? (A sketch of what I have in mind follows this list.)
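
Below is a minimal sketch of the loop I have in mind, assuming a single stream is created once and reused for every iteration, with one event recorded per chunk so the host can tell when that chunk's result has arrived. The names `processChunk`, `N_CHUNKS` and `CHUNK_SIZE` are placeholders, not anything from a real API:

```cpp
#include <cuda_runtime.h>

// Placeholder kernel: dummy per-element work on one chunk of the source.
__global__ void processChunk(const float* src, float* dst, int n, int chunk)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n)
        dst[i] = 2.0f * src[(size_t)chunk * n + i];   // stand-in computation
}

int main()
{
    const int N_CHUNKS   = 16;        // placeholder: number of kernel calls
    const int CHUNK_SIZE = 1 << 20;   // placeholder: elements per chunk

    float *d_src, *d_dst, *h_result;
    cudaMalloc(&d_src, (size_t)N_CHUNKS * CHUNK_SIZE * sizeof(float));
    cudaMalloc(&d_dst, CHUNK_SIZE * sizeof(float));
    // Pinned host memory so cudaMemcpyAsync can run truly asynchronously.
    cudaMallocHost(&h_result, (size_t)N_CHUNKS * CHUNK_SIZE * sizeof(float));

    // ... copy ALL source data to d_src once, before the loop (step 1) ...

    // One stream, created once and reused in every iteration.
    cudaStream_t stream;
    cudaStreamCreate(&stream);

    cudaEvent_t done[N_CHUNKS];
    for (int i = 0; i < N_CHUNKS; ++i) cudaEventCreate(&done[i]);

    const int threads = 256;
    const int blocks  = (CHUNK_SIZE + threads - 1) / threads;

    for (int chunk = 0; chunk < N_CHUNKS; ++chunk) {
        // Kernel, copy and event are queued in the same stream, so the copy
        // of chunk i always precedes the kernel of chunk i+1 on the device.
        processChunk<<<blocks, threads, 0, stream>>>(d_src, d_dst, CHUNK_SIZE, chunk);
        cudaMemcpyAsync(h_result + (size_t)chunk * CHUNK_SIZE, d_dst,
                        CHUNK_SIZE * sizeof(float),
                        cudaMemcpyDeviceToHost, stream);
        cudaEventRecord(done[chunk], stream);
        // All three calls return immediately; the queued work keeps the GPU busy.
    }

    cudaStreamSynchronize(stream);   // or let a worker thread consume the events

    for (int i = 0; i < N_CHUNKS; ++i) cudaEventDestroy(done[i]);
    cudaStreamDestroy(stream);
    cudaFreeHost(h_result);
    cudaFree(d_dst);
    cudaFree(d_src);
    return 0;
}
```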

Also, maybe the SDK contains a sample that does something similar? The main idea is to keep the GPU working continuously, without delays caused by CPU post-processing: the GPU runs without pause, while the CPU handles the results in a separate thread as soon as they are produced.

If the GPU works faster than the CPU, the GPU results are queued. In no case must the CPU become a bottleneck for the GPU. (A sketch of the consumer thread I imagine follows below.)
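
To keep the CPU out of the GPU's way, I imagine a consumer thread along these lines. It reuses the hypothetical `done` events and `h_result` buffer from the sketch above; results are "queued" simply because each chunk gets its own slot in pinned host memory:

```cpp
#include <cuda_runtime.h>
#include <thread>

// Hypothetical consumer thread: waits for each chunk's event, then
// post-processes that chunk while the GPU keeps working on later chunks.
void consumer(const cudaEvent_t* done, const float* h_result,
              int nChunks, int chunkSize)
{
    for (int chunk = 0; chunk < nChunks; ++chunk) {
        // Blocks only this thread until chunk's device-to-host copy is done.
        cudaEventSynchronize(done[chunk]);
        // ... CPU post-processing of h_result + (size_t)chunk * chunkSize ...
    }
}

// Intended use in main(), after the loop that enqueues all kernels/copies
// (so every event has already been recorded into the stream):
//
//   std::thread worker(consumer, done, h_result, N_CHUNKS, CHUNK_SIZE);
//   worker.join();   // replaces cudaStreamSynchronize(stream)
```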

Thanks in advance!