Multiple streams.

wuninsu · June 22, 2011, 2:02am

for (int i = 0; i < 2; ++i) {
    cudaMemcpyAsync(inputDevPtr + i * size, hostPtr + i * size,
                    size, cudaMemcpyHostToDevice, stream[i]);
    MyKernel<<<100, 512, 0, stream[i]>>>
        (outputDevPtr + i * size, inputDevPtr + i * size, size);
    cudaMemcpyAsync(hostPtr + i * size, outputDevPtr + i * size,
                    size, cudaMemcpyDeviceToHost, stream[i]);
}

==========================================================================================================

I came across this code. Is it better than the single-stream case?
If it is better, why?
I wonder whether the multiple streams run concurrently.

brano · June 22, 2011, 8:18am

Hi,

By "one stream" I guess you mean executing everything on the default stream 0. Compared with that, using several streams has a number of benefits:

Asynchronous memory copies.
Overlap of memory copies and kernel execution.
Concurrent execution of kernels.
Because the calls do not require synchronization from the CPU's point of view, you can keep the GPU fully occupied with work.
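Whether copies really overlap with kernels, and whether kernels run concurrently, depends on the GPU. A minimal sketch of how to check this, assuming device 0 is the GPU of interest; `queryOverlapSupport` is a hypothetical helper name, while `asyncEngineCount` and `concurrentKernels` are fields of `cudaDeviceProp`:

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical helper: reports what the device can overlap.
// Returns false if the properties could not be queried.
bool queryOverlapSupport(int device, int* copyEngines, int* concurrentKernels) {
    cudaDeviceProp prop;
    cudaError_t err = cudaGetDeviceProperties(&prop, device);
    if (err != cudaSuccess) {
        std::fprintf(stderr, "cudaGetDeviceProperties failed: %s\n",
                     cudaGetErrorString(err));
        return false;
    }
    *copyEngines = prop.asyncEngineCount;        // 0, 1 or 2
    *concurrentKernels = prop.concurrentKernels; // 0 or 1
    return true;
}

int main() {
    int copyEngines = 0;
    int concurrentKernels = 0;
    // Assumption: device 0 is the GPU we care about.
    if (!queryOverlapSupport(0, &copyEngines, &concurrentKernels)) {
        return 1;
    }
    std::printf("asyncEngineCount:  %d\n", copyEngines);
    std::printf("concurrentKernels: %d\n", concurrentKernels);
    return 0;
}
```

An `asyncEngineCount` of 0 means copies cannot overlap with kernels, 1 means a copy can overlap with a kernel, and 2 means copies in both directions can overlap with a kernel at the same time. A `concurrentKernels` value of 1 means the device can run several kernels simultaneously.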

Within each stream, the operations execute in the order in which they were issued.
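For reference, here is how the loop from the question might look as a complete program. This is only a sketch under a few assumptions: the kernel body (doubling each element) is a made-up stand-in for `MyKernel`, `runTwoStreams` and `CHECK` are hypothetical names, and the element count `n` is kept separate from the byte count, which the original snippet folds into a single `size`. The host buffer is allocated with `cudaMallocHost` because `cudaMemcpyAsync` needs page-locked host memory to overlap with other work.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Hypothetical helper macro: abort with a message if a runtime call fails.
#define CHECK(call)                                                   \
    do {                                                              \
        cudaError_t err_ = (call);                                    \
        if (err_ != cudaSuccess) {                                    \
            std::fprintf(stderr, "%s:%d: %s\n", __FILE__, __LINE__,   \
                         cudaGetErrorString(err_));                   \
            std::exit(1);                                             \
        }                                                             \
    } while (0)

// Stand-in kernel (assumption): out[i] = 2 * in[i].
// The grid-stride loop lets the fixed <<<100, 512>>> launch cover any n.
__global__ void MyKernel(float* out, const float* in, int n) {
    for (int i = blockIdx.x * blockDim.x + threadIdx.x; i < n;
         i += gridDim.x * blockDim.x) {
        out[i] = 2.0f * in[i];
    }
}

// Processes 2 * n floats in two streams; returns the number of wrong results.
int runTwoStreams(int n) {
    const int nStreams = 2;
    const size_t bytes = n * sizeof(float);  // bytes handled per stream

    float* hostPtr = NULL;
    float* inputDevPtr = NULL;
    float* outputDevPtr = NULL;
    CHECK(cudaMallocHost((void**)&hostPtr, nStreams * bytes));  // pinned host memory
    CHECK(cudaMalloc((void**)&inputDevPtr, nStreams * bytes));
    CHECK(cudaMalloc((void**)&outputDevPtr, nStreams * bytes));
    for (int i = 0; i < nStreams * n; ++i) hostPtr[i] = (float)i;

    cudaStream_t stream[nStreams];
    for (int i = 0; i < nStreams; ++i) CHECK(cudaStreamCreate(&stream[i]));

    // Each stream runs copy-in, kernel, copy-out in issue order on its own slice.
    for (int i = 0; i < nStreams; ++i) {
        CHECK(cudaMemcpyAsync(inputDevPtr + i * n, hostPtr + i * n,
                              bytes, cudaMemcpyHostToDevice, stream[i]));
        MyKernel<<<100, 512, 0, stream[i]>>>
            (outputDevPtr + i * n, inputDevPtr + i * n, n);
        CHECK(cudaGetLastError());
        CHECK(cudaMemcpyAsync(hostPtr + i * n, outputDevPtr + i * n,
                              bytes, cudaMemcpyDeviceToHost, stream[i]));
    }

    // The calls above return immediately; wait before reading hostPtr.
    for (int i = 0; i < nStreams; ++i) CHECK(cudaStreamSynchronize(stream[i]));

    int mismatches = 0;
    for (int i = 0; i < nStreams * n; ++i) {
        if (hostPtr[i] != 2.0f * (float)i) ++mismatches;
    }

    for (int i = 0; i < nStreams; ++i) CHECK(cudaStreamDestroy(stream[i]));
    CHECK(cudaFree(inputDevPtr));
    CHECK(cudaFree(outputDevPtr));
    CHECK(cudaFreeHost(hostPtr));
    return mismatches;
}

int main() {
    int mismatches = runTwoStreams(1 << 20);
    std::printf("mismatches: %d\n", mismatches);
    return mismatches == 0 ? 0 : 1;
}
```

The ordering guarantee only holds inside a stream, so the kernel in `stream[1]` may run while `stream[0]` is still copying its results back. That is why the host has to synchronize before it reads `hostPtr`.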

One drawback of streams I can think of is the extra memory you need, since each stream works on its own set of buffers.
