Fermi streams and kernels

deghost · July 20, 2010, 11:51am

Hi,

Up to 16 kernels can run simultaneously on a Fermi card (ver. 3.1). Does this hold also for executions of a kernel with different streams? The statement in the guide haven’t changed: “Different streams, on the other hand, may execute their commands out of order with respect to one another or concurrently; this behavior is not guaranteed and should therefore not be relied upon for correctness”.
Is resource distribution for concurrent kernels the same as for concurrent thread blocks?

Thanks!

SPWorley · July 20, 2010, 4:37pm

Simultaneous kernel execution only happens with kernels in different streams. Kernels in a single stream never simultaneously execute. Streams are the method you use to define serial data dependencies between kernel calls, letting the high level GPU scheduler know what kernels/copies can run simultaneously.

Greg_Ross · July 20, 2010, 10:06pm

It’s unfortunate that we cannot specify it is safe to run a specific set of kernels in the same stream concurrently though.

tmurray · July 20, 2010, 10:09pm

Why would you want a stream capable of executing out of order?

SPWorley · July 20, 2010, 10:56pm

But that’s the entire purpose of streams, to explicitly specify GPU copies and executions which are serially dependent. When you have a set of kernels that are not interdependent, then you assign them different streams to specify that fact and allow the GPU scheduler to optimize their execution.

deghost · July 22, 2010, 11:19am

My question was probably not clear enough. Nevertheless, it seems that I got the answer.
I’ll rephrase: If kernel is a ‘kernel function’ then to run 16 kernels concurrently means to run 16 different kernel functions, rather than the same function with different streams.
Its a silly question, I know, but I wanted to be sure.

Topic		Replies	Views
Easiest way to invoke two different kernels simultaneously ? CUDA Programming and Performance	4	5814	April 12, 2012
Concurrent kernels execution using streams in multiple CPU threads CUDA Programming and Performance	7	10704	June 26, 2012
Kernel scheduling with Fermi independent blocks can be placed in new streams? CUDA Programming and Performance	14	13314	January 22, 2010
Do kernels/streams execute concurrently? CUDA Programming and Performance	1	1212	October 15, 2008
Streaming Concurrent Kernels (in Fermi GPUs) ... CUDA Programming and Performance	2	1425	May 7, 2013
Distinct Kernels on Concurrent Streams? CUDA Programming and Performance	3	1252	June 9, 2009
Multiple CUFFT in different streams? CUDA Programming and Performance	7	7148	July 5, 2008
Is it possible to execute kernels in parallel CUDA Programming and Performance	9	4658	February 6, 2009
Multiple simultaneous kernels across different streams CUDA Programming and Performance	3	4594	February 3, 2009
Parallel execution of multiple kernels possible? CUDA Programming and Performance	1	1670	June 4, 2008

Fermi streams and kernels

Related topics