Question about streaming

captainp · February 18, 2008, 6:45pm

I have been reading the documentation and the simpleStream example files, but they seem to have left me with more questions than answers. My first question: the simpleStream example said that if I had a card with compute capability 1.1, it would be X amount faster. Is that only true for the memory copying in this example, or does all streaming on a 1.0 card run serially? My next question is about getting a kernel and a memory copy operation to happen at the same time. The documentation briefly says you can do this, but doesn’t give many specifics. Is this also something that only 1.1 cards can do, and is it something that must be done with the driver API?

AndreiB · February 18, 2008, 7:53pm

Yes, overlapping kernel execution and memory copy is possible only on devices with compute capability 1.1. On 1.0 devices such streams will be serialized. For the async memory functions, check the programming manual; it is something like cudaMemcpyAsync (I don’t have the Programming Manual at hand right now).
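To make the copy/kernel overlap described above concrete, here is a minimal sketch using the runtime API. It is only an illustration under some assumptions: the `scale` kernel, the chunk size, and the two-stream split are made up for the example, it is written against a current toolkit rather than the one from this thread, and error checking is left out for brevity. The host buffer is page-locked with `cudaMallocHost`, since `cudaMemcpyAsync` needs pinned memory to actually run asynchronously.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel for illustration: doubles each element in place.
__global__ void scale(float *data, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= 2.0f;
}

int main() {
    const int nStreams = 2;            // assumed split, not from the thread
    const int chunk = 1 << 20;         // elements per stream (assumed)
    const int n = nStreams * chunk;
    const size_t chunkBytes = chunk * sizeof(float);

    // Ask the device whether it can overlap copies with kernel execution.
    // A value of 0 means the work below is serialized.
    int engines = 0;
    cudaDeviceGetAttribute(&engines, cudaDevAttrAsyncEngineCount, 0);
    printf("async copy engines: %d\n", engines);

    float *h = nullptr, *d = nullptr;
    cudaMallocHost((void **)&h, n * sizeof(float));  // pinned host memory
    cudaMalloc((void **)&d, n * sizeof(float));
    for (int i = 0; i < n; ++i) h[i] = 1.0f;

    cudaStream_t streams[nStreams];
    for (int k = 0; k < nStreams; ++k) cudaStreamCreate(&streams[k]);

    // Each stream copies its own chunk in, runs the kernel on it, and
    // copies it back. Work in different streams is allowed to overlap.
    for (int k = 0; k < nStreams; ++k) {
        float *hp = h + k * chunk;
        float *dp = d + k * chunk;
        cudaMemcpyAsync(dp, hp, chunkBytes, cudaMemcpyHostToDevice, streams[k]);
        scale<<<(chunk + 255) / 256, 256, 0, streams[k]>>>(dp, chunk);
        cudaMemcpyAsync(hp, dp, chunkBytes, cudaMemcpyDeviceToHost, streams[k]);
    }

    cudaDeviceSynchronize();           // wait for all streams to finish
    printf("h[0] = %f, h[n-1] = %f\n", h[0], h[n - 1]);

    for (int k = 0; k < nStreams; ++k) cudaStreamDestroy(streams[k]);
    cudaFree(d);
    cudaFreeHost(h);
    return 0;
}
```

On a device that cannot overlap, the same code still runs and gives the same results; the streams are simply processed one after another.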

captainp · February 19, 2008, 10:09pm

I have been reading about the async options and have one more question. The documentation only mentions asynchronous execution that returns control to the processor in the cu section. Is there a way to do this with the cuda commands that is not listed, or is it only possible in the driver API?

MisterAnderson42 · February 19, 2008, 11:19pm

Kernel executions are always async when using the runtime API.
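A small sketch of what that looks like in practice, assuming a current toolkit: the launch returns to the host right away, the CPU can do its own work, and the host only blocks when it explicitly synchronizes. The `busyKernel` kernel and the iteration counts are hypothetical, chosen only to keep the GPU occupied for a moment. Toolkits from the time of this thread used `cudaThreadSynchronize` where this sketch uses `cudaDeviceSynchronize`.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel for illustration: spends some time in a loop so the
// launch is still running when control returns to the host.
__global__ void busyKernel(float *out, int iters) {
    float x = 0.0f;
    for (int i = 0; i < iters; ++i) x = x * 0.5f + 1.0f;
    out[threadIdx.x] = x;
}

int main() {
    float *d_out = nullptr;
    cudaMalloc((void **)&d_out, 32 * sizeof(float));

    // The launch only queues the kernel; it does not wait for it to finish.
    busyKernel<<<1, 32>>>(d_out, 1 << 22);

    // cudaStreamQuery reports cudaErrorNotReady while queued work is pending.
    cudaError_t status = cudaStreamQuery(0);
    printf("right after launch: %s\n",
           status == cudaErrorNotReady ? "kernel still running"
                                       : "kernel already finished");

    // The host is free to do unrelated work in the meantime.
    double hostSum = 0.0;
    for (int i = 0; i < 1000000; ++i) hostSum += i * 0.5;
    printf("host work done, sum = %.1f\n", hostSum);

    // Block until the kernel has completed, then check for errors.
    cudaDeviceSynchronize();
    printf("kernel status: %s\n", cudaGetErrorString(cudaGetLastError()));

    cudaFree(d_out);
    return 0;
}
```

Whether the first message reports the kernel as still running depends on timing, so treat it as a demonstration rather than a guarantee.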

captainp · February 20, 2008, 12:06am

Oh, I didn’t realize that.

AndreiB · February 20, 2008, 4:33am

And with the driver API too, by the way.
