Running CUDA on multicore system.

serge · September 9, 2007, 12:05pm

Hi all.

I have some questions about running CUDA on multicore system (Intel Core 2 Duo with WinXP in my case). Unfortunately I failed to find smth about this topic in programming guide.

While running several threads on multicore system (i.e. on host)

Can I invoke CUDA methods (such as cudaMemcpy() or kernel invocation) from different host threads? What is result of such invokations?
Should this calls be synchronized (i.e. can there be some problems if I accidentally invoke cudaMemcpy() from different threads simultaneously?)?.
While running CUDA in one thread (this questions relates to one-core systems too) what is result of such invokation:

kernel1<<< Dg, Db, Ns >>>(parameter);

kernel2<<< Dg, Db, Ns >>>(parameter);

Is it guaranteed that kernel1 will e executed after kernel2 (I have some doubts since kernel invokations are declared to be asynchronous)?

Thank you.

seibert · September 9, 2007, 1:37pm

You cannot call CUDA methods from different threads, unless each thread is accessing a different device. If your application has multiple threads, then you will probably want to make a special CUDA service thread, and that thread will be the only one to call CUDA methods.

AndreiB · September 9, 2007, 2:54pm

While running CUDA in one thread (this questions relates to one-core systems too) what is result of such invokation:
kernel1<<< Dg, Db, Ns >>>(parameter);

kernel2<<< Dg, Db, Ns >>>(parameter);
Is it guaranteed that kernel1 will e executed after kernel2 (I have some doubts since kernel invokations are declared to be asynchronous)?

[snapback]248619[/snapback]

Yes, kernel2 will be executed after kernel1.

When you launch kernel2 your host thread will block until kernel1 is finished executing. When kernel1 is done kernel2 will be launched and control will be returned to your thread.

It is like calling cudaThreadSynchronize() before launching second kernel.

BTW, cudaMemcpy() between host and device memory have similar behaviour: it blocks until kernel completes.

serge · September 9, 2007, 5:50pm

i see. thank you

Topic		Replies	Views
Run same CUDA kernel from two different host threads, with different data CUDA Programming and Performance	0	1346	August 4, 2011
CUDA called from multiple threads CUDA Programming and Performance	1	4612	July 18, 2010
Is CUDA thread-safe? CUDA Programming and Performance	3	13011	February 18, 2008
A simple threading question Do memory copies have to occur in the device thread? CUDA Programming and Performance	4	4352	March 26, 2009
cudaMalloc and kernel call do they need to be in the same thread CUDA Programming and Performance	1	11303	November 19, 2008
cudaThreadSynchronize() in MultyThreade Application CUDA Programming and Performance	3	4817	December 17, 2010
Multiple concurrent device processes using multiple concurrent host threads CUDA Programming and Performance	4	3824	January 26, 2009
Behaviour of Multithreaded programs with cudaThreadSynchronize() The semantics of cudaThreadSynchron CUDA Programming and Performance	1	7249	January 9, 2012
Multithreaded call from cpu to gpu CUDA Programming and Performance	1	1383	October 12, 2009
Warning whem using cuda in thread? CUDA Programming and Performance	4	728	March 10, 2017

Running CUDA on multicore system.

Related topics