Cuda 4 inter-GPU synchronization ?

ikidntu · April 4, 2011, 2:43pm

Did anyone successfully test this feature on Cuda4 ? It seems I can’t have it worked correctly, not sure if it’s a bug in my code or the feature is not fully implemented yet.

For example, I want to do a kernel call after copying a memory from 1 gpu to another. Stream 0 is created on gpu0 stream 1 is created on gpu1. Both are Fermi

cudaMemcpyAsync( mem1, mem0, size, cudaMemcpyDefault, stream0 );
cudaEventRecord(P2Pevent, stream0);

cudaStreamWaitEvent(stream1, P2Pevent, 0);
cudaKernel<<<block,thread,0,stream1>>>(mem1);

Sometime it seems the memory is not ready for the kernel yet so the result is incorrect. If I add an cudaDeviceSynchronize on Gpu0 then it works fine.

Thanks

ikidntu · April 5, 2011, 2:04am

any idea ?

hyqneuron · April 5, 2011, 3:53am

Shouldn’t you use cudaMemcpyDeviceToDevice?

Not sure if cudaStreamWaitEvent works for event on a different device

tmurray · April 5, 2011, 4:07am

default should be working. can you post full code? this might be a known bug that is resolved in RC2, but I’d like to find out for sure.

ikidntu · April 5, 2011, 7:54am

I guess it’s a bug in my code, changed something and it works… Thanks guy. This is a cool feature, hopefully there is no bug

hyqneuron · April 5, 2011, 2:40pm

so what did you change to make it work?

Topic		Replies	Views
cudaMemcpyAsync clarification required & help needed CUDA Programming and Performance	0	1751	October 17, 2009
How does cudaMemcpyPeer(Async) work with streams? CUDA Programming and Performance	1	449	September 25, 2023
Unable to run kernel on device 1 with memory in device 2 CUDA Programming and Performance	10	912	January 24, 2017
Streams and CPU CUDA Programming and Performance	1	1029	September 27, 2013
Do i really need to use cudaDeviceSynchronize in this scenario ? CUDA Programming and Performance	2	1020	February 11, 2019
Best way to synchronise two stream from different gpus CUDA Programming and Performance	3	1145	August 22, 2022
Multi-GPU & stream management. CUDA Programming and Performance	2	912	October 12, 2013
Question about streams CUDA Programming and Performance	1	980	August 6, 2009
Syncronization with cuda Streams CUDA Programming and Performance cuda	8	419	October 12, 2021
cudaDeviceSynchronize needed between kernel launch and cudaMemcpy ? CUDA Programming and Performance	15	16267	September 29, 2017

Cuda 4 inter-GPU synchronization ?

Related topics