How is “cudaMemcpyPeer” implemented? Does it copy device1 mem → host mem → device2 mem? If NVLink is present, does this API use NVLink or GPUDirect?
The normal usage of this API is to precede it with a check for peer access support, followed by enabling peer access. See the simpleP2P CUDA sample code for an example.
If peer access has been enabled, then the data flows directly from device1 mem → device2 mem over the fabric (PCIe or NVLink). If NVLink is available, it is used; otherwise the copy goes over PCIe.
In that scenario, the data will not touch CPU/host memory and, depending on the system topology, may not even enter the CPU socket (if there are PCIe switches in the topology).
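The check/enable/copy sequence described above can be sketched as follows. This is a minimal illustration modeled loosely on the simpleP2P sample, not the sample itself; the device IDs 0 and 1, the buffer size, and the omission of full error handling are assumptions. It requires a machine with two P2P-capable GPUs to run.

```cuda
// Minimal P2P copy sketch (assumes devices 0 and 1; error checks abbreviated).
#include <cstdio>
#include <cuda_runtime.h>

int main() {
    // 1. Check whether each device can access the other's memory.
    int canAccess01 = 0, canAccess10 = 0;
    cudaDeviceCanAccessPeer(&canAccess01, 0, 1);
    cudaDeviceCanAccessPeer(&canAccess10, 1, 0);
    if (!canAccess01 || !canAccess10) {
        printf("P2P not supported between devices 0 and 1\n");
        return 0;
    }

    // 2. Enable peer access in both directions.
    cudaSetDevice(0);
    cudaDeviceEnablePeerAccess(1, 0);  // second argument (flags) must be 0
    cudaSetDevice(1);
    cudaDeviceEnablePeerAccess(0, 0);

    // 3. Allocate a buffer on each device.
    const size_t bytes = 1 << 20;
    float *d0 = nullptr, *d1 = nullptr;
    cudaSetDevice(0);
    cudaMalloc(&d0, bytes);
    cudaSetDevice(1);
    cudaMalloc(&d1, bytes);

    // 4. Direct device-to-device copy: with peer access enabled, the
    //    transfer goes over NVLink if available, otherwise PCIe, and
    //    does not stage through host memory.
    cudaMemcpyPeer(d1, 1, d0, 0, bytes);
    cudaDeviceSynchronize();

    cudaFree(d1);
    cudaSetDevice(0);
    cudaFree(d0);
    return 0;
}
```

If the peer-access enablement step is skipped, cudaMemcpyPeer still works, but the runtime may fall back to staging the transfer through host memory.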