What are the proper memory region access flags for GPUDirect RDMA?

chensy20 · May 24, 2023, 7:07am

Hi everyone,

I’m trying to use GPUDirect RDMA to send data from GPU memory to a remote host, bypassing the GPU server’s CPU.

When I register the GPU memory with the RDMA protection domain using an empty access flag set, the send/recv operations all succeed without reporting any error, but the data received on the remote host is all zeros. When I change the GPU-side MR access flag to IBV_ACCESS_LOCAL_WRITE, the remote host receives the correct data.

From my perspective, IBV_ACCESS_LOCAL_WRITE should not be required on the GPU side, because the RDMA HCA only reads the data in that region. What’s the problem here?

System environment: I am using an NVIDIA A100 with CUDA 12.1 on the GPU side and ConnectX-6 InfiniBand cards on both sides.

michaelbe · May 24, 2023, 8:39am

Hi chensy20,
This is not specific to GPU memory versus CPU memory. When you use a local buffer, even without granting remote write access, we still set the LOCAL_WRITE access flag so that data can be written to the buffer locally before it is sent to the remote side.
This is what I see in all the RDMA samples, including those unrelated to GPU.
Best regards,
Michael.

chensy20 · May 24, 2023, 8:59am

Does that mean we still have to set the LOCAL_WRITE flag even if the buffer is only modified by the user and never by RDMA operations?

michaelbe · May 24, 2023, 9:05am

Yes

michaelbe · May 24, 2023, 9:07am

I suggest taking a simple working sample for CPU memory and using it as a reference.
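
A minimal sketch in C of the flag choice discussed in this thread, not taken from any NVIDIA sample. So that it compiles without libibverbs, it uses stand-in constants (the `SKETCH_ACCESS_*` names are hypothetical) that mirror `enum ibv_access_flags` from `<infiniband/verbs.h>`; real code would use the `IBV_ACCESS_*` names directly. The helper names `sketch_access_is_valid` and `sketch_mr_access` are also assumptions made for illustration. The validity check encodes the rule documented for `ibv_reg_mr()`: remote write or remote atomic access requires local write access as well. An empty mask passes that rule, which is consistent with the registration in the original post succeeding without an error.

```c
#include <assert.h>

/*
 * Stand-in constants so this sketch compiles without libibverbs.
 * They mirror enum ibv_access_flags in <infiniband/verbs.h>; real code
 * should include that header and use the IBV_ACCESS_* names directly.
 */
enum sketch_access_flags {
    SKETCH_ACCESS_LOCAL_WRITE   = 1,
    SKETCH_ACCESS_REMOTE_WRITE  = 1 << 1,
    SKETCH_ACCESS_REMOTE_READ   = 1 << 2,
    SKETCH_ACCESS_REMOTE_ATOMIC = 1 << 3
};

/*
 * Returns 1 if the mask satisfies the rule documented for ibv_reg_mr():
 * REMOTE_WRITE or REMOTE_ATOMIC may only be set together with LOCAL_WRITE.
 */
int sketch_access_is_valid(int access)
{
    int needs_local_write =
        access & (SKETCH_ACCESS_REMOTE_WRITE | SKETCH_ACCESS_REMOTE_ATOMIC);
    if (needs_local_write && !(access & SKETCH_ACCESS_LOCAL_WRITE))
        return 0;
    return 1;
}

/*
 * Hypothetical helper: build the access mask for a buffer, always
 * including LOCAL_WRITE as advised in this thread, and adding remote
 * permissions only when the caller asks for them.
 */
int sketch_mr_access(int allow_remote_read, int allow_remote_write)
{
    int access = SKETCH_ACCESS_LOCAL_WRITE;
    if (allow_remote_read)
        access |= SKETCH_ACCESS_REMOTE_READ;
    if (allow_remote_write)
        access |= SKETCH_ACCESS_REMOTE_WRITE;
    assert(sketch_access_is_valid(access));
    return access;
}

/*
 * With libibverbs, the mask is the last argument of the registration call:
 *   struct ibv_mr *mr = ibv_reg_mr(pd, gpu_ptr, length, IBV_ACCESS_LOCAL_WRITE);
 * Here pd, gpu_ptr (for example from cudaMalloc) and length are assumed
 * to have been set up already.
 */
```

For the send-only GPU buffer in this thread, `sketch_mr_access(0, 0)` yields only the local write bit, which corresponds to the registration that delivered correct data.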

chensy20 · May 24, 2023, 9:10am

Ok, thanks!

system · June 7, 2023, 9:10am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.
