GPUDirect RDMA: Difference between ibv_reg_mr and ibv_reg_dmabuf_mr

k.bodi2 · August 7, 2023, 12:23pm

Hey there,

I’m still trying to understand what is the actual difference between these two calls?
We have either

ibv_reg_mr
or
ibv_reg_dmabuf_mr

But how does it impact the data flow between the GPU and NIC? Can you please explain it based on the PCI topology? From my current understanding, the GPU and NIC share a pinned memory block on host memory filled with virtual addresses through which they communicate with each other. Afaik this is the regular implementation for ibv_reg_mr when using GPU memory. But what exactly happens when using ibv_reg_dmabuf_mr?

dmabuf however is not supported by my GPU (RTX A5000) as noted by perftest > ib_write_bw via the following code:

cuDeviceGetAttribute(&is_supported, CU_DEVICE_ATTRIBUTE_DMA_BUF_SUPPORTED, cuDevice)

samerka · August 9, 2023, 7:32am

Hi ,

ibv_reg_mr is memory registration allocated on GPU with nvidia-peermem library usage and ibv_reg_dmabuf_mr is registration with DMAbuff usage

Further information about DMABUF:
https://kernel.org/doc/html/v5.18/userspace-api/media/v4l/dmabuf.html?highlight=dma%20buffer

For RDMA code :

github.com

linux-rdma/rdma-core/blob/master/libibverbs/man/ibv_reg_mr.3

.\" -*- nroff -*-
.\" Licensed under the OpenIB.org BSD license (FreeBSD Variant) - See COPYING.md
.\"
.TH IBV_REG_MR 3 2006-10-31 libibverbs "Libibverbs Programmer's Manual"
.SH "NAME"
ibv_reg_mr, ibv_reg_mr_iova, ibv_reg_dmabuf_mr, ibv_dereg_mr \- register or deregister a memory region (MR)
.SH "SYNOPSIS"
.nf
.B #include <infiniband/verbs.h>
.sp
.BI "struct ibv_mr *ibv_reg_mr(struct ibv_pd " "*pd" ", void " "*addr" ,
.BI "                          size_t " "length" ", int " "access" );
.sp
.BI "struct ibv_mr *ibv_reg_mr_iova(struct ibv_pd " "*pd" ", void " "*addr" ,
.BI "                               size_t " "length" ", uint64_t " "hca_va" ,
.BI "                               int " "access" );
.sp
.BI "struct ibv_mr *ibv_reg_dmabuf_mr(struct ibv_pd " "*pd" ", uint64_t " "offset" ,
.BI "                                 size_t " "length" ", uint64_t " "iova" ,
.BI "                                 int " "fd" ", int " "access" );

This file has been truncated. show original

Thanks,
Samer

k.bodi2 · August 14, 2023, 12:22pm

Thank you very much.

Can you specify what are the pro/cons of dmabuf over dma? What should we prefer to use for GDR for best performance?

k.bodi2 · August 17, 2023, 8:54am

I would appreciate any help or hints on my open question

Topic		Replies	Views
GPUDirect RDMA at the ibverbs level. Software And Drivers iterations , bytes	4	1531	November 30, 2020
Device Memory MR and GPU RDMA Mellanox OFED software-and-drivers , iterations , bytes	1	659	February 13, 2019
Error when trying to write data to GPU DMA memory (using GPU Direct RDMA) Jetson AGX Xavier pcie , kernel , fpga	8	1490	May 30, 2023
GPU Direct RDMA Help CUDA Programming and Performance	4	1415	November 22, 2020
Pinning GPU memory for RDMA failed CUDA Programming and Performance	1	574	April 3, 2022
Unlocking GPU-Accelerated RDMA with NVIDIA DOCA GPUNetIO Technical Blog	5	212	March 27, 2025
GPUDirect RDMA on NVIDIA Jetson AGX Xavier Technical Blog	1	841	June 12, 2019
What's the proper memory region access flags for GPUDirect RDMA? RDMA Software For GPU	6	768	May 24, 2023
Having issues getting host gpu to host gpu RDMA to work CUDA Programming and Performance	2	1840	July 17, 2019
"--use_cuda_dmabuf" is not supported on this GPU RDMA Software For GPU	4	2103	July 31, 2023

GPUDirect RDMA: Difference between ibv_reg_mr and ibv_reg_dmabuf_mr

Related topics