Hello everyone,
I am currently developing on the BlueField-2/3 DPU platform and am trying to implement a specific capability: having the DPU's Arm cores directly initiate read/write access to the memory of a GPU installed in the same host, with the data path bypassing the host CPU entirely.
My goal is to use the DPU Arm as a co-processor that can directly operate on data in GPU memory, aiming to achieve the lowest possible communication latency.
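Concretely, the flow I have in mind looks roughly like the pseudocode below. To be clear, every name here (`export_gpu_memory_for_dma`, `open_dpu_dma_engine`, etc.) is a placeholder of my own, not a real DOCA or CUDA identifier; I'm only trying to illustrate the data path I'm asking about:

```
// On the host: allocate GPU memory and make it reachable over PCIe
gpu_buf    = cudaMalloc(size)
gpu_handle = export_gpu_memory_for_dma(gpu_buf)     // e.g. via GPUDirect / dma-buf?
send_handle_to_dpu(gpu_handle)                      // some out-of-band channel

// On the DPU Arm cores: map that memory and drive the DMA engine directly
remote_mem = import_host_gpu_memory(gpu_handle)
dma_ctx    = open_dpu_dma_engine()                  // presumably DOCA DMA?
dma_read (dma_ctx, local_arm_buf, remote_mem, size) // GPU -> Arm, host CPU not involved
dma_write(dma_ctx, remote_mem, local_arm_buf, size) // Arm -> GPU, host CPU not involved
```

What I can't figure out is which real APIs correspond to each of these placeholder steps, or whether this flow is even the intended way to do it.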
I have already reviewed the following documentation:
- The NVIDIA DOCA DMA and GPUNetIO libraries.
- Technical documents related to GPUDirect.
However, I haven’t been able to find a clear sample program or an authoritative guide on how to initiate direct access to local GPU memory from the DPU’s Arm cores.
Additionally, I came across the paper “Conspirator: SmartNIC-Aided Control Plane for Distributed ML Workloads” (https://www.usenix.org/system/files/atc24-xiao.pdf), which mentions a “SNIC DMA to GPU” technical path. This makes me more confident that direct communication between the DPU and a local GPU over the PCIe bus is theoretically possible.
Therefore, I would like to ask the community and official experts:
- On the BlueField-2/3 platform, is it possible for the DPU Arm cores to perform DMA operations directly on the memory of a GPU on the same host?
- If so, is there any official sample code, tutorial, or detailed documentation that could guide this implementation?
- What key DOCA libraries, APIs, or driver configurations are required to achieve this?