Connectx-6 issue with RoCE v2

user107713 · November 22, 2023, 7:53pm

We have a rping application running on a Xilinx device to send RoCE v2 package to a Connectx-6 card, the Xilinx device is the server and Connectx-6 card is the client. The cnp request is processed no problem, the RC connection is created, but after that the all the RDMA request/response to the Connectx-6 card are not recognized, the received packet counter show the correct number, but the packet disappeared after that, check the hw_counters, find local_ack_timeout_err, slow_restart_cnps, and rp_cnp_handled also have a larger value.
To compare, we also do the self loop back test on the Mellanox card and it works no problem.
Compared the captured pcapng file, we noticed the ECN field in IP head is different, Xilinx device always set it as 01 (ECT(1)), and Connectx-6 card set it 10 (ECT(0)). Is this the reason that the received packet got ignored? If it is, is there a way to solve it?

xiaofengl · November 23, 2023, 1:25am

NVIDIA has it’s own ROCE CC algorithm DCQCN on CX6. And ROCE is EtoE solution, need work on NVIDIA End to End device.

If you want developing CC algorithm you can contact NVIDIA pre-sales support.

user107713 · November 23, 2023, 7:28pm

Hi Xiaofengl,
Thank you so much for your quick response! Do you mean Connectx-6 cannot works other venders RoCE device? Could you please give more information?
Since the original demo is using a Connectx-4 card, do you think we have a chance to avoid this issue with a Connectx-4 card? We just want to validate the 100G data transmitting from Xilinx device.

system · December 7, 2023, 7:28pm

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
having trouble doing rdma connect when using routes for rocee network Mellanox OFED	0	335	September 22, 2015
rocev2 is not working Connect X5 with Mellanox OFED Ethernet Adapter Cards	1	648	May 15, 2017
Hello, I would like to verify the RoCEv2 Congestion Management on my system following the HowTo Configure Resilient RoCE End-to-End Using ConnectX-4 and Spectrum (No QoS) article. However, I cannot find any packets with ECN bit == 11b on my Wireshark. Adapters and Cables understanding-rocev2-congestion-man	1	702	March 12, 2020
ConnectX-4 LX RoCE does not like latency Mellanox OFED	1	412	March 7, 2017
Connectx-5 (WinOF-2) and RoCE v1 Software And Drivers roce	2	694	May 27, 2020
MCX314A-BBCT 40G ConnectX -3 pro failing to change mode to RoCE v2 Ethernet Adapter Cards	1	408	February 14, 2017
RoCEv2 PFC/ECN Issues	2	600	October 3, 2018
RoCE Mode equal to 1.25 InfiniBand/VPI Adapter Cards	0	399	February 19, 2017
RoCE not working on Win 2016 (ConnectX-3 Pro) WinOF Driver disable , configure , remove	7	1267	March 26, 2018
Confusion with Packet and Byte counters for RoCE over ConnectX-3 Mellanox OFED	7	551	May 3, 2013

Connectx-6 issue with RoCE v2

Related topics