Hello,
According to the guide “Understanding mlx5 Linux Counters and Status Parameters” and other related resources, the port_xmit_wait counter measures the number of clock ticks during which the port had data to transmit, but was unable to do so due to insufficient credits or lack of arbitration.
As I understand it, this counter effectively accounts for the amount of time the port was stalled while data was available to send, a useful metric for diagnosing network congestion or flow control issues.
If the duration of a tick were known, it would be possible to convert port_xmit_wait into an actual time (e.g., seconds), which would allow comparison against job runtime and help evaluate whether the value is significant.
My test platform uses:
> lspci | grep InfiniBand
InfiniBand Controller: MT28908 Family [ConnectX-6]
> ibstatus | grep rate
rate: 100 Gb/sec (2X HDR)
However, I have been unable to find any official documentation or method that specifies the tick duration for this adapter. The only source I’ve found so far is ChatGPT, which suggests that 1 tick = 4 ns, based on a presumed 250 MHz internal clock. While this value is mentioned in some sources, particularly for SDR adapters, I have found no reference regarding HDR.
My question:
Is there any official documentation, support article, or query method that defines or reveals the tick duration used for port_xmit_wait on a ConnectX-6 HDR adapter?
This would be extremely helpful for interpreting counter values in a meaningful way.
Thank you in advance for any clarification or guidance!
Best regards,
– G.