Perfquery stopped working.

I tried uninstalling and reinistalling OFED from your CentOS 7.7 repos but it still doesn’t work,

ibv_devinfo

hca_id: mlx5_1

transport: InfiniBand (0)

fw_ver: 16.26.1040

node_guid: 506b:4b03:000c:ef4d

sys_image_guid: 506b:4b03:000c:ef4c

vendor_id: 0x02c9

vendor_part_id: 4119

hw_ver: 0x0

board_id: MT_0000000008

phys_port_cnt: 1

Device ports:

port: 1

state: PORT_INIT (2)

max_mtu: 4096 (5)

active_mtu: 4096 (5)

sm_lid: 1

port_lid: 1

port_lmc: 0x00

link_layer: InfiniBand

perfquery 1

ibwarn: [5756] _do_madrpc: recv failed: Connection timed out

ibwarn: [5756] mad_rpc: _do_madrpc failed; dport (Lid 1)

perfquery: iberror: failed: classportinfo query

How do I fix this

Hello Joshua,

Many thanks for posting your inquiry on the Mellanox Community.

As per man-page of perfquery:

Local port Selection

Multiple port/Multiple CA support: when no IB device or port is specified (see the “local umad parameters” below), the libibumad library selects the port to use by the following criteria:

  1. the first port that is ACTIVE.

  2. if not found, the first port that is UP (physical link up).

If a port and/or CA name is specified, the libibumad library attempts to fulfill the user request, and will fail if it is not possible.

For example:

ibaddr # use the first port (criteria #1 above)

ibaddr -C mthca1 # pick the best port from “mthca1” only.

ibaddr -P 2 # use the second (active/up) port from the first available IB device.

ibaddr -C mthca0 -P 2 # use the specified port only.

In your case, you should use the following syntax:

perfquery -C mlx5_1 -P 1

Example output:

perfquery -C mlx5_1 -P 1

Port counters: Lid 6 port 1 (CapMask: 0x5A00)

PortSelect:…1

CounterSelect:…0x0000

SymbolErrorCounter:…0

LinkErrorRecoveryCounter:…0

LinkDownedCounter:…0

PortRcvErrors:…0

PortRcvRemotePhysicalErrors:…0

PortRcvSwitchRelayErrors:…0

PortXmitDiscards:…0

PortXmitConstraintErrors:…0

PortRcvConstraintErrors:…0

CounterSelect2:…0x00

LocalLinkIntegrityErrors:…0

ExcessiveBufferOverrunErrors:…0

QP1Dropped:…0

VL15Dropped:…0

PortXmitData:…15856737

PortRcvData:…190095349

PortXmitPkts:…220269

PortRcvPkts:…1866046

PortXmitWait:…0

Hope this will resolve your issue.

Many thanks,

~Mellanox Technical Support