Technical inquiries regarding Mellanox network cards and switches

hi
I am using MCX555A-ECAT and NVIDIA Mellanox SB7800.

Case1) What does the log below mean?

Jan 1 00:00:19 switch-db6306 health[3523]: [Health-ERR]: Health-Report: Power Supply 2/1 is unresponsive Aug 11 01:09:38 switch-db6306 rh19254 [web-ERR]: lib_ui_api_get_system_type(), lib_ui_api_utils.c:6 34, build 1: Error code 14000 (generic error) returned
Aug 11 01:09:38 switch-db6306 rh19254 [web-ERR]: Could not get system type, unsing[68], err[14000]
Aug 11 01:09:38 switch-db6306 rh19254 [web-ERR]: lib_ui_api_get_first_supported_chip type(), lib_ui_api_utils.c:691, build 1: Error code 14000 (generic error) returned
Aug 11 01:09:38 switch-db6306 rh19254 [web-ERR]: wcf_map_profile(), web_mtx_commands:c:1852, build 1: Er ror code 14000 (generic error) returned
Aug 11 01:09:38 switch-db6306 rh19254 [web-ERR]: web_include_template(), web_template.c:375, build 1:
Aug 11 01:09:38 switch-db6306 rh19254 [web-ERR]: Error in template "get-ports-info-system “at line 34 of the generated TCL code
Aug 11 01:09:38 switch-db6306 rh19254 [web-ERR]: web_render_template(), web_template.c:226, build 1: Errpr code 14002 (assertion failed) returned
Aug 11 01:09:38 switch-db6306 rh19254 [web-ERR]: main() , rh_main.c:337, build 1: Error code 14002 (assertion failed) returned
Aug 11 01:09:38 switch-db6306 rh19254 [web-ERR]: Request handler failed with error code 14002: assertion failed
Aug 11 05:08:26 switch-db6306 rh19254 [web-ERR]: ts_put_str_frag(), tstring.c 1679, build 1: Bail forced with error code 14011
Aug 11 05:08:26 switch-db6306 rh19254 [web-ERR]: ts_append_str_frag(), tstring.c:1381, build 1: Error code 14011 (buffer space exhausted)
Aug 11 :08:26 switch-db6306 rh19254 [web-ERR]: wlog_get_log_lines(), web_logging.c:291, build 1: Error code 14011
Aug 11 05:08:26 switch-db6306 rh19254 [web-ERR]: wlog_get_log_lines_cmd(), web_logging.c:384, build 1: Error code 14011
Aug 11 05:08:26 switch-db6306 rh19254 [web-ERR]: web_include_template(), web_template.c:375, build 1:
Aug 11 05:08:26 switch-db6306 rh19254 [web-ERR]: error in template " get-logs-duimp-file” aty line 52 of the generated TCL code
Aug 11 05:08:26 switch-db6306 rh19254 [web-ERR]: web_include_template(), web_template.c:226, build 1: Error code 14002
Aug 11 05:08:26 switch-db6306 rh19254 [web-ERR]: main(), rh_main.c:337, build 1: Error code 14002 (assertion failed) returned
Aug 11 05:08:26 switch-db6306 rh19254 [web-ERR]: Request handler failed with error code 14002: assertion failed

Case2) Intermittent data transmission/reception interruption occurs, and DCQCN-related Windows event logs occur at that time

Is there a correlation between the occurrence of DCQCN logs and data interruption?

Case3) Inquiry regarding Windows event log
Log)
ConnectX-5 firmware version 16.32.1010 is below the minimum FW version recommended this driver. Minimum recommended Firmware for this driver : 16.35.3502. It is recommended to upgrade the FW, for more details, please refer to WinOF-2 User Manual.

  1. What error does this message mean?

  2. Will the event log issue be improved by updating the firmware (16.32.1010 → 16.35.3502)?

Hi.
Do you know which MLNX-OS version are you use on the switch?
You can check by command: #show version
We suggest you upgrade Switch OS to latest, and upgrade relate HCA Firmware.
Thanks,
Suo

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.