SU540: Chelsio T6 NIC errors cause system shutdown when upgrading from 40G to 100G network switches
- Views:
- 1,381
- Last Updated:
- 2024/3/22 07:03:14
收藏
Summary
[Impact Critical: Possible cluster data outage]
After converting Chelsio T6-based Ethernet ports from 40GbE to 100GbE speeds, a continuous high number of CRC errors are reported due to corrupted Ethernet packets. These errors can potentially lead to a system disruption.
Issue Description
- After converting T6-based Ethernet ports from 40GbE to 100GbE speeds, a continuous high number of CRC errors are reported due to corrupted Ethernet packets.
- Link parameters are not cleared after a 40GbE to 100GbE port conversion, resulting in the generation of malformed packets.
- In some cases, the receipt of these corrupted packets can lead to a system disruption.
- Port speed changes can occur in the following example scenarios:
- 40GbE Cluster switch or Ethernet data switches are replaced with 100GbE models
- Cluster ports are temporarily configured at 40GbE for a storage system upgrade, but the final port speed configuration is 100GbE
Symptom
After replacing 40GbE switches with 100GbE switches, CRC errors and long frames will increment due to malformed Ethernet packets.
cluster1::> system node run -node local -command "ifstat -a"
RECEIVE
Total frames: 292g | Total bytes: 1485t | Total errors: 8746
Total discards: 1276 | Multi/broadcast: 4612k | No buffers: 0
CRC errors: 8564 | Runt frames: 0 | Fragment: 0
Long frames: 182 | Jabber: 0 | Alignment errs: 0
over/underruns: 0 | Xon: 0 | Xoff: 0
Jumbo: 193g
Workaround
Take over and give back (reboot) the affected nodes to correctly reset link parameters on the T6 ports.
Solution
ONTAP 9.14.1, 9.13.1P8,9.12.1P11 and later releases contain new NIC firmware to resolve BUG ID 1570339.
联想凌拓科技有限公司(“Lenovo NetApp”)不对本页面中提供的任何信息或建议的准确性、可靠性或可维护性,或通过使用这些信息或遵守本文中提供的建议可能获得的任何结果,提供任何陈述或保证。本页面中的信息是按原样分发的,使用这些信息或实施本文中的任何建议或技术是客户的责任,取决于客户评估这些信息并将其整合到客户的运营环境中的能力。本页面及其包含的信息只能与本页面中讨论的 NetApp 产品结合使用。在任何情况下,Lenovo NetApp 均不承担因与使用或执行本页面上提供的信息有关的或导致的任何特殊的、间接的或随之而来的任何损失,或者因使用、数据或利润损失(无论是否在合同履行中)、疏忽或其它侵权行为导致的任何损失。
更多最新信息请参考 NetApp 官网支持公告