SU449: [Impact: Critical] SSD (PX02S*) firmware to prevent data loss / unavailability
- Views:
- 1,266
- Last Updated:
- 12/20/2022, 10:24:26 AM
收藏
Summary
[Impact: Critical = Data loss or cluster data outage]
NetApp® has identified that the drive models listed in the table below will fail after 70,000 power-on hours (~8 years of use) if power-cycled.
As a result, NetApp has implemented a drive firmware fix that can be upgraded non-disruptively to mitigate the issue. The updated firmware is available from the E/EF-Series Drive Firmware Download page on the NetApp Support site.
Update to minimum drive firmware MS03 for the affected drive part numbers and identification strings, below:
Part Number | Drive Identifier | Capacity |
E-X4041B-R6 | PX02SMF080 | 800GB |
E-X4043B-R6 | PX02SMF080 | 800GB |
E-X4057A-R6 | PX02SMF040 | 400GB |
E-X4058A-R61 | PX02SMU080 | 800GB |
E-X4059A-R6 | PX02SMB160 | 1.6TB |
E-X4060A-R6 | PX02SMF040 | 400GB |
E-X4060B | PX02SMF040 | 400GB |
E-X4061A-R61 | PX02SMU080 | 800GB |
E-X4062A-R6 | PX02SMB160 | 1.6TB |
1 TCG (Trusted Computer Group – Encrypted)
Issue Description
SSD internal logs are periodically recorded and have an upper limit of 70,000 entries (about eight years), after which logging stops. The SSD continues to operate after reaching the 70,000 limit until power is turned OFF/ON, after which subsequent Read/Write commands return an error and user data is inaccessible.
Symptom
This issue results in drive failure.
In a multiple drive failure scenario, RAID limits may be exceeded, in which case a Volume Group would go Offline (or fail), and the data would not be accessible.
In a single drive failure scenario, a drive will be failed for a Hardware Error Check Condition [04/4C/A8] as reported by the drive. This would result in a degraded volume group.
If this is a power on scenario of the storage array system, 3 drives are required for the storage array to return to an optimal online state. If the three drives are not available, the storage array would enter a lockdown state.
Note: In any event where more drives are impacted than RAID tolerance, immediate engagement with technical support is strongly recommended.
Solution
Update drive firmware as soon as possible.
Additional Information
See Bug #1335350
In accordance with the Support Services terms, always update NetApp products with the latest version of firmware and software to provide the best reliability, availability, and serviceability:
- Download drive firmware from the E/EF- Series Drive and Firmware Matrix.
- Upgrade instructions: Upgrading drive firmware.
- For more information: How to obtain the latest drive firmware for E/EF-Series.
- NetApp official guidance on drives with over 6 years of operating life.
Hot spare drives: To best maintain the continuous presence of hot spare drives available in the system, adhere to Hot Spares Best Practices and follow the standard drive replacement process if a drive fails.
Active IQ System Risk Detection:
For customers who have enabled AutoSupport™ on their storage systems the Active IQ Portal provides detailed System Risk reports at the customer and site and system levels. The reports show systems that have specific risks as well as severity levels and mitigation action plans. Drives that are not running the latest firmware is an example of such a risk. Not upgrading to the most current drive firmware could leave the storage appliance vulnerable to undesirable behavior.
Important: The purpose of this communication is for NetApp to notify its installed base end users about urgent and important product information that may affect product performance or reliability. The information contained herein and the distribution lists are NetApp confidential materials that are subject to restrictions on redistribution and that cannot be shared outside of this e-mail distribution list.
***************************************************
*** NETAPP CONFIDENTIAL – FOR LIMITED USE ONLY ***
***************************************************
联想凌拓科技有限公司(“Lenovo NetApp”)不对本页面中提供的任何信息或建议的准确性、可靠性或可维护性,或通过使用这些信息或遵守本文中提供的建议可能获得的任何结果,提供任何陈述或保证。本页面中的信息是按原样分发的,使用这些信息或实施本文中的任何建议或技术是客户的责任,取决于客户评估这些信息并将其整合到客户的运营环境中的能力。本页面及其包含的信息只能与本页面中讨论的 NetApp 产品结合使用。在任何情况下,Lenovo NetApp 均不承担因与使用或执行本页面上提供的信息有关的或导致的任何特殊的、间接的或随之而来的任何损失,或者因使用、数据或利润损失(无论是否在合同履行中)、疏忽或其它侵权行为导致的任何损失。
更多最新信息请参考 NetApp 官网支持公告