SU509: HDD (HAKPE0) firmware mitigates high failure rate

Views:
696
Last Updated:
6/29/2022, 12:49:59 AM

收藏

Summary

NetApp® has identified that the drive model listed in the table below fail at a higher rate than other drives shipped by NetApp. As a result, NetApp has implemented a drive firmware fix that can be upgraded non-disruptively to mitigate the issue. The updated firmware is available from the Disk Drive Firmware Download page on the NetApp Support site.

Update to minimum drive firmware version for the affected drive part number and identification string, below:

Part Number Drive Identifier Capacity FW
SP-306A-R5/X306A-R5 X306_HAKPE02TSSA 2.0TB NA01
SP-306A-R5/X306A-R5 X306_HAKPE02TSSM 2.0TB NA01
SP-316A-R6/X316A-R6 X316_HAKPE06TA07 6.0TB NA01
SP-336A-R6/X336A-R6 X336_HAKPE04TA07 4.0TB NA02
SP-375A/X375A X375_HAKPE04TA07 4.0TB NA01
SP-477A-R6/X477A-R6 X477_HAKPE04TA07 4.0TB NA01
SP-480A-R6/X480A-R6 X480_HAKPE04TSDB 4.0TB NA01
SP-481A-R6/X481A-R6 X481_HAKPE06TSDB 6.0TB NA01

Issue Description

Overly aggressive internal drive thresholds can result in unnecessary failures.

Symptom

Messages similar to the following might be indicative of the issue(s):

[node1: disk_admin: disk.outOfService:notice]: Drive 0b.04.0 (K4HKS3BB): Predictive Failure PFA (0x01), ASC(0x5d), ASCQ(0x90), FRU(0x90). Power-On Hours: 39216, GList Count: 6, Drive Info: Disk 0b.04.0 Shelf 4 Bay 0 [NETAPP   X477_HAKPE04TA07 NA00] S/N [K4HKS3BB]…

Solution

Update drive firmware per the above Summary.

Additional Information

See Bug #1398852

In accordance with the Support Services terms, always update NetApp products with the latest version of firmware and software to provide the best reliability, availability, and serviceability:

Hot spare drives: To best maintain the continuous presence of hot spare drives available in the system, maintain the minimum recommended number of hot spares, and follow the standard drive replacement process if a drive fails.

Active IQ System Risk Detection:

For customers who have enabled AutoSupport on their storage systems the Active IQ Portal provides detailed System Risk reports at the customer and site and system levels. The reports show systems that have specific risks as well as severity levels and mitigation action plans. Drives that are not running the latest firmware is an example of such a risk. Not upgrading to the most current drive firmware could leave the storage appliance vulnerable to undesirable behavior.

Important: The purpose of this communication is for NetApp to notify its installed base end users about urgent and important product information that may affect product performance or reliability. The information contained herein and the distribution lists are NetApp confidential materials that are subject to restrictions on redistribution and that cannot be shared outside of this e-mail distribution list.

***************************************************
*** NETAPP CONFIDENTIAL – FOR LIMITED USE ONLY ***
***************************************************