Field Notice: FN74064 - UCS X Series Server Intermittently Fails to Boot with UCSX-X10C-RAIDF-D / UCSX-X10C-RAIDF - Software Upgrade Recommended

Available Languages

Updated:December 18, 2023

Document ID:FN74064

Bias-Free Language

The documentation set for this product strives to use bias-free language. For the purposes of this documentation set, bias-free is defined as language that does not imply discrimination based on age, disability, gender, racial identity, ethnic identity, sexual orientation, socioeconomic status, and intersectionality. Exceptions may be present in the documentation due to language that is hardcoded in the user interfaces of the product software, language used based on RFP documentation, or language that is used by a referenced third-party product. Learn more about how Cisco is using Inclusive Language.

Notice

THIS FIELD NOTICE IS PROVIDED ON AN "AS IS" BASIS AND DOES NOT IMPLY ANY KIND OF GUARANTEE OR WARRANTY, INCLUDING THE WARRANTY OF MERCHANTABILITY. YOUR USE OF THE INFORMATION ON THE FIELD NOTICE OR MATERIALS LINKED FROM THE FIELD NOTICE IS AT YOUR OWN RISK. CISCO RESERVES THE RIGHT TO CHANGE OR UPDATE THIS FIELD NOTICE AT ANY TIME.

Products Affected

Affected Product Name	Description	Comments
UCSX-X10C-RAIDF	UCS X10c Compute RAID Controller with LSI 3900 (Front)
UCSX-X10C-RAIDF-D	UCS X10c Compute RAID Controller with LSI 3900 (Front)
UCSX-X10C-RAIDF=	UCS X10c Compute RAID Controller with LSI 3900 (Front)

Defect Information

Defect ID	Headline
CSCwf00940	Intermittently UCS X-series server fails to boot with UCSX-X10C-RAIDF-D / UCSX-X10C-RAIDF

Problem Description

The Cisco UCS X Series node fails to boot with UCSX-X10C-RAIDF-D or UCSX-X10C-RAIDF front mezzanines installed. This issue affects the following Cisco UCS X Series nodes with UCSX-X10C-RAIDF-D/UCSX-X10C-RAIDF if they are running affected older firmware:

UCSX-210C-M6
UCSX-210C-M7
UCSX-410C-M7

Background

The failure can occur if the server has not been powered up for more than 24 hours. The RAID controller that is installed in the X Series server has a Super Capacitor (SuperCap) that maintains power to the board and prevents data loss in the event of a power off. The SuperCap inside UCSX-X10C-RAIDF-D slowly discharges while the server is powered off. This will cause the card power up to take longer than normal and create a fault. The server control circuit will not allow the server to power up with the UCSX-X10C-RAIDF/UCSX-X10C-RAIDF-D fault.

Problem Symptom

Upon initial insertion of an X210 or X410 server into a chassis, or if the server has been powered off for longer than 24 hours, the server may fail to power on with a critical power problem fault.

In the Intersight user interface, there will be a critical alarm similar to the one shown in the following image:

For Cisco UCS Managed servers, a critical F0868 fault like the following is generated:

Severity: CriticalCode: F0868Last Transition Time: 2023-08-24T10:52:10.738ID: 280198Status: NoneDescription: Motherboard of server 1/7 (service profile: ) power: failedAffected Object: sys/chassis-1/blade-7/boardName: Compute Board Power FailCause: Power Problem

The following error may be seen in the System Event Log (SEL):

Platform alert PWR_SEQ_FAIL #0xa5 | Predictive Failure asserted | Asserted

Workaround/Solution

Solution

To fix the issue that is described in this field notice, upgrade to a fixed software release, as shown in the following table:

Cisco UCS X Series Model Number	Affected Release	First Fixed Release for UCS Server Software Bundle	First Fixed Release for Intersight Server Firmware Bundle
X210cM6	5.0(1f) 5.0(2b) 5.0(4a) 5.1(0.230054)	4.3(2b)	5.2(0.230040)
X210cM7	5.1(0.230096) 5.1(1.230052)	4.3(2b)	5.2(0.230041)
X410cM7	5.1(1.230052)	4.3(2b)	5.2(0.230041)

Workaround

As a temporary workaround, power up the servers that failed using one of the following methods:

Reboot the Cisco Integrated Management Controller (CIMC)
Re-seat the server by removing the server, waiting for at least 20 seconds, and then re-inserting the server into the chassis.

Note: If a server that is running affected firmware is powered off for longer than 24 hours, it may once again be affected by this defect.

To reboot the CIMC from the Cisco UCS Manager (UCSM), do the following:

Choose the server that is affected.
Click Recover Server.
Choose Reset CIMC (Server Controller).

To reboot the CIMC from Intersight, do the following:

Click Actions.
Choose System.
Choose Reboot Management Controller.

How to Identify Affected Products

Intersight-Managed Servers

To determine whether affected hardware is present, do the following:

On the affected blade, click Inventory.
Click Storage Controllers.
Choose Controller X (Raid).
Check for UCSX-X10C-RAIDF / UCSX-X10C-RAIDF-D.

To determine whether the hardware is running an affected firmware release, choose the affected server and scroll down on the left side to Firmware Version. Releases earlier than Release 5.2(0.230040) are affected.

UCSM-Managed Servers

To determine whether affected hardware is present, do the following:

Choose the affected server.
Click Inventory.
Click Storage.
Click Controller.
Choose Storage Controller SAS 1.
Check for the model number UCSX-X10C-RAIDF or UCSX-X10C-RAIDF-D.

To determine whether the server is running an affected firmware release, do the following:

Choose the affected server.
Click Installed Firmware.
Check if the CIMC controller is running a release earlier than Release 5.2(0.230040) (package version 4.32b B series bundle for UCSM on blades).

For More Information

The specific power sequencer fault codes for M6 and M7 blades are different. The log locations are also different for Intersight-managed servers and UCS Managed servers. In a tech support bundle, the Power Failure log will have the following error messages:

X210 M6

**************************************************************************************************
UCD90320 Status Flags
**************************************************************************************************
FMEZZ_PWRGD :
TON_MAX Fault

X210 M7 and X410 M7

**************************************************************************************************
UCD90320 Status Flags
**************************************************************************************************
FMEZZ_BLD_CARD_PWRGD :
TON_MAX flag asserted

For an Intersight-managed server, the location of the logfile is:

intersight_ts.tgz\<date>_intersight_ts.tar\mnt\scratchpad\techsupport_pidXXXXX\tech_support-nvram.tar.gz\tech_support-nvram.tar\nv\etc\log\power_sys_logs\Fault_parsed.txt

For a UCSM-managed server, the logfile location is:

BCX_all\CIMCX_TechSupport.tar.gz\CIMCX_TechSupport.tar\mnt\scratchpad\techsupport_pidXXXX\CIMCX_TechSupport-nvram.tar.gz\CIMCX_TechSupport-nvram.tar\nv\etc\log\power_sys_logs\Fault_parsed.txt

Revision History

Version	Description	Section	Date
1.0	Initial Release	—	2023-DEC-25

For More Information

For further assistance or for more information about this field notice, contact the Cisco Technical Assistance Center (TAC) using one of the following methods:

Receive Email Notification About New Field Notices

To receive email updates about Field Notices (reliability and safety issues), Security Advisories (network security issues), and end-of-life announcements for specific Cisco products, set up a profile in My Notifications

Was this Document Helpful?

Feedback

Contact Cisco

Open a Support Case
(Requires a Cisco Service Contract)