THIS FIELD NOTICE IS PROVIDED ON AN "AS IS" BASIS AND DOES NOT IMPLY ANY KIND OF GUARANTEE OR WARRANTY, INCLUDING THE WARRANTY OF MERCHANTABILITY. YOUR USE OF THE INFORMATION ON THE FIELD NOTICE OR MATERIALS LINKED FROM THE FIELD NOTICE IS AT YOUR OWN RISK. CISCO RESERVES THE RIGHT TO CHANGE OR UPDATE THIS FIELD NOTICE AT ANY TIME.
Affected Product Name | Description | Comments |
---|---|---|
UCSX-X10C-RAIDF | UCS X10c Compute RAID Controller with LSI 3900 (Front) | |
UCSX-X10C-RAIDF-D | UCS X10c Compute RAID Controller with LSI 3900 (Front) | |
UCSX-X10C-RAIDF= | UCS X10c Compute RAID Controller with LSI 3900 (Front) |
Defect ID | Headline |
CSCwf00940 | Intermittently UCS X-series server fails to boot with UCSX-X10C-RAIDF-D / UCSX-X10C-RAIDF |
The Cisco UCS X Series node fails to boot with UCSX-X10C-RAIDF-D or UCSX-X10C-RAIDF front mezzanines installed. This issue affects the following Cisco UCS X Series nodes with UCSX-X10C-RAIDF-D/UCSX-X10C-RAIDF if they are running affected older firmware:
The failure can occur if the server has not been powered up for more than 24 hours. The RAID controller that is installed in the X Series server has a Super Capacitor (SuperCap) that maintains power to the board and prevents data loss in the event of a power off. The SuperCap inside UCSX-X10C-RAIDF-D slowly discharges while the server is powered off. This will cause the card power up to take longer than normal and create a fault. The server control circuit will not allow the server to power up with the UCSX-X10C-RAIDF/UCSX-X10C-RAIDF-D fault.
Upon initial insertion of an X210 or X410 server into a chassis, or if the server has been powered off for longer than 24 hours, the server may fail to power on with a critical power problem fault.
In the Intersight user interface, there will be a critical alarm similar to the one shown in the following image:
For Cisco UCS Managed servers, a critical F0868 fault like the following is generated:
Severity: CriticalCode: F0868Last Transition Time: 2023-08-24T10:52:10.738ID: 280198Status: NoneDescription: Motherboard of server 1/7 (service profile: ) power: failedAffected Object: sys/chassis-1/blade-7/boardName: Compute Board Power FailCause: Power Problem
The following error may be seen in the System Event Log (SEL):
Platform alert PWR_SEQ_FAIL #0xa5 | Predictive Failure asserted | Asserted
Solution
To fix the issue that is described in this field notice, upgrade to a fixed software release, as shown in the following table:
Cisco UCS X Series Model Number | Affected Release | First Fixed Release for UCS Server Software Bundle | First Fixed Release for Intersight Server Firmware Bundle |
---|---|---|---|
X210cM6 | 5.0(1f) 5.0(2b) 5.0(4a) 5.1(0.230054) |
4.3(2b) | 5.2(0.230040) |
X210cM7 | 5.1(0.230096) 5.1(1.230052) |
4.3(2b) | 5.2(0.230041) |
X410cM7 | 5.1(1.230052) | 4.3(2b) | 5.2(0.230041) |
Workaround
As a temporary workaround, power up the servers that failed using one of the following methods:
Note: If a server that is running affected firmware is powered off for longer than 24 hours, it may once again be affected by this defect.
To reboot the CIMC from the Cisco UCS Manager (UCSM), do the following:
To reboot the CIMC from Intersight, do the following:
Intersight-Managed Servers
To determine whether affected hardware is present, do the following:
To determine whether the hardware is running an affected firmware release, choose the affected server and scroll down on the left side to Firmware Version. Releases earlier than Release 5.2(0.230040) are affected.
UCSM-Managed Servers
To determine whether affected hardware is present, do the following:
To determine whether the server is running an affected firmware release, do the following:
The specific power sequencer fault codes for M6 and M7 blades are different. The log locations are also different for Intersight-managed servers and UCS Managed servers. In a tech support bundle, the Power Failure log will have the following error messages:
X210 M6
************************************************************************************************** UCD90320 Status Flags ************************************************************************************************** FMEZZ_PWRGD : TON_MAX Fault
X210 M7 and X410 M7
************************************************************************************************** UCD90320 Status Flags ************************************************************************************************** FMEZZ_BLD_CARD_PWRGD : TON_MAX flag asserted
For an Intersight-managed server, the location of the logfile is:
intersight_ts.tgz\<date>_intersight_ts.tar\mnt\scratchpad\techsupport_pidXXXXX\tech_support-nvram.tar.gz\tech_support-nvram.tar\nv\etc\log\power_sys_logs\Fault_parsed.txt
For a UCSM-managed server, the logfile location is:
BCX_all\CIMCX_TechSupport.tar.gz\CIMCX_TechSupport.tar\mnt\scratchpad\techsupport_pidXXXX\CIMCX_TechSupport-nvram.tar.gz\CIMCX_TechSupport-nvram.tar\nv\etc\log\power_sys_logs\Fault_parsed.txt
Version | Description | Section | Date |
1.0 | Initial Release | — | 2023-DEC-25 |
For further assistance or for more information about this field notice, contact the Cisco Technical Assistance Center (TAC) using one of the following methods:
To receive email updates about Field Notices (reliability and safety issues), Security Advisories (network security issues), and end-of-life announcements for specific Cisco products, set up a profile in My Notifications
Unleash the Power of TAC's Virtual Assistance