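This document describes several command-line interface (CLI) commands and other troubleshooting techniques that can help you troubleshoot hard disk drive (HDD) problems. The best tools for troubleshooting HDD problems are the LEDs, the GUI, the BIOS, the LSI Option ROM/MegaRAID GUI, and the logs. However, these options are not always available. When they are not, you can use the CLI.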
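There are no specific requirements for this document.
This document is not restricted to specific software and hardware versions.
The information in this document was created from the devices in a specific lab environment. All of the devices used in this document started with a cleared (default) configuration. If your network is live, make sure that you understand the potential impact of any command.
Refer to Cisco Technical Tips Conventions for more information on document conventions.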
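Note: Some of the commands listed in this document depend on whether you have an LSI MegaRAID controller, because the 1064/1068e LSI controllers do not support all of these commands.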
Enter the show pci-adapter command to view the product name. This example shows an LSI 1064e adapter.
ucs-c2xx-m1 /chassis #show pci-adapter
Slot Vendor ID Device ID SubVendor ID SubDevice ID Product Name
---- --------- --------- ------------ ------------ ------------------------
M    0x1000    0x0056    0x152d       0x896d       Cisco LSI 1064E Mezzan...
Enter the show hdd command to view the status of the HDDs.
ucs-c2xx-m1 /chassis #show hdd
Name                 Status
-------------------- --------------------
HDD_01_STATUS        present
HDD_02_STATUS        absent
HDD_03_STATUS        absent
HDD_04_STATUS        absent
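Enter the show virtual-drive command to view the status of the virtual drives. This command is useful because you do not need to power down the server and enter the BIOS to view the information.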
ucs-c210-m2 /chassis #scope storageadapter SLOT-5
ucs-c210-m2 /chassis/storageadapter #show virtual-drive
Virtual Drive  Status             Name                   Size      RAID Level
-------------- ------------------ ---------------------- --------- ----------
0              Optimal                                   139236 MB RAID 1
1              Degraded                                  974652 MB RAID 5
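When the show virtual-drive output is captured to a file or a terminal log, a short script can pick out the drives that are not Optimal. The following is a minimal Python sketch, not a Cisco tool. The function names are hypothetical, and the row pattern is an assumption based only on the two rows in the example above: a one-word status, an optional name, a size in MB, and a RAID level. Output from other firmware versions may need a different pattern.

```python
import re

# Rows copied from the show virtual-drive example in this document.
SAMPLE = """\
0              Optimal                                   139236 MB RAID 1
1              Degraded                                  974652 MB RAID 5
"""

# Assumed row shape: <id> <one-word status> [name] <size> MB RAID <level>
ROW = re.compile(r"^\s*(\d+)\s+(\S+)\s+(.*?)\s*(\d+) MB\s+RAID (\d+)\s*$")


def parse_virtual_drives(text):
    """Return one dict per virtual-drive row found in text."""
    drives = []
    for line in text.splitlines():
        match = ROW.match(line)
        if match is None:
            continue  # header, separator, or prompt line
        vd_id, status, name, size_mb, raid = match.groups()
        drives.append({
            "id": int(vd_id),
            "status": status,
            "name": name,
            "size_mb": int(size_mb),
            "raid_level": int(raid),
        })
    return drives


def not_optimal(drives):
    """Return the drives whose status is anything other than Optimal."""
    return [d for d in drives if d["status"] != "Optimal"]


for drive in not_optimal(parse_virtual_drives(SAMPLE)):
    print(f"Virtual drive {drive['id']} is {drive['status']}")
```

With the sample rows, the script reports virtual drive 1 as Degraded.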
Enter the show physical-drive command to view the status of the physical drives.
ucs-c210-m2 /chassis/storageadapter #show physical-drive
                                                  Predictive
Slot                                              Failure    Drive    Coerced
Number Controller Status Manufacturer Model       Count      Firmware Size      Type
------ ---------- ------ ------------ ----------- ---------- -------- --------- ----
0      SLOT-5
1      SLOT-5     online SEAGATE      ST9146852SS 0          0005     139236 MB HDD
2      SLOT-5     online SEAGATE      ST9146852SS 0          0005     139236 MB HDD
3      SLOT-5     online SEAGATE      ST9146852SS 0          0005     139236 MB HDD
4      SLOT-5     online SEAGATE      ST9146852SS 0          0005     139236 MB HDD
5      SLOT-5     online SEAGATE      ST9146852SS 0          0005     139236 MB HDD
6      SLOT-5     online SEAGATE      ST9146852SS 0          0005     139236 MB HDD
7      SLOT-5     online SEAGATE      ST9146852SS 0          0005     139236 MB HDD
9      SLOT-5     online SEAGATE      ST9146852SS 0          0005     139236 MB HDD
10     SLOT-5     online SEAGATE      ST9146852SS 0          0005     139236 MB HDD
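A captured show physical-drive listing can be screened in the same way for drives that are not online or that report predictive failures. The following Python sketch is illustrative only. The function names are hypothetical, and the parsing rests on assumptions drawn from the example above: fields are separated by whitespace, the last seven tokens of a full row are manufacturer, model, predictive failure count, firmware, size, the MB unit, and type, and a row that carries only a slot number and controller reports no drive details.

```python
# Rows copied from the show physical-drive example in this document.
SAMPLE = """\
0      SLOT-5
1      SLOT-5     online SEAGATE      ST9146852SS 0          0005     139236 MB HDD
9      SLOT-5     online SEAGATE      ST9146852SS 0          0005     139236 MB HDD
"""


def parse_physical_drives(text):
    """Return one dict per slot row; rows without details get status None."""
    drives = []
    for line in text.splitlines():
        tokens = line.split()
        if not tokens or not tokens[0].isdigit():
            continue  # prompt, header, or separator line
        drive = {"slot": int(tokens[0]), "controller": tokens[1] if len(tokens) > 1 else None,
                 "status": None, "predictive_failures": None}
        if len(tokens) >= 10:
            # Assumption: the last seven tokens are fixed, so any extra
            # tokens in the middle belong to a multi-word status.
            drive.update({
                "status": " ".join(tokens[2:-7]),
                "manufacturer": tokens[-7],
                "model": tokens[-6],
                "predictive_failures": int(tokens[-5]),
                "firmware": tokens[-4],
                "size_mb": int(tokens[-3]),
                "type": tokens[-1],
            })
        drives.append(drive)
    return drives


def needs_attention(drives):
    """Return drives with no details, a non-online status, or predictive failures."""
    return [d for d in drives
            if d["status"] != "online" or (d["predictive_failures"] or 0) > 0]


for drive in needs_attention(parse_physical_drives(SAMPLE)):
    print(f"Slot {drive['slot']}: status={drive['status']}")
```

With the sample rows, only slot 0 is flagged, because the listing shows no status for it. Whether that indicates an empty bay or a missing drive has to be confirmed on the server itself.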
Enter the show error-counters command to view the number of correctable and uncorrectable errors.
ucs-c210-m2 /chassis/storageadapter #show error-counters
PCI Slot SLOT-5:
    Memory Correctable Errors: 0
    Memory Uncorrectable Errors: 0
Enter the show hw-config command to view the RAID controller configuration.
ucs-c210-m2 /chassis/storageadapter #show hw-config
PCI Slot SLOT-5:
    SAS Address 0: 500e004aaaaaaa3f
    SAS Address 1: 0000000000000000
    SAS Address 2: 0000000000000000
    SAS Address 3: 0000000000000000
    SAS Address 4: 0000000000000000
    SAS Address 5: 0000000000000000
    SAS Address 6: 0000000000000000
    SAS Address 7: 0000000000000000
    BBU Present: true
    NVRAM Present: true
    Serial Debugger Present: true
    Memory Present: true
    Flash Present: true
    Memory Size: 512 MB
    Cache Memory Size: 394 MB
    Number of Backend Ports: 8
Enter the show physical-drive-count command to view the number of HDDs.
ucs-c210-m2 /chassis/storageadapter #show physical-drive-count
PCI Slot SLOT-5:
    Physical Drive Count: 9
    Critical Physical Drive Count: 0
    Failed Physical Drive Count: 0
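If you cannot access the CLI, you can review the tech-support file (/tmp/tech_support) for information about the HDD status. This excerpt from the tech-support file shows the HDDs as reported by the Intelligent Platform Management Interface (IPMI) sensors: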
Querying All IPMI Sensors:
Sensor Name | Reading | Unit | Status | LNR | LC | LNC | UNC | UC | UNR
HDD0_INFO | 0x0 | discrete | 0x2181 | na | na | na | na | na | na
HDD1_INFO | 0x0 | discrete | 0x2181 | na | na | na | na | na | na
HDD2_INFO | 0x0 | discrete | 0x2181 | na | na | na | na | na | na
HDD3_INFO | 0x0 | discrete | 0x2181 | na | na | na | na | na | na
HDD4_INFO | 0x0 | discrete | 0x2181 | na | na | na | na | na | na
HDD5_INFO | 0x0 | discrete | 0x2181 | na | na | na | na | na | na
HDD6_INFO | na | discrete | na | na | na | na | na | na | na
HDD7_INFO | na | discrete | na | na | na | na | na | na | na
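The Status column is the fourth pipe-separated field of each sensor row, so it can be extracted from the tech-support file with a few lines of script. The following Python sketch assumes the row layout shown in the excerpt above. The function name and the HDD prefix filter are hypothetical choices made for illustration.

```python
# Rows copied from the IPMI sensor excerpt in this document.
SAMPLE = """\
Sensor Name | Reading | Unit | Status | LNR | LC | LNC | UNC | UC | UNR
HDD0_INFO | 0x0 | discrete | 0x2181 | na | na | na | na | na | na
HDD6_INFO | na | discrete | na | na | na | na | na | na | na
"""


def parse_sensor_status(text, prefix="HDD"):
    """Map each sensor whose name starts with prefix to its status code.

    A status of "na" is returned as None; other values are read as hex.
    """
    statuses = {}
    for line in text.splitlines():
        fields = [field.strip() for field in line.split("|")]
        if len(fields) < 4 or not fields[0].startswith(prefix):
            continue  # header row or a sensor that is not an HDD
        raw = fields[3]
        statuses[fields[0]] = None if raw == "na" else int(raw, 16)
    return statuses


for name, status in parse_sensor_status(SAMPLE).items():
    shown = "na" if status is None else hex(status)
    print(f"{name}: {shown}")
```

In the excerpt, HDD0 through HDD5 report a status code of 0x2181, while HDD6 and HDD7 report na.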
This excerpt from the tech-support file shows how the HDD status code is broken down:
Bit[15:10] - Unused
Bit[9:8] - Fault
Bit[7:4] - LED Color
Bit[3:0] - LED State
Fault:
    0x100 - On Line
    0x200 - Degraded
LED Color:
    0x10 - GREEN
    0x20 - AMBER
    0x40 - BLUE
    0x80 - RED
LED State:
    0x01 - OFF
    0x02 - ON
    0x04 - FAST BLINK
    0x08 - SLOW BLINK
Here is the HDD status from the tech-support file excerpt, decoded for a status code of 0x2181:
0x2181
Fault: 0x100 --- HDD is On Line
LED Color: 0x80 --- RED
LED State: 0x01 --- OFF
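The decoding above is a set of bit masks: bits 9:8 hold the fault value, bits 7:4 the LED color, and bits 3:0 the LED state, while bits 15:10 are unused and ignored. The following Python sketch applies those masks using the value tables from the tech-support excerpt. The function name and the handling of values that are not in the tables are assumptions made for illustration.

```python
# Value tables taken from the tech-support excerpt in this document.
FAULT = {0x100: "On Line", 0x200: "Degraded"}
LED_COLOR = {0x10: "GREEN", 0x20: "AMBER", 0x40: "BLUE", 0x80: "RED"}
LED_STATE = {0x01: "OFF", 0x02: "ON", 0x04: "FAST BLINK", 0x08: "SLOW BLINK"}


def decode_hdd_status(code):
    """Split a 16-bit HDD status code into fault, LED color, and LED state."""
    fault = code & 0x0300   # Bit[9:8]
    color = code & 0x00F0   # Bit[7:4]
    state = code & 0x000F   # Bit[3:0]
    # Bit[15:10] is unused, so it is deliberately masked out.
    return {
        "fault": FAULT.get(fault, f"unknown (0x{fault:03x})"),
        "led_color": LED_COLOR.get(color, f"unknown (0x{color:02x})"),
        "led_state": LED_STATE.get(state, f"unknown (0x{state:02x})"),
    }


print(decode_hdd_status(0x2181))
# {'fault': 'On Line', 'led_color': 'RED', 'led_state': 'OFF'}
```

For 0x2181, the masks yield 0x100, 0x80, and 0x01, which matches the On Line, RED, and OFF values listed above.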
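You can optionally use a battery backup unit (BBU) in some server deployments. The BBU is an intelligent battery backup unit that protects disk write-cache data on the RAID controller for up to 72 hours during a power loss.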
This example shows how to check the status of the BBU with MegaCli:
bash$ sudo /opt/MegaRAID/MegaCli/MegaCli64 -AdpBbuCmd -a0 -NoLog
Password:
. . .
Battery Replacement required : Yes
. . .
Relative State of Charge: 99 %
Absolute State of charge: 76 %
. . .
Date of Manufacture: 11/08, 2008
Design Capacity: 700 mAh
Design Voltage: 3700 mV
Specification Info: 33
Serial Number: 243
Pack Stat Configuration: 0x6cb0
Manufacture Name: LSI113000G
Device Name: 2970700
Device Chemistry: LION
Battery FRU: N/A
This example shows how to check the status of the BBU with the CLI:
ucs-c200-m2 /chassis/storageadapter #show bbu detail
Controller SLOT-7:
    Battery Type: iBBU
    Battery Present: true
    Voltage: 4.023 V
    Current: 0.000 A
    Charge: 100%
    Charging State: fully charged
    Temperature: 34 degrees C
    Voltage Low: false
    Temperature High: false
    Learn Cycle Requested: false
    Learn Cycle Active: false
    Learn Cycle Failed: false
    Learn Cycle Timeout: false
    I2C Errors Detected: false
    Battery Replacement Required: true
    Remaining Capacity Low: true
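The show bbu detail output is a list of key and value pairs, which makes it straightforward to scan for flags that are set. The following Python sketch is one possible approach, not a Cisco utility. The function names are hypothetical, and the list of flags treated as warnings when true is an assumption based on the field names in the example above.

```python
# Lines copied from the show bbu detail example in this document.
SAMPLE = """\
Controller SLOT-7:
    Battery Type: iBBU
    Battery Present: true
    Charge: 100%
    Voltage Low: false
    Temperature High: false
    Learn Cycle Failed: false
    Learn Cycle Timeout: false
    I2C Errors Detected: false
    Battery Replacement Required: true
    Remaining Capacity Low: true
"""

# Assumption: a value of "true" in any of these fields deserves a closer look.
WARNING_FLAGS = (
    "Voltage Low",
    "Temperature High",
    "Learn Cycle Failed",
    "Learn Cycle Timeout",
    "I2C Errors Detected",
    "Battery Replacement Required",
    "Remaining Capacity Low",
)


def parse_key_values(text):
    """Return a dict of the "Key: value" lines; lines without a value are skipped."""
    values = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep and value.strip():
            values[key.strip()] = value.strip()
    return values


def bbu_warnings(values):
    """Return the warning flags that are reported as true."""
    return [flag for flag in WARNING_FLAGS if values.get(flag) == "true"]


for flag in bbu_warnings(parse_key_values(SAMPLE)):
    print(f"BBU warning: {flag}")
```

For the example output, the sketch reports Battery Replacement Required and Remaining Capacity Low, even though the charge is shown as 100%.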
Revision | Publish Date | Comments |
---|---|---|
1.0 | 07-Dec-2012 | Initial Release |