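RX8640 front panel: CELL1 LED is amber
The CELL1 LED on the front panel of my RX8640 is lit amber. Could the experts here help me work out the cause? Thanks!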
[rp8640_2_mp] MP> cm
Enter HE to get a list of available commands
(Use ^B to return to main menu.)
[rp8640_2_mp] MP:CM> ps
Display detailed status of the selected MP bus device.
The following MP bus devices were found:
+---+----+-----+-------+-------+-----------+
| | | | | | |
| | | Sys | | IO | Bulk Pwr |
|Cab| MP |Bkpln| Cells |Chassis| Supplies |
| # |M S| |0 1 2 3| 0 1 |0 1 2 3 4 5|
+---+----+-----+-------+-------+-----------+
| 0 |* *| * |* * * | * * |* * * * * |
+---+----+-----+-------+-------+-----------+
You may display detailed power and hardware status for the following items:
T - Cabinet
S - System Backplane
G - MP (Core I/O)
P - IO Chassis
C - Cell
Select Device: t
HW status for rx8640 cabinet : FAILURE DETECTED
Master/Slave failover is Enabled
Power switch is on
Right Door is closed
Top Door is closed
Left Door is closed
Total Power Available 5000 VA
Total Power Needed 2992 VA
Power Redundancy : redundant
Power Viability : viable
Power Status
---------------+-----+-------+-------+-----------+
| | | | |
| Sys | | IO | Bulk Pwr |
|Bkpln| Cells |Chassis| Supplies |
| |0 1 2 3| 0 1 |0 1 2 3 4 5|
---------------+-----+-------+-------+-----------+
Populated | * |* * * | * * |* * * * * |
Enabled | * |* * * | * * |* * * * * |
Power OK | * |* * * | * * |* * * * * |
Warning/Fault | | * | | |
Attention LED | | | | |
AC Line status:
Line A0 Present
Line B0 Present
Line A1 Present
Line B1 Present
-- Press <CR> to continue, or 'Q' to Quit --
Front Fan Speed : normal
Rear Fan speed : normal
I/O Bay Fan Speed : normal
Temperature state : normal
Main Fan Redundancy : redundant
I/O Fan Redundancy : redundant
Overtemp Shutdown Enabled
| BPS | PCI |
| Fans | Fans |
| 0 1 2 3 4 5 | 0 1 2 3 4 5 |
+-----------+-------------+-------------+
Populated | * * * * * | * * * * * * |
Failing | | |
Failed | | |
| Standby/Main Fans |
| 1 1 1 1 1 1 1 1 1 1 2 |
| 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 |
+-----------+-------------------------------------------+
Populated | * * * * * * * * * * * * * * * * * * * * * |
Failing | |
Failed | |
| Cell Fans |
| CPU C |
| 0 1 2 3 C |
+-----------+------------+
Cell 0 | |
Populated | * * * * * |
Failing | |
Failed | |
Cell 1 | |
Populated | * * * * * |
Failing | |
Failed | |
Cell 2 | |
Populated | * * * |
Failing | |
Failed | |
[rp8640_2_mp] MP:CM> ps
Display detailed status of the selected MP bus device.
The following MP bus devices were found:
+---+----+-----+-------+-------+-----------+
| | | | | | |
| | | Sys | | IO | Bulk Pwr |
|Cab| MP |Bkpln| Cells |Chassis| Supplies |
| # |M S| |0 1 2 3| 0 1 |0 1 2 3 4 5|
+---+----+-----+-------+-------+-----------+
| 0 |* *| * |* * * | * * |* * * * * |
+---+----+-----+-------+-------+-----------+
You may display detailed power and hardware status for the following items:
T - Cabinet
S - System Backplane
G - MP (Core I/O)
P - IO Chassis
C - Cell
Select Device: c
Enter cell number: 0
HW status for Cell 0 : FAILURE DETECTED
Power status : on, CPU 0 MODULE FAULT
Faults above are latched. They are cleared just before the cell is powered on.
Boot is not blocked
PDH memory is shared
Processor Compatibility : OK
RIO cable status : connected
RIO cable connection physical location : PCI Domain 0
Core cell is cell 0
Attention Led is off
PDHC status Leds : ****
CPU Module Slot 0 1 2 3
Populated P P P P
Local 48V Good * * * *
Power Enabled * * * *
Power Good * * * *
(* - True, P - Processor, T - Terminator)
| DIMM Presence |
| |
| Side: | A | B |
|Echelon: |01234567|01234567|
+---------+--------+--------+
| |****** ****** |
1 1 1 1 1 1
VRM's 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5
Present : * * * * * * * * * * * * * * * *
Enabled : * * * * * * * * * * * * * * * *
Pwr Good : * * * * * * * * * * * * * * * *
Front Side Bus Freq. : 200.25 MHz
CPU Core Freq. : 1600 MHz
CPU Part Number : 80543KC
System Boot Rom (SFW) firmware rev 004.080.000
PDH controller (PDHC) firmware rev 022.002.003 built SUN JAN 08 23:26:14 2006
[rp8640_2_mp] MP:CM> ps
Display detailed status of the selected MP bus device.
The following MP bus devices were found:
+---+----+-----+-------+-------+-----------+
| | | | | | |
| | | Sys | | IO | Bulk Pwr |
|Cab| MP |Bkpln| Cells |Chassis| Supplies |
| # |M S| |0 1 2 3| 0 1 |0 1 2 3 4 5|
+---+----+-----+-------+-------+-----------+
| 0 |* *| * |* * * | * * |* * * * * |
+---+----+-----+-------+-------+-----------+
You may display detailed power and hardware status for the following items:
T - Cabinet
S - System Backplane
G - MP (Core I/O)
P - IO Chassis
C - Cell
Select Device: c
Enter cell number: 1
HW status for Cell 1 : FAILURE DETECTED
Power status : on, CPU 0 MODULE FAULT
Faults above are latched. They are cleared just before the cell is powered on.
Boot is not blocked
PDH memory is shared
Processor Compatibility : OK
RIO cable status : connected
RIO cable connection physical location : PCI Domain 1
Core cell is cell 0
Attention Led is off
PDHC status Leds : ****
CPU Module Slot 0 1 2 3
Populated P P P P
Local 48V Good * * * *
Power Enabled * * * *
Power Good * * * *
(* - True, P - Processor, T - Terminator)
| DIMM Presence |
| |
| Side: | A | B |
|Echelon: |01234567|01234567|
+---------+--------+--------+
| |***** ***** |
1 1 1 1 1 1
VRM's 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5
Present : * * * * * * * * * * * * * * * *
Enabled : * * * * * * * * * * * * * * * *
Pwr Good : * * * * * * * * * * * * * * *
Front Side Bus Freq. : 200.25 MHz
CPU Core Freq. : 1600 MHz
CPU Part Number : 80543KC
System Boot Rom (SFW) firmware rev 004.080.000
PDH controller (PDHC) firmware rev 022.002.003 built SUN JAN 08 23:26:14 2006
[rp8640_2_mp] MP:CM> ps
Display detailed status of the selected MP bus device.
The following MP bus devices were found:
+---+----+-----+-------+-------+-----------+
| | | | | | |
| | | Sys | | IO | Bulk Pwr |
|Cab| MP |Bkpln| Cells |Chassis| Supplies |
| # |M S| |0 1 2 3| 0 1 |0 1 2 3 4 5|
+---+----+-----+-------+-------+-----------+
| 0 |* *| * |* * * | * * |* * * * * |
+---+----+-----+-------+-------+-----------+
You may display detailed power and hardware status for the following items:
T - Cabinet
S - System Backplane
G - MP (Core I/O)
P - IO Chassis
C - Cell
Select Device: c
Enter cell number: 2
HW status for Cell 2 : FAILURE DETECTED
Power status : on, CPU 0 MODULE FAULT
Faults above are latched. They are cleared just before the cell is powered on.
Boot is not blocked
PDH memory is shared
Processor Compatibility : OK
RIO cable status : not connected
RIO cable connection physical location : cannot be determined
Core cell is cell 0
Attention Led is off
PDHC status Leds : ****
CPU Module Slot 0 1 2 3
Populated P P
Local 48V Good * *
Power Enabled * *
Power Good * *
(* - True, P - Processor, T - Terminator)
| DIMM Presence |
| |
| Side: | A | B |
|Echelon: |01234567|01234567|
+---------+--------+--------+
| |***** ***** |
1 1 1 1 1 1
VRM's 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5
Present : * * * * * * * * * * * * * * * *
Enabled : * * * * * * * * * * * * * * * *
Pwr Good : * * * * * * * * * * * * * * * *
Front Side Bus Freq. : 200.25 MHz
CPU Core Freq. : 1600 MHz
CPU Part Number : 80543KC
System Boot Rom (SFW) firmware rev 004.080.000
PDH controller (PDHC) firmware rev 022.002.003 built SUN JAN 08 23:26:14 2006
Comments (7)
Thanks for the reminder!
[rp8640_2_mp] MP:VW> sel
Welcome to the SEL (System Event Log) Viewer
The following SEL navigation commands are available:
D: Dump log starting at current block for capture and analysis
F: Display first (oldest) block
L: Display last (newest) block
J: Jump to specified entry and display previous block
+: Display next (forward in time) block
-: Display previous (backward in time) block
<cr>: Repeat previous +/- command
<sp>: Repeat previous +/- command
?: Display help
^B: Exit viewer
The following event format options are available:
K: Keyword
R: Raw hex
T: Text
The following event filter options are available:
A: Alert level
C: Cell
U: Unfiltered
[rp8640_2_mp] MP:VWR (<cr>,<sp>,+,-,?,F,L,J,D,K,R,T,A,C,U,^B) > [rp8640_2_mp] a
Alert Level Filter:
0: Minor Forward Progress
1: Major Forward Progress
2: Informational
3: Warning
5: Critical
7: Fatal
Q: Quit
For example, selecting an alert level threshold of 3
selects all events with alert levels of 3 or higher.
Please select alert level threshold: 3
Switching to alert level 3 filter.
[rp8640_2_mp] MP:VWR (<cr>,<sp>,+,-,?,F,L,J,D,K,R,T,A,C,U,^B) > [rp8640_2_mp]
13222 214bbd35e10216de ff0f066f001f0300 IPMI Type-02 Event
13222 04/08/2010 01:48:17
13218 PDHC 0,1 *3 648013c701e016d6 00ffff01ff04ff43 CELL_VRM_VOLT_FLT
13218 04/08/2010 01:43:42
13195 SFW 0,0,0 *3 698001ee00e016a8 45554e49544e4f43 ERM
13195 04/08/2010 01:39:14
13194 SFW 0,0,0 *5 a380020300e016a6 0000000000000005 PD_ERROR_REACHABLE_SET
13194 04/08/2010 01:39:14
13191 SFW 0,2,0 *3 698001ee20e016a0 45554e49544e4f43 ERM
13191 04/08/2010 01:38:58
13190 SFW 0,2,0 *5 a380020320e0169e 0000000000000005 PD_ERROR_REACHABLE_SET
13190 04/08/2010 01:38:58
13189 HPUX 0,0,0 *3 7f80033900e0169c 00000000000aefff HP-UX_DUMP_STATUS
13189 04/08/2010 01:38:28
13188 HPUX 0,0,0 *3 7f80033700e0169a 00000000000ad700 HP-UX_CRASHDUMP_STARTED
13188 04/08/2010 01:38:28
13187 HPUX 0,0,0 *5 bf80033800e01698 000000000002b000 HP-UX_HEX_FAULT_CODE
13187 04/08/2010 01:38:26
13186 214bbd3392021697 ff0f016f00200300 IPMI Type-02 Event
13186 04/08/2010 01:38:26
13183 SFW 0,0,0 *3 698001ee00e01691 45554e49544e4f43 ERM
13183 04/08/2010 01:34:15
[rp8640_2_mp] MP:VWR (<cr>,<sp>,+,-,?,F,L,J,D,K,R,T,A,C,U,^B) > [rp8640_2_mp]
13182 SFW 0,0,0 *5 a380020300e0168f 0000000000000005 PD_ERROR_REACHABLE_SET
13182 04/08/2010 01:34:15
13179 SFW 0,2,0 *3 698001ee20e01689 45554e49544e4f43 ERM
13179 04/08/2010 01:33:58
13178 SFW 0,2,0 *5 a380020320e01687 0000000000000005 PD_ERROR_REACHABLE_SET
13178 04/08/2010 01:33:58
13177 HPUX 0,0,0 *3 7f80033900e01685 00000000000aefff HP-UX_DUMP_STATUS
13177 04/08/2010 01:33:29
13176 HPUX 0,0,0 *3 7f80033700e01683 00000000000ad700 HP-UX_CRASHDUMP_STARTED
13176 04/08/2010 01:33:29
13175 HPUX 0,0,0 *5 bf80033800e01681 000000000002b000 HP-UX_HEX_FAULT_CODE
13175 04/08/2010 01:33:27
13174 214bbd3267021680 ff0f016f00200300 IPMI Type-02 Event
13174 04/08/2010 01:33:27
13171 SFW 0,0,0 *3 698001ee00e0167a 45554e49544e4f43 ERM
13171 04/08/2010 01:29:16
13170 SFW 0,0,0 *5 a380020300e01678 0000000000000005 PD_ERROR_REACHABLE_SET
13170 04/08/2010 01:29:16
13167 SFW 0,2,0 *3 698001ee20e01672 45554e49544e4f43 ERM
13167 04/08/2010 01:28:59
13166 SFW 0,2,0 *5 a380020320e01670 0000000000000005 PD_ERROR_REACHABLE_SET
13166 04/08/2010 01:28:59
[rp8640_2_mp] MP:VWR (<cr>,<sp>,+,-,?,F,L,J,D,K,R,T,A,C,U,^B) > [rp8640_2_mp]
13165 HPUX 0,0,0 *3 7f80033900e0166e 00000000000aefff HP-UX_DUMP_STATUS
13165 04/08/2010 01:28:30
13164 HPUX 0,0,0 *3 7f80033700e0166c 00000000000ad700 HP-UX_CRASHDUMP_STARTED
13164 04/08/2010 01:28:30
13163 HPUX 0,0,0 *5 bf80033800e0166a 000000000002b000 HP-UX_HEX_FAULT_CODE
13163 04/08/2010 01:28:28
13162 214bbd313c021669 ff0f016f00200300 IPMI Type-02 Event
13162 04/08/2010 01:28:28
13159 SFW 0,0,0 *3 698001ee00e01663 45554e49544e4f43 ERM
13159 04/08/2010 01:24:17
13158 SFW 0,0,0 *5 a380020300e01661 0000000000000005 PD_ERROR_REACHABLE_SET
13158 04/08/2010 01:24:17
13155 SFW 0,2,0 *3 698001ee20e0165b 45554e49544e4f43 ERM
13155 04/08/2010 01:24:01
13154 SFW 0,2,0 *5 a380020320e01659 0000000000000005 PD_ERROR_REACHABLE_SET
13154 04/08/2010 01:24:01
13153 HPUX 0,0,0 *3 7f80033900e01657 00000000000aefff HP-UX_DUMP_STATUS
13153 04/08/2010 01:23:31
13152 HPUX 0,0,0 *3 7f80033700e01655 00000000000ad700 HP-UX_CRASHDUMP_STARTED
13152 04/08/2010 01:23:31
13151 HPUX 0,0,0 *5 bf80033800e01653 000000000002b000 HP-UX_HEX_FAULT_CODE
13151 04/08/2010 01:23:29
[rp8640_2_mp] MP:VWR (<cr>,<sp>,+,-,?,F,L,J,D,K,R,T,A,C,U,^B) > [rp8640_2_mp]
13150 214bbd3011021652 ff0f016f00200300 IPMI Type-02 Event
13150 04/08/2010 01:23:29
13147 SFW 0,0,0 *3 698001ee00e0164c 45554e49544e4f43 ERM
13147 04/08/2010 01:19:18
13146 SFW 0,0,0 *5 a380020300e0164a 0000000000000005 PD_ERROR_REACHABLE_SET
13146 04/08/2010 01:19:18
13143 SFW 0,2,0 *3 698001ee20e01644 45554e49544e4f43 ERM
13143 04/08/2010 01:19:01
13142 SFW 0,2,0 *5 a380020320e01642 0000000000000005 PD_ERROR_REACHABLE_SET
13142 04/08/2010 01:19:01
13141 HPUX 0,0,0 *3 7f80033900e01640 00000000000aefff HP-UX_DUMP_STATUS
13141 04/08/2010 01:18:32
13140 HPUX 0,0,0 *3 7f80033700e0163e 00000000000ad700 HP-UX_CRASHDUMP_STARTED
13140 04/08/2010 01:18:32
13139 HPUX 0,0,0 *5 bf80033800e0163c 000000000002b000 HP-UX_HEX_FAULT_CODE
13139 04/08/2010 01:18:30
13138 214bbd2ee602163b ff0f016f00200300 IPMI Type-02 Event
13138 04/08/2010 01:18:30
13135 SFW 0,0,0 *3 698001ee00e01635 45554e49544e4f43 ERM
13135 04/08/2010 01:14:19
13134 SFW 0,0,0 *5 a380020300e01633 0000000000000005 PD_ERROR_REACHABLE_SET
13134 04/08/2010 01:14:19
[rp8640_2_mp] MP:VWR (<cr>,<sp>,+,-,?,F,L,J,D,K,R,T,A,C,U,^B) > [rp8640_2_mp]
13131 SFW 0,2,0 *3 698001ee20e0162d 45554e49544e4f43 ERM
13131 04/08/2010 01:14:02
13130 SFW 0,2,0 *5 a380020320e0162b 0000000000000005 PD_ERROR_REACHABLE_SET
13130 04/08/2010 01:14:02
13129 HPUX 0,0,0 *3 7f80033900e01629 00000000000aefff HP-UX_DUMP_STATUS
13129 04/08/2010 01:13:33
13128 HPUX 0,0,0 *3 7f80033700e01627 00000000000ad700 HP-UX_CRASHDUMP_STARTED
13128 04/08/2010 01:13:33
13127 HPUX 0,0,0 *5 bf80033800e01625 000000000002b000 HP-UX_HEX_FAULT_CODE
13127 04/08/2010 01:13:31
13126 214bbd2dbb021624 ff0f016f00200300 IPMI Type-02 Event
13126 04/08/2010 01:13:31
13123 SFW 0,0,0 *3 698001ee00e0161e 45554e49544e4f43 ERM
13123 04/08/2010 01:09:19
13122 SFW 0,0,0 *5 a380020300e0161c 0000000000000005 PD_ERROR_REACHABLE_SET
13122 04/08/2010 01:09:19
13119 SFW 0,2,0 *3 698001ee20e01616 45554e49544e4f43 ERM
13119 04/08/2010 01:09:03
13118 SFW 0,2,0 *5 a380020320e01614 0000000000000005 PD_ERROR_REACHABLE_SET
13118 04/08/2010 01:09:03
13117 HPUX 0,0,0 *3 7f80033900e01612 00000000000aefff HP-UX_DUMP_STATUS
13117 04/08/2010 01:08:34
[rp8640_2_mp] MP:VWR (<cr>,<sp>,+,-,?,F,L,J,D,K,R,T,A,C,U,^B) > [rp8640_2_mp]
13116 HPUX 0,0,0 *3 7f80033700e01610 00000000000ad700 HP-UX_CRASHDUMP_STARTED
13116 04/08/2010 01:08:34
13115 HPUX 0,0,0 *5 bf80033800e0160e 000000000002b000 HP-UX_HEX_FAULT_CODE
13115 04/08/2010 01:08:32
13114 214bbd2c9002160d ff0f016f00200300 IPMI Type-02 Event
13114 04/08/2010 01:08:32
13111 SFW 0,0,0 *3 698001ee00e01607 45554e49544e4f43 ERM
13111 04/08/2010 01:04:20
13110 SFW 0,0,0 *5 a380020300e01605 0000000000000005 PD_ERROR_REACHABLE_SET
13110 04/08/2010 01:04:20
13107 SFW 0,2,0 *3 698001ee20e015ff 45554e49544e4f43 ERM
13107 04/08/2010 01:04:04
13106 SFW 0,2,0 *5 a380020320e015fd 0000000000000005 PD_ERROR_REACHABLE_SET
13106 04/08/2010 01:04:04
13105 HPUX 0,0,0 *3 7f80033900e015fb 00000000000aefff HP-UX_DUMP_STATUS
13105 04/08/2010 01:03:35
13104 HPUX 0,0,0 *3 7f80033700e015f9 00000000000ad700 HP-UX_CRASHDUMP_STARTED
13104 04/08/2010 01:03:35
13103 HPUX 0,0,0 *5 bf80033800e015f7 000000000002b000 HP-UX_HEX_FAULT_CODE
13103 04/08/2010 01:03:33
13102 214bbd2b640215f6 ff0f016f00200300 IPMI Type-02 Event
13102 04/08/2010 01:03:32
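To pick the warning-and-above events out of a dump like the one above without paging through it by hand, the text-mode lines can be parsed offline. The following is a minimal sketch, not an HP tool: the line layout is inferred only from this paste, the names `SEL_LINE`, `parse_sel` and `at_or_above` are made up for illustration, and reading the location field `0,1` as cabinet 0, cell 1 is an assumption.

```python
import re
from collections import Counter

# Assumed layout of one event line, inferred from the paste above:
#   <entry> [<source> <location> *<alert>] <hex word> <hex word> <keyword>
# IPMI entries carry no source/location/alert columns, so that group is optional.
SEL_LINE = re.compile(
    r"^(?P<entry>\d+)\s+"
    r"(?:(?P<source>[A-Z]+)\s+(?P<location>\d+(?:,\d+)*)\s+\*(?P<alert>\d)\s+)?"
    r"(?P<data1>[0-9a-f]{16})\s+(?P<data2>[0-9a-f]{16})\s+"
    r"(?P<keyword>.+)$"
)


def parse_sel(text):
    """Return one dict per event line; timestamp and prompt lines are skipped."""
    events = []
    for line in text.splitlines():
        match = SEL_LINE.match(line.strip())
        if match:
            events.append(match.groupdict())
    return events


def at_or_above(events, threshold):
    """Mimic the viewer's alert filter: keep events whose level is >= threshold."""
    return [
        e for e in events
        if e["alert"] is not None and int(e["alert"]) >= threshold
    ]


# A few lines copied from the dump above.
SAMPLE = """\
13222 214bbd35e10216de ff0f066f001f0300 IPMI Type-02 Event
13222 04/08/2010 01:48:17
13218 PDHC 0,1 *3 648013c701e016d6 00ffff01ff04ff43 CELL_VRM_VOLT_FLT
13218 04/08/2010 01:43:42
13194 SFW 0,0,0 *5 a380020300e016a6 0000000000000005 PD_ERROR_REACHABLE_SET
13194 04/08/2010 01:39:14
"""

if __name__ == "__main__":
    events = parse_sel(SAMPLE)
    for e in at_or_above(events, 3):
        print(e["entry"], e["source"], e["location"], e["keyword"])
    # Tally keywords to see which events repeat across the crash loop.
    print(Counter(e["keyword"] for e in events))
```

Run over the full dump, a keyword tally like this makes the single `CELL_VRM_VOLT_FLT` entry reported by PDHC at `0,1` stand out from the repeating crash-dump and `PD_ERROR_REACHABLE_SET` entries.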
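A question for the expert: how did you conclude that VRM 4 on cell 1 has failed? What is the basis?
This post was last edited by liangzigb on 2010-05-14 09:49
All I can tell is that cell 1 may be the one reporting errors. Why don't you post the SEL output? The "CPU 0 MODULE FAULT" message shown for CPU 0 on cell 0 is normal and does not mean CPU 0 is faulty; if it were, your machine would have crashed long ago.
The Pwr Good row for cell 1 shows a problem at VRM 4, but I don't know why every cell reports an error message.
VRM 4 on cell 1 has failed.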
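The Pwr Good check described in the replies above can be sketched in a few lines. This is only an illustration with made-up helper names (`count_marks`, `rows_short_of_present`). Because the pasted output has lost its column alignment, the sketch compares mark counts per row and cannot say which VRM index is missing; that has to be read off the live console, where the columns line up.

```python
def count_marks(row):
    """Count the '*' marks after the colon in a row such as 'Pwr Good : * * *'."""
    _, _, marks = row.partition(":")
    return marks.count("*")


def rows_short_of_present(rows):
    """Return {label: missing} for every row with fewer marks than 'Present'."""
    counts = {}
    for row in rows:
        label, _, _ = row.partition(":")
        counts[label.strip()] = count_marks(row)
    present = counts.get("Present", 0)
    return {label: present - n for label, n in counts.items() if n < present}


# VRM rows for cell 1 as they appear in the paste (whitespace already collapsed).
CELL1_ROWS = [
    "Present : * * * * * * * * * * * * * * * *",
    "Enabled : * * * * * * * * * * * * * * * *",
    "Pwr Good : * * * * * * * * * * * * * * *",
]

if __name__ == "__main__":
    print(rows_short_of_present(CELL1_ROWS))
```

The same rows for cell 0 and cell 2 have equal counts on all three lines, which fits the replies singling out cell 1.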