hp MP底层二进制LOG怎么分析
[uninitialized] MP> sl -nc -e
Log Name Entries % Full Latest Timestamped Entry
---------------------------------------------------------------------------
E - System Event 182 21 % 24 Nov 2010 11:01:56
# Location|Alert| Encoded Field | Data Field | Keyword / Timestamp
-------------------------------------------------------------------------------
2 BMC 2 0x204C6C0DF0020030 FFFF027000120300 Type-02 127002 1208322
18 Aug 2010 16:44:32
3 BMC *3 0x204C6C0DF1020040 23265701D8010300 Type-02 010107 65799
36 BMC *3 0x204CC743CB0202E0 FFFF0A6FFA220300 Type-02 226f0a 2256650
26 Oct 2010 21:10:35
37 BMC *7 0x204CC743D90202F0 00000607D6020300 Type-02 020706 132870
26 Oct 2010 21:10:49
38 BMC 2 0x204CCE9BFD020300 FFFF006F04140300 Type-02 146f00 1339136
01 Nov 2010 10:52:45
39 BMC 2 0x204CCE9C0A020310 FFFF006F04140300 Type-02 146f00 1339136
01 Nov 2010 10:52:58
40 BMC 2 0x204CCE9C18020320 0401A37004120300 Type-02 127003 1208323
01 Nov 2010 10:53:12
41 BMC 2 0x204CCE9C1C020330 FFFF027000120300 Type-02 127002 1208322
01 Nov 2010 10:53:16
42 BMC *3 0x204CCE9C1C020340 23235701D8010300 Type-02 010107 65799
01 Nov 2010 10:53:16
48 BMC 2 0x204CCE9C290203A0 FFFF027000120300 Type-02 127002 1208322
01 Nov 2010 10:53:29
49 BMC *3 0x204CCE9C2A0203B0 23235701D8010300 Type-02 010107 65799
01 Nov 2010 10:53:30
65 BMC 2 0x204CCEC8C90204E0 4000A37040120300 Type-02 127003 1208323
01 Nov 2010 14:03:53
66 BMC *7 0x204CCEC8D80204F0 00000607D6020300 Type-02 020706 132870
[uninitialized] MP:CM> ps
PS
System Power state: On
Temperature : Normal
Power supplies State
-----------------------------------------------------------
Power Supply 1 A/C Disconnected or Out of Range
Power Supply 2 Normal
Fans State
-----------------------------------------------------------
Fan1A (CPU) Normal
Fan1B (CPU) Normal
Fan2 (Memory) Normal
Fan3 (I/O) Normal
CPU0 Fan Normal
CPU1 Fan Normal
一个电源有问题,需要更换,哪日志里面有几项3.7级别的告警,怎么分析什么故障?
能分析出来吗?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(8)
找hp的800,他们有工具帮你分析。
工具叫MCA,提供机型就能确认。
LS详细,收藏一下
hp的解决方案
级别7 的告警 应该是一个电源启动的报警 更换一个电源看能不能解决这个告警,不能可能需要更换主板
级别3的告警 主同硬重启的信息
下面是解决的方法
级别7的报警:
IPMI Event Code: 204CC743D90202F0 00000607D6020300
Timestamp (GMT) = Tue Oct 26 21:10:49 2010
Generator = Baseboard Management Controller
Alert Level = 5 - Critical
Sensor
Number Triplet Data 2 Data 3
------ -------- ------ ------
D6 02:07:06 00 00
Decoding as system type: Long's Peak / Storm Peak
Sensor Number : D6 - Power Good
----- WTEC Note -------------------------------------------
The Power Good signal is a logical wired-OR signal that indicates the power status of the entire system. It can be pulled low by any of the VRM's, power pods or power supplies in the system.
To trouble-shoot this error ...
1. If the system fails constantly, check the power supply LEDs. It may indicate a failure
of one of the two power supplies.
2. Look for adjacent entries in the SEL log that may indicate the specific problem area.
3. Try disconnecting each of the bulk power supplies in turn, and see if their removal
prevents the dc shutdown. Try each supply on it's own in the other slot. If this is a
system with a single power supply, try replacing with another supply.
4. If the system still fails, replace the system board.
5. If the system still fails, replace other hardware components one by one until you can
figure out the root cause.
Suspect FRUs:
One of the power supplies
System board
Power Supply Interface (PSI) Module
For more ideas on trouble-shooting, see ...
http://teams3.sharepoint.hp.com/ ... h_newsletter425.htm
Sensor Type : 02 = Voltage
Event type = Assertion event : Discrete – severity
Keyword = VOLTAGE_NON_RECOVERABLE
Voltage becomes non-recoverable from either critical level or less severe level.
_______________________________________________________________________________
IPMI Event Code: 204CC743CB0202E0 FFFF0A6FFA220300
Timestamp (GMT) = Tue Oct 26 21:10:35 2010
Generator = Baseboard Management Controller
Alert Level = 3 - Warning or non-critical
Sensor
Number Triplet Data 2 Data 3
------ -------- ------ ------
FA 22:6F:0A FF FF
Decoding as system type: Long's Peak / Storm Peak
Sensor Number : FA - ACPI Power State
Sensor Type : 22 = System ACPI Power State
Event type = Assertion event : Sensor-specific assertion event
ACPI state S5 (soft-off) entered by override
Keyword = ACPI_SOFT_OFF_OVERRIDE
The system has been powered OFF with a hard power-down.
HP recommends that when you power down the system you use the normal shutdown procedure to
properly shut down the operating system. Otherwise, data-loss can occur.
Possible Power states
S0 / G0 - working
S1 - sleeping with system h/w & processor context maintained
S2 - sleeping, processor context lost
S3 - sleeping, processor & h/w context lost, memory retained.
S4 - non-volatile sleep / suspend-to disk
S5 / G2 - soft-off
G1 sleeping (S1-S4 state cannot be determined)
G3 / Mechanical Off
_______________________________________________________________________________
IPMI Event Code: 204CCEC8D80204F0 00000607D6020300
Timestamp (GMT) = Mon Nov 01 14:04:08 2010
Generator = Baseboard Management Controller
Alert Level = 5 - Critical
Sensor
Number Triplet Data 2 Data 3
------ -------- ------ ------
D6 02:07:06 00 00
Decoding as system type: Long's Peak / Storm Peak
Sensor Number : D6 - Power Good
----- WTEC Note -------------------------------------------
The Power Good signal is a logical wired-OR signal that indicates the power status of the entire system. It can be pulled low by any of the VRM's, power pods or power supplies in the system.
To trouble-shoot this error ...
1. If the system fails constantly, check the power supply LEDs. It may indicate a failure
of one of the two power supplies.
2. Look for adjacent entries in the SEL log that may indicate the specific problem area.
3. Try disconnecting each of the bulk power supplies in turn, and see if their removal
prevents the dc shutdown. Try each supply on it's own in the other slot. If this is a
system with a single power supply, try replacing with another supply.
4. If the system still fails, replace the system board.
5. If the system still fails, replace other hardware components one by one until you can
figure out the root cause.
Suspect FRUs:
One of the power supplies
System board
Power Supply Interface (PSI) Module
For more ideas on trouble-shooting, see ...
http://teams3.sharepoint.hp.com/ ... h_newsletter425.htm
Sensor Type : 02 = Voltage
Event type = Assertion event : Discrete – severity
Keyword = VOLTAGE_NON_RECOVERABLE
Voltage becomes non-recoverable from either critical level or less severe level.
_______________________________________________________________________________
级别3的报警:
IPMI Event Code: 204CCE9C2A0203B0 23235701D8010300
Timestamp (GMT) = Mon Nov 01 10:53:30 2010
Generator = Baseboard Management Controller
Alert Level = 3 - Warning or non-critical
Sensor
Number Triplet Data 2 Data 3
------ -------- ------ ------
D8 01:01:57 23 23
Decoding as system type: Long's Peak / Storm Peak
Sensor Number : D8 - Ambient Temperature (front panel sensor)
Sensor Type : 01 = Temperature
Event type = Assertion event : Threshold exceeded
Keyword = TEMPERATURE_HIGH_WARNING
The temperature inside the server has gone outside the factory specified range for normal
operation. You should quit applications and power down the server to protect its
hardware from damage.
Data 2 = 35, Reading that triggered event
Data 3 = 35, Trigger threshold value
看看是否有风扇不转或灰尘比较多的情况?
rx2620的型号,报过一次温度过高的报警,
楼上说的用哪个方法,我试过了,显示的就是二进制的文件。没有简单的说明,不知道为什么。
二进制代码分析工具只有HP有?
MP:> sl 回车后先后敲 e -> t -> a -> 3 -> l 你就可以看到3级以上的文本简单说明了,detail的要HP内部的工具了。
如果是HP-UX的话,系统里面的/var/opt/resmon/log/event.log应该也有这3级以上日志的详细说明。
什么型号?温度过高?