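E2900 system panic: can anyone help analyze it, or has anyone seen a similar case?
A Sun E2900 machine panics frequently. Running POST produced the errors below.
POST output: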
Testing IO Boards ...
Copying IO PROM to CPU DRAM
{/N0/SB0/P0/C0} Subtest: IDE Controller Reset Test for aid 0x18
{/N0/SB0/P0/C0} test case DCU: 1fd6da00.001ffe0e
{/N0/SB0/P0/C0} test case L2CR: 00000000.00000e03
{/N0/SB0/P0/C0} test case L3CR: 0007e500.96e71c02
{/N0/SB0/P0/C0} test case cmd646_reset_test: reset failed, busy-bit not set
{/N0/SB0/P0/C0} Subtest: IDE Controller Bus Probe for aid 0x18
{/N0/SB0/P0/C0} test case DCU: 1fd6da00.001ffe0e
{/N0/SB0/P0/C0} test case L2CR: 00000000.00000e03
{/N0/SB0/P0/C0} test case L3CR: 0007e500.96e71c02
{/N0/SB0/P0/C0} test case Probing ide bus0, device0:
{/N0/SB0/P0/C0} test case Device reset failed
Below is the information shown at the LOM prompt:
lom>showboards
Slot Pwr Component Type State Status
---- --- -------------- ----- ------
SSC1 On System Controller V2 Main Passed
/N0/SCC - System Config Card Assigned OK
/N0/BP - Baseplane Assigned Passed
/N0/SIB - Indicator Board Assigned Passed
/N0/SPDB - System Power Distribution Bd. Assigned Passed
/N0/PS0 On A209 Power Supply - OK
/N0/PS1 On A209 Power Supply - OK
/N0/PS2 On A209 Power Supply - OK
/N0/PS3 On A209 Power Supply - OK
/N0/FT0 On Fan Tray Auto Speed Passed
/N0/RP0 On Repeater Board Assigned OK
/N0/RP2 On Repeater Board Assigned OK
/N0/SB0 On CPU Board V3 Active Passed
/N0/SB2 On CPU Board V3 Active Passed
/N0/IB6 On PCI-X I/O Board Active Passed
/N0/MB - Media Bay Assigned Passed
lom>showfault
fault is off
lom>showchs
Component Status
--------------- --------
lom>showalarm
Usage: showalarm 1|2|3|system
showalarm -h
lom>showcomponemt
showcomponemt: Command not found, type "help" for a list of commands.
lom>showcomponent
Component Status Pending POST Description
--------- ------ ------- ---- -----------
/N0/SB0/P0 enabled - pass UltraSPARC-IV+, 1500MHz, 32M ECache
/N0/SB0/P1 enabled - pass UltraSPARC-IV+, 1500MHz, 32M ECache
/N0/SB0/P2 enabled - pass UltraSPARC-IV+, 1500MHz, 32M ECache
/N0/SB0/P3 enabled - pass UltraSPARC-IV+, 1500MHz, 32M ECache
/N0/SB0/P0/B0/L0 enabled - pass 1024M DRAM
/N0/SB0/P0/B0/L2 enabled - pass 1024M DRAM
/N0/SB0/P0/B1/L1 enabled - pass 1024M DRAM
/N0/SB0/P0/B1/L3 enabled - pass 1024M DRAM
/N0/SB0/P1/B0/L0 enabled - pass 1024M DRAM
/N0/SB0/P1/B0/L2 enabled - pass 1024M DRAM
/N0/SB0/P1/B1/L1 enabled - pass 1024M DRAM
/N0/SB0/P1/B1/L3 enabled - pass 1024M DRAM
/N0/SB0/P2/B0/L0 enabled - pass 1024M DRAM
/N0/SB0/P2/B0/L2 enabled - pass 1024M DRAM
/N0/SB0/P2/B1/L1 enabled - pass 1024M DRAM
/N0/SB0/P2/B1/L3 enabled - pass 1024M DRAM
/N0/SB0/P3/B0/L0 enabled - pass 1024M DRAM
/N0/SB0/P3/B0/L2 enabled - pass 1024M DRAM
/N0/SB0/P3/B1/L1 enabled - pass 1024M DRAM
/N0/SB0/P3/B1/L3 enabled - pass 1024M DRAM
/N0/SB2/P0 enabled - pass UltraSPARC-IV+, 1500MHz, 32M ECache
/N0/SB2/P1 enabled - pass UltraSPARC-IV+, 1500MHz, 32M ECache
/N0/SB2/P2 enabled - pass UltraSPARC-IV+, 1500MHz, 32M ECache
/N0/SB2/P3 enabled - pass UltraSPARC-IV+, 1500MHz, 32M ECache
/N0/SB2/P0/B0/L0 enabled - pass 1024M DRAM
/N0/SB2/P0/B0/L2 enabled - pass 1024M DRAM
/N0/SB2/P0/B1/L1 enabled - pass 1024M DRAM
/N0/SB2/P0/B1/L3 enabled - pass 1024M DRAM
/N0/SB2/P1/B0/L0 enabled - pass 1024M DRAM
/N0/SB2/P1/B0/L2 enabled - pass 1024M DRAM
/N0/SB2/P1/B1/L1 enabled - pass 1024M DRAM
/N0/SB2/P1/B1/L3 enabled - pass 1024M DRAM
/N0/SB2/P2/B0/L0 enabled - pass 1024M DRAM
/N0/SB2/P2/B0/L2 enabled - pass 1024M DRAM
/N0/SB2/P2/B1/L1 enabled - pass 1024M DRAM
/N0/SB2/P2/B1/L3 enabled - pass 1024M DRAM
/N0/SB2/P3/B0/L0 enabled - pass 1024M DRAM
/N0/SB2/P3/B0/L2 enabled - pass 1024M DRAM
/N0/SB2/P3/B1/L1 enabled - pass 1024M DRAM
/N0/SB2/P3/B1/L3 enabled - pass 1024M DRAM
/N0/IB6/P0 enabled - pass IO Controller 0
/N0/IB6/P1 enabled - pass IO Controller 1
/N0/IB6/P0/B0 enabled - untest 100/66/33MHz. PCIX/EPCI/PCI Bus
/N0/IB6/P0/B1 enabled - untest 33MHz PCI Bus
/N0/IB6/P1/B0 enabled - untest 100/66/33MHz. PCIX/EPCI/PCI Bus
/N0/IB6/P1/B1 enabled - untest 100/66/33MHz. PCIX/EPCI/PCI Bus
/N0/IB6/P1/B0/C0 enabled - untest 100/66/33MHz. 3.3V Short PCI card
/N0/IB6/P1/B0/C1 enabled - untest 100/66/33MHz. 3.3V Short PCI card
/N0/IB6/P1/B1/C2 enabled - untest 100/66/33MHz. 3.3V Short PCI card
/N0/IB6/P1/B1/C3 enabled - untest 100/66/33MHz. 3.3V Short PCI card
/N0/IB6/P0/B0/C4 enabled - untest 100/66/33MHz. 3.3V Short PCI card
/N0/IB6/P0/B0/C5 enabled - untest 100/66/33MHz. 3.3V Short PCI card
/N0/RP0/ enabled - - Repeater Board Slot 10
/N0/RP2/ enabled - - Repeater Board Slot 12
lom>
Below are the panic entries from the messages files:
root@TDDB1 # grep -i panic messages*
messages:Feb 17 08:43:47 TDDB1 ^Mpanic[cpu2]/thread=30091f80ac0:
messages:Feb 17 08:52:03 TDDB1 lw8: [ID 501878 kern.notice] 2/17/11 8:43:48 AM Active - Panicking^M
messages:Feb 17 08:53:03 TDDB1 savecore: [ID 570001 auth.error] reboot after panic: BAD TRAP: type=31 rp=2a101d875c0 addr=f8 mmu_fsr=0 occurred in module "genunix" due to a NULL pointer dereference
messages:Feb 17 21:32:54 TDDB1 ^Mpanic[cpu522]/thread=3017a362cc0:
messages:Feb 17 21:42:16 TDDB1 lw8: [ID 724561 kern.notice] 2/17/11 9:32:55 PM Active - Panicking^M
messages:Feb 17 21:42:16 TDDB1 lw8: [ID 970353 kern.notice] 2/17/11 9:36:36 PM Active - Panicking^M
messages:Feb 17 21:43:22 TDDB1 savecore: [ID 570001 auth.error] reboot after panic: BAD TRAP: type=31 rp=2a10172f010 addr=800188 mmu_fsr=0 occurred in module "genunix" due to an illegal access to a user address
messages:Feb 18 14:15:09 TDDB1 savecore: [ID 570001 auth.error] reboot after panic: BAD TRAP: type=31 rp=2a10172f010 addr=800188 mmu_fsr=0 occurred in module "genunix" due to an illegal access to a user address
messages:Feb 21 13:26:19 TDDB1 ^Mpanic[cpu9]/thread=301d14b66c0:
messages:Feb 21 13:26:19 TDDB1 ^Mpanic[cpu9]/thread=301d14b66c0:
messages:Feb 21 13:34:44 TDDB1 lw8: [ID 707015 kern.notice] 2/21/11 1:26:20 PM Active - Panicking^M
messages:Feb 21 13:35:45 TDDB1 savecore: [ID 570001 auth.error] reboot after panic: BAD TRAP: type=31 rp=2a1014671d0 addr=4605ba30288 mmu_fsr=0
messages:Feb 21 15:30:12 TDDB1 lw8: [ID 777091 kern.notice] 2/21/11 3:24:26 PM Active - Panicking^M
messages:Feb 21 15:31:17 TDDB1 savecore: [ID 570001 auth.error] reboot after panic: BAD TRAP: type=31 rp=2a1014671d0 addr=4605ba30288 mmu_fsr=0
messages:Feb 21 15:31:44 TDDB1 ^Mpanic[cpu9]/thread=3006f7d88a0:
messages:Feb 21 15:31:44 TDDB1 ^Mpanic[cpu9]/thread=3006f7d88a0:
messages:Feb 21 15:39:02 TDDB1 lw8: [ID 595798 kern.notice] 2/21/11 3:31:45 PM Active - Panicking^M
messages:Feb 21 15:40:01 TDDB1 savecore: [ID 570001 auth.error] reboot after panic: BAD TRAP: type=31 rp=2a1011c6cb0 addr=800188 mmu_fsr=0 occurred in module "genunix" due to an illegal access to a user address
messages:Feb 21 22:56:36 TDDB1 ^Mpanic[cpu515]/thread=300f7e709c0:
messages:Feb 21 22:57:06 TDDB1 ^Mpanic[cpu515]/thread=300f7e709c0:
messages:Feb 21 22:57:06 TDDB1 unix: [ID 715357 kern.notice] panic sync timeout
messages:Feb 21 23:11:18 TDDB1 lw8: [ID 442326 kern.notice] 2/21/11 11:05:35 PM Active - Panicking^M
messages:Feb 21 23:12:17 TDDB1 savecore: [ID 570001 auth.error] reboot after panic: BAD TRAP: type=31 rp=2a101477460 addr=800038 mmu_fsr=0 occurred in module "genunix" due to an illegal access to a user address
messages:Feb 21 23:16:36 TDDB1 ^Mpanic[cpu1]/thread=2a100651d40:
messages:Feb 21 23:25:07 TDDB1 lw8: [ID 131132 kern.notice] 2/21/11 11:16:37 PM Active - Panicking^M
messages:Feb 21 23:26:01 TDDB1 savecore: [ID 570001 auth.error] reboot after panic: kernel heap corruption detected
messages:Feb 22 13:52:16 TDDB1 savecore: [ID 570001 auth.error] reboot after panic: kernel heap corruption detected
Has anyone run into a similar case?
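To see at a glance whether the panics share a cause, the `savecore` lines in the grep output can be tallied by panic reason and by faulting address. The following is only a minimal sketch: the function name `summarize_panics` and the two regular expressions are illustrative assumptions based on the line format shown in this post, not part of any Solaris tool.

```python
import re
from collections import Counter

# Matches savecore lines of the form seen in this post, e.g.
#   ... savecore: [ID 570001 auth.error] reboot after panic: <reason>
PANIC_RE = re.compile(r"savecore: \[ID \d+ [\w.]+\] reboot after panic: (.*)$")
# Extracts the faulting address from a BAD TRAP reason, when one is present.
ADDR_RE = re.compile(r"\baddr=([0-9a-f]+)")


def summarize_panics(lines):
    """Count panic reasons and faulting addresses in syslog lines.

    Per-incident values (rp=..., addr=...) are masked in the reason so
    that panics of the same kind are grouped together.
    """
    reasons = Counter()
    addrs = Counter()
    for line in lines:
        match = PANIC_RE.search(line.rstrip("\r\n"))
        if not match:
            continue  # not a savecore "reboot after panic" line
        reason = match.group(1)
        masked = re.sub(r"\b(rp|addr)=[0-9a-f]+", r"\1=*", reason)
        reasons[masked] += 1
        addr = ADDR_RE.search(reason)
        if addr:
            addrs[addr.group(1)] += 1
    return reasons, addrs
```

It could be run over the log with something like `summarize_panics(open("messages"))` (the file name is an assumption). Note that `savecore` may log the same dump again on a later reboot, so the counts are of log lines, not necessarily of distinct panics.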
Comments (2)
Replace SB0.
Post the complete panic message.