Today, after 2 years of uptime, one of the slots on our BD-12804 was turned off for no apparent reason. We got some strange log messages just before the failure:
06/29/2017 20:06:29.22 MSM-A: Slot-1 FAILED (2) cartmanPollMBReady-594: cartman4 on slot 1 (1 errors):Mailbox Polling Timeour
06/29/2017 20:06:29.22 MSM-A: Slot-1, Error 12: cartmanPollMBReady-594: cartman4 on slot 1 (1 errors):Mailbox Polling Timeou)
06/29/2017 20:06:29.21 MSM-A: cartmanPollMBReady-594: cartman4 on slot 1 (1 errors):Mailbox Polling Timeout(reg 705=87)
After that, all the ports on the slot started to go down:
06/29/2017 20:06:29.64 MSM-A: Port 1:5 link down
06/29/2017 20:06:29.64 MSM-A: Port 1:4 link down
06/29/2017 20:06:29.23 MSM-A: Remove port 1:3 from aggregator
06/29/2017 20:06:29.23 MSM-A: Remove port 1:2 from aggregator
06/29/2017 20:06:29.23 MSM-A: Remove port 1:1 from aggregator
06/29/2017 20:06:29.22 MSM-A: Port 1:3 is Down, remove from aggregator 1:1
06/29/2017 20:06:29.22 MSM-A: Port 1:3 link down
06/29/2017 20:06:29.22 MSM-A: Port 1:2 is Down, remove from aggregator 1:1
06/29/2017 20:06:29.22 MSM-A: Port 1:2 link down
06/29/2017 20:06:29.22 MSM-A: Port 1:1 is Down, remove from aggregator 1:1
06/29/2017 20:06:29.22 MSM-A: Port 1:1 link down
I have not found any references to this problem on the Internet, and the logs look really strange to me. I have not found any mention of PollMBReady or Mailbox Polling Timeout in the documentation. We don't even have any mailboxes in the configuration of the BD-12804.
Our equipment:
Chassis : 804023-00-09 06135-01409 Rev 9.0
Slot-1 : 804032-00-06 06284-00059 Rev 6.0
Slot-5 : 804032-00-06 0721F-00331 Rev 6.0
Slot-6 : 804032-00-06 0720F-00670 Rev 6.0
MSM-A : 804047-00-07 0711F-00084 Rev 7.0 BootROM: 1.0.0.3 IMG: 12.6.2.10
PSUCTRL-1 : 700087-00-07 06105-00862 Rev 7.0 BootROM: 2.13
PSUCTRL-2 : 700087-00-07 06105-00911 Rev 7.0 BootROM: 2.13
PSU-1 : PS 2336 4300-00145 0722K-30342 Rev 10.0
PSU-2 : PS 2336 4300-00137 0502J-03684 Rev 7.0
PSU-3 : PS 2336 4300-00137 0519J-05462 Rev 7.0
Image : ExtremeXOS version 12.6.2.10 v1262b10 by release-manager
on Thu Sep 29 17:48:22 EDT 2011
BootROM : 1.0.0.3
Any ideas? After a restart the chassis works as well as ever, but I am afraid the problem will recur, and I don't understand what was wrong with our Slot-1 (GM-20XTR).