Create Date: Jan 25 2013 4:54AM
I am hoping that someone can shed some light and help me with a problem that I am currently facing in my environment. We recently upgraded our edge switches to Summit X460-48p and deployed it as a stacked solution with each stack consisting of around 4 nodes. Our entire estate including these switches is all running XOS 220.127.116.11.
On these switches and this is not specific to a single stack, I repeatedly see problems with high CPU utilization. Initially (about once or twice) it was the hal and cli processes skyrocketing. However I would probably call these one offs. Whats been killing me is this one process called bcmasync. It continually keeps skyrocketing (70-90%) and has already caused outages wherein a couple of nodes simply rebooted (system watchdog kicking in). A lot of times, the switch stack simply becomes unresponsive. The logs didnt indicate much and Epicenter didnt pickup anything either. As a pre-emptive step, I setup STP based edge port loop prevention, turned off auto polarity on edge ports, set rate limits on broadcast and multicast to rule out other possibilites. Nothing seems to have helped.
I do have a case with Extreme and they weren't initially sure either. I have already provided logs, tech support etc to no avail. They have finally come back asking for a debug to be done which I am planning. At this stage I am not sure whats happening and I almost had another outage yesterday. Any thoughts on this would be really appreciated and sorry for the long email. Tks again.