ELRP messages spiking SNMP process

  • 0
  • 1
  • Question
  • Updated 1 year ago
  • Doesn't Need an Answer
Yesterday I was working at a customer site installing EMC. After getting the switches setup and into EMC with SNMPV3 several of the switches 'went down'.  They were still accessible, so looked and SNMP process was high.  Also noticed ALOT of ELRP loop messages coming into logs from the uplink ports.  Opened a case with GTAC, narrowed down the issue with snmp (elrp was causing the spiked cpu) they tracked down the loop, and snmp process goes down on the affected switches.  My question was why were these non-looped switches affected by the loop, the reason was 'because they are all connected to the core'.   That just doesn't seem to make sense to me.  Loop PROTECTION should be keeping other switches from being affected, however in this case that doesn't happen.  On the switch where the loop was at, ELRP shut down the ports as configured, just would broadcast out 'LOOP' and send messages to other switches.  

So the main question is, is spiking the snmp process on non-looped switches REALLY by design?  If so, why?  I couldn't get an satisfactory response yesterday, so posing the question here.

Thanks.
Photo of Brian Anderson

Brian Anderson

  • 722 Points 500 badge 2x thumb

Posted 1 year ago

  • 0
  • 1
Photo of Patrick Voss

Patrick Voss, Alum

  • 11,594 Points 10k badge 2x thumb
Hello Brian,

Do you have ELRP traps enabled? This could cause SNMP to spike while the loops are being logged. Since the loop was not being fixed it continuously logged a loop and therefore continuously sent a trap.
Photo of Brian Anderson

Brian Anderson

  • 722 Points 500 badge 2x thumb
Yes traps were enabled.  Why do the traps get sent to other switches?  Shouldn't it be sent to Netsight or something configured to receive the traps?  In my opinion, for what it is worth, a loop getting prevented on a switch shouldn't affect other switches.  
Photo of Patrick Voss

Patrick Voss, Alum

  • 11,594 Points 10k badge 2x thumb
Hello Brian,

A loop affects the entire network. It sounds like all the switches were detecting the loop on the uplink but because that port is excluded from the disabled ports list it will only report it. If the log is generated on the switch then it will follow its configuration and send a trap to the server.
Photo of Jeremy

Jeremy, Embassador

  • 9,788 Points 5k badge 2x thumb
What is the output of "show config elrp".

Do you have ELRP configured on all the edge switches as well?  Also, are you using ExtremePolicy?  What version of code are you running?  In 16.x and some 21 and 22.x versions, ELRP does not work correctly with policy enabled.
Photo of Brian Anderson

Brian Anderson

  • 722 Points 500 badge 2x thumb
not using policy.  
# Module elrp configuration.
enable elrp-client
configure elrp-client periodic Default ports all interval 1 log-and-trap disable-port ingress duration 15
configure elrp-client periodic HS-Guest_Wifi ports all interval 1 log-and-trap disable-port ingress duration 15
configure elrp-client periodic HS-Mag ports all interval 1 log-and-trap disable-port ingress duration 15
configure elrp-client periodic HS-Staff_Wifi ports all interval 1 log-and-trap disable-port ingress duration 15
configure elrp-client periodic HS-Student_Wifi ports all interval 1 log-and-trap disable-port ingress duration 15
configure elrp-client periodic HS-Video ports all interval 1 log-and-trap disable-port ingress duration 15
configure elrp-client periodic HS-Voice ports all interval 1 log-and-trap disable-port ingress duration 15
configure elrp-client disable-port exclude 49
configure elrp-client disable-port exclude 50
configure elrp-client disable-port exclude 51
configure elrp-client disable-port exclude 52
(Edited)