
SNMP Error Timeout


Anonymous
Not applicable
I have a customer with two sites: a couple of S8s at the core of one and S4s at the core of the other.

The edge consists of multiple stacks of C5s, with the top switch in each stack being a C5K for 10Gb.

A couple of the many stacks persistently give the following error:

SNMP Contact Lost: No SNMP reply from device 192.168.xxx.xx caused by SNMP Error: Timeout[4098], last uptime was 21 Days 23:14:26.4

When Netsight loses SNMP contact with the switch, you can still ping it and SSH to it, and there seems to be no adverse effect on the switch other than the apparently random loss of polling.

During one brief event I was able to get onto the device and tried to ping the IP address of the Netsight server, which failed! I wasn't able to do any more testing before it came back online.

The uplinks consist of 1 x 10Gb port and 2 x 1Gb ports in a LAG. MSTP is configured with the data VLAN using the 10Gb link and the LAG used for the voice VLAN.

I have looked at spanning tree: there have been no topology changes, the ports have remained continuously up, and they show no errors in the RMON stats.

Has anyone seen this before, and can you provide any suggestions?

Many thanks

13 REPLIES 13

Anonymous
Not applicable
Hi Jason,

Here are some of the ports that have RX pause counts:

port TX Pause Count RX Pause Count
-------- --------------- --------------
ge.1.7 0 720
ge.1.14 0 82
ge.1.25 0 734
ge.1.26 0 734
ge.1.29 0 26838
ge.1.45 0 706
ge.1.46 0 228596

So I'll investigate what these are, especially on port ge.1.46.
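To rank the ports quickly, pasted `show port flowcontrol` output can be sorted by RX pause count. This is only a minimal Python sketch, assuming the three-column layout shown above; the names `parse_pause_counts` and `top_rx` are made up for illustration and are not part of any switch tooling.

```python
import re

# Matches data rows such as "ge.1.46 0 228596":
# port name, TX pause count, RX pause count.
ROW = re.compile(r"^\s*(\S+\.\d+\.\d+)\s+(\d+)\s+(\d+)\s*$")

SAMPLE = """\
port TX Pause Count RX Pause Count
-------- --------------- --------------
ge.1.7 0 720
ge.1.14 0 82
ge.1.25 0 734
ge.1.26 0 734
ge.1.29 0 26838
ge.1.45 0 706
ge.1.46 0 228596
"""


def parse_pause_counts(text):
    """Return {port: (tx_pause, rx_pause)} from pasted CLI output."""
    counts = {}
    for line in text.splitlines():
        m = ROW.match(line)
        if m:  # header and separator lines do not match and are skipped
            counts[m.group(1)] = (int(m.group(2)), int(m.group(3)))
    return counts


def top_rx(text, n=3):
    """Return the n port names with the highest RX pause counts."""
    counts = parse_pause_counts(text)
    return sorted(counts, key=lambda p: counts[p][1], reverse=True)[:n]


if __name__ == "__main__":
    counts = parse_pause_counts(SAMPLE)
    for port in top_rx(SAMPLE):
        print(port, counts[port][1])
```

The counters are cumulative, so a high value only says that the port has received many pause frames at some point since the counters were last reset.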

Out of interest, IGMP snooping has been enabled on the switch and all the user ports, although they don't use multicast. Could that have any bearing?

Thanks

Jason_Parker
Contributor

Please note that this could be as simple as multicast traffic flooding the network; some PCs do not cope well with traffic they are dropping.

Here are a few steps to take:
1. See which ports have high RX pause counts.
2. Check for duplex/speed issues (printers and low-end machines).
3. If the switches are daisy-chained, the traffic may be coming from another switch that has a bad client (blue screen of death).

Good Luck
Jason


Anonymous
Not applicable
Just to let you know, it seems you were right on the money there: the 10Gb link between the core and the edge is experiencing problems:

show port flowcontrol tg.3.2

Port TX Admin TX Oper RX Admin RX Oper TX Pause Count RX Pause Count
------------ -------- -------- -------- -------- -------------- --------------
tg.3.2 enabled enabled enabled enabled 0 4597790

KGH_SDP5-2_US1(su)->show port flowcontrol tg.1.49
port TX Pause Count RX Pause Count
-------- --------------- --------------
tg.1.49 31303 0

Now I just have to work out why.
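Since the pause counters are cumulative, one reading does not show whether the link is still being paused right now. A rough Python sketch of comparing two readings taken some time apart follows; the "before" values are the ones above, while the "after" values and the helper name `pause_deltas` are invented purely for illustration.

```python
def pause_deltas(before, after):
    """Return {port: (tx_increase, rx_increase)} between two readings.

    Both arguments map a port name to (tx_pause_count, rx_pause_count).
    Ports missing from the first reading are treated as starting at zero.
    """
    deltas = {}
    for port, (tx1, rx1) in after.items():
        tx0, rx0 = before.get(port, (0, 0))
        deltas[port] = (tx1 - tx0, rx1 - rx0)
    return deltas


# First reading: values from the output above (tg.3.2 and tg.1.49 are the
# two ends of the 10Gb link, each read on its own switch).
before = {"tg.3.2": (0, 4597790), "tg.1.49": (31303, 0)}

# Second reading: hypothetical numbers, NOT taken from the real switches.
after = {"tg.3.2": (0, 4601790), "tg.1.49": (31350, 0)}

if __name__ == "__main__":
    for port, (tx, rx) in pause_deltas(before, after).items():
        print(f"{port}: TX pause +{tx}, RX pause +{rx}")
```

A non-zero increase between readings suggests pause frames are still being exchanged; a negative value would mean the counters were cleared or the switch restarted in between.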

Many thanks for your help.

Jason_Parker
Contributor
Take a look at the SecureStacks and type
show port flowcontrol
If you are receiving any pause frames, you need to look at the end stations sending them, as there may be an issue.
If you do not have the time to look at this, then I would run the command clear port advertise port# pause
Warning #1: the command will drop the link, renegotiate the options/speed/duplex, and bring the link back up.
Warning #2: if you choose to run the command on the uplink port(s), this would cause a topology change, clearing the FDB (filtering database) of all local switches, so please take note.
Good Luck
Jason
