cancel
Showing results for 
Search instead for 
Did you mean: 

Network disruption VSP8600

Network disruption VSP8600

BRMS
New Contributor II

I’m using 4 VSP8600 in a SPBM-Configuration. Today we experienced a network disruption although no configuration changes were made. The logs are full of these:

************************************************************************************
Command Execution Time: Tue Oct 06 11:26:49 2020 CEST
************************************************************************************
1 2020-10-06T11:24:33.770+02:00 kreuz IO4 - 0x00138537 - 0004e001 DYNAMIC CLEAR GlobalRouter COP-SW INFO VIST peer mac b4:2d:56:9c:7a:11 on VID 2422 is learnt on non-IST MltId-1, Pointing record back to IST port.Total Peer Mac Move Count: 8051
1 2020-10-06T11:24:30.533+02:00 kreuz IO2 - 0x00138537 - 0004e001 DYNAMIC CLEAR GlobalRouter COP-SW INFO VIST peer mac b4:2d:56:9c:7a:36 on VID 426 is learnt on non-IST MltId-1, Pointing record back to IST port.Total Peer Mac Move Count: 16644
1 2020-10-06T11:22:35.621+02:00 kreuz IO3 - 0x00138537 - 0004e001 DYNAMIC CLEAR GlobalRouter COP-SW INFO VIST peer mac b4:2d:56:9c:7a:2b on VID 419 is learnt on non-IST MltId-1, Pointing record back to IST port.Total Peer Mac Move Count: 13707

...

1 2020-10-06T10:42:31.049+02:00 kreuz IO5 - 0x00138537 - 0004e001 DYNAMIC CLEAR GlobalRouter COP-SW INFO VIST peer mac b4:2d:56:9c:7a:28 on VID 205 is learnt on non-IST MltId-1, Pointing record back to IST port.Total Peer Mac Move Count: 9389
1 2020-10-06T10:42:31.035+02:00 kreuz IO6 - 0x00138537 - 0004e001 DYNAMIC CLEAR GlobalRouter COP-SW INFO VIST peer mac b4:2d:56:9c:7a:28 on VID 205 is learnt on non-IST MltId-1, Pointing record back to IST port.Total Peer Mac Move Count: 631

...

1 2020-10-06T10:42:22.788+02:00 kreuz CP1 - 0x00004619 - 01900001 DYNAMIC CLEAR GlobalRouter SNMP INFO Smlt Link Up Trap(SmltId=133)
1 2020-10-06T10:42:22.788+02:00 kreuz CP1 - 0x0000000a - 01900001.133 DYNAMIC CLEAR GlobalRouter SW INFO SMLT 133 Link is UP
1 2020-10-06T10:42:17.262+02:00 kreuz CP1 - 0x0000461a - 01900001 DYNAMIC SET GlobalRouter SNMP INFO Smlt Link Down Trap(SmltId=133)
1 2020-10-06T10:42:17.261+02:00 kreuz CP1 - 0x00000009 - 01900001.133 DYNAMIC SET GlobalRouter SW INFO SMLT 133 Link is DOWN

...

1 2020-10-06T10:04:03.768+02:00 kreuz IO3 - 0x00138537 - 0004e001 DYNAMIC CLEAR GlobalRouter COP-SW INFO VIST peer mac b4:2d:56:9c:7a:2b on VID 419 is learnt on non-IST MltId-1, Pointing record back to IST port.Total Peer Mac Move Count: 12529
1 2020-10-06T10:03:48.600+02:00 kreuz IO4 - 0x00138537 - 0004e001 DYNAMIC CLEAR GlobalRouter COP-SW INFO VIST peer mac b4:2d:56:9c:7a:11 on VID 2422 is learnt on non-IST MltId-1, Pointing record back to IST port.Total Peer Mac Move Count: 7347

What could be the cause of this problem and how can i debug this any further? my log only goes back for ~1hr

1 ACCEPTED SOLUTION

Miguel-Angel_RO
Valued Contributor II

brms,

 

I see the following points to be worked out:

  • change the isis metrics of your isis interfaces: the MLT should have the cost of the interfaces 1/1 or 1/5 divided by 2.
    • If it is 10G links: MLT=100, interfaces 1/1,1/5 = 200
  • I would enable SLPP on all the C-VLANs
  • Could you confirm the value of the i-sid used on the different switches for the vIST?
    • It should be uniq per cluster
  • I would change the subnet to /30 but this shouldn’t cause any issue using a /24
  • avoid the redistribution of the vIST subnets in ISIS/OSPF/other: https://gtacknowledge.extremenetworks.com/pkb_mobile#article/How_To/kA12T0000004QhGSAU/s
  • Ensure that you don’t use the vIST subnet for other purposes than VIST (not as next hop, not as SNMP access, etc)

Mig

View solution in original post

17 REPLIES 17

Miguel-Angel_RO
Valued Contributor II

brms, 

 

just wondering, what’s the output of:

show ip route | include 172.28.7

show vlan i-sid | include 4054

show isis spbm i-sid all | include 4054

Mig

 

BRMS
New Contributor II

the 4 VSPs are located at 2 locations. the 2 VSPs at each location are connected via the mlt 1 directly and via 1/1 and 1/5 to the VSPs at the other location. all 4 uplinks (1/1,1/2,5/1,5/2) are configure as nni-links. there should be no non-NNI-Links between the VSPs directly. a possibility could be a connected exos-switch that hasn’t LACP configured maybe. in that cause there might be an indirect non-NNI-Link?

Based on your data, the NNI link to the peer switch should be MLT-1, but the switch thinks MLT1 is the culprit.

thats what im wondering too, since mlt 1 is an nni-link and should server the vist-vlan, the log entries calling MLT-1 non-IST is strange.

i’m not really sure how i can: “Can you check how the two vist-peers see each other”. any idea?

Roger_Lapuh
Extreme Employee

From what you share something looks wrong. Based on your data, the NNI link to the peer switch should be MLT-1, but the switch thinks MLT1 is the culprit. Can you check how the two vist-peers see each other, I assume there are more NNI links than just MLT-1?

Do you have a parallel non-NNI link between the two nodes connected as well (even without any “looping” VLANs)?

I think you need someone from support to look into this.

Roger

BRMS
New Contributor II

Thanks in advance. I already created a ticket with our partner, unfortunately communication with them is rather slow and fruitless. Here are the outputs of the commands:

show mlt: https://pastebin.com/MG8yP18m

MLT-1 is an MLT between 2 VSPs at one location (pik, kreuz and karo, herz) which are configured as a vist-pair. VLAN 4051 and 4052 are the b-vlans.

show isis interface: https://pastebin.com/cPyu31AX

show isis adj: https://pastebin.com/Aj6WXRPM

show slpp: https://pastebin.com/Y0y51ARF

show spanning-tree config: https://pastebin.com/g3eFKEdH

show spanning-tree mst port role: https://pastebin.com/rQC7AMwS

show virtual-ist: https://pastebin.com/s4gcekmL

show vlan members: https://pastebin.com/4XKhPu3A

show i-sid elan (show i-sid vlan isn’t a valid command): https://pastebin.com/8547Hi6e

as far as i know vlan 4054 doesn’t need to be assigned to the MLT that forms the vIST, as the vlan connectivity is handled by spbm?! Maybe thats the problem?

Roger_Lapuh
Extreme Employee

The switch reports the error on the link it saw the issue on. In order for this error to be reported, the peer MAC (on any VLAN) is seen on a UNI (none vIST NNI) port. I doubt that there is a bug in this regards, as the message is only triggered when an actual peer-MAC move had to be executed by the switch. What is connect on the reported link?

 

Roger

GTM-P2G8KFN