Hi,
I understand you have a problem with one slot in the stack. You need to reboot it from time to time to solve a recurring issue:
- the whole stack (?) hangs and you have no connectivity. You restart it and it's fine.
- sometimes some users on contiguous ports can't obtain an IP address from the DHCP server. You restart it and it's fine.
The stack is only doing L2, you do L3 on a Cisco router.
You already tried to replace the switch in slot 1 (x440) with no improvement.
The master of the stack is the x460, the backup is the x440 in slot 1.
Your first thought of a loop, but after configuring ELRP, you do not see any loop detected.
In your log, I can see you are filling up the IP Multicast table of slot-1. This is typical because EXOS performs by default a L3 lookup for multicast, thus quickly filling the table of the x440. Hopefully, we can easily solve that part of the issue, if this is related, by using the mac-vlan mode. This configuration requires 15.3, and cannot work with IGMPv3, MVR, PVLAN and PIM, because it's using L2 table. If you need either of such protocol, you need to move to mixed-mode, which will give you the benefits of L2 table size for multicast traffic, but the entries using these protocols, which would still use the L3 table.
configure forwarding ipmc lookup-key [group-vlan | source-group-vlan | mac-vlan | mixed-mode]
Do you have an issue only on slot1, or do you have issues on every x440? Is the x460 impacted or not?
Thanks.