Hi Chris,
I understand your question but I don't know why you're seeing this error. There's nothing pointing to single root cause or a course of action that’s sure to fix the problem. At this stage there are probably better plans than a remix of your pools. The idea will be handy to have in the toolbox for later.
With no clear root cause a methodical troubleshoot is typically the next step. Unfortunately troubleshooting method requires trial and error - and while shared experience makes participating in the hub a no-brainer, the tedious back and forth of in depth network troubleshooting isn't always a great mix for this sort of forum.
I recommend the classic start - physical layer. Then statistics and states at L2, then L3, then NAT application stats and tables. There’s more than one way to approach this I’m certain but I don’t know any other way to do the work.
I encourage others in the community to add troubleshooting tips or experiences that might improve odds of a quick resolution.
That said, here are a few items to help the cause:
* specific hardware and firmware; release note review is always of interest
* NAT config. (Cone NAT etc)
* Firewall/dmz location.
* switch/router config.
* physical topology; traffic flow in and out of the NAT.
* > Review L1 stats for high or low frame-counts, errors, flow control, LACP/LAG health, etc. *
> Review L2 topology, stp and fdb stability.
* By default the switch collects 24 hrs rmon history. review traffic spikes and time frames.
This data may also point toward a problem source.
* Event record and a reference to correlate events.
* a gauge of the flow-count (unique sa/da-sip/dip-tcp/udp stream) on a port. If a traffic event such a port scan occurred, the timestamp on the flowlimit stat high-water mark will help. Correlate this with rmon history and log entries.
Best regards,
Mike