Matrix X-Series, firmware 18.104.22.168 through 22.214.171.124
IOMs become unstable at 497 days of uptime. The 'ess' process also crashes at this uptime, causing the IOM to restart.
In release 126.96.36.199, the Linux kernel on the Matrix X was upgraded to version 2.6. Part of the purpose of this upgrade was to resolve an earlier 497-day bug that destabilized the system. Unfortunately, a bug remains at the 497-day mark for the 'ess' process.
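The 497-day figure matches the point at which an unsigned 32-bit counter incremented 100 times per second wraps around. The article does not state the root cause, so the tick rate and counter width below are assumptions used only to illustrate the arithmetic:

```python
# Assumptions (not stated in the article): a 32-bit unsigned tick counter
# incremented at 100 Hz. The names below are illustrative only.
TICK_HZ = 100
COUNTER_BITS = 32
SECONDS_PER_DAY = 86400


def wrap_days(tick_hz: int = TICK_HZ, bits: int = COUNTER_BITS) -> float:
    """Return the number of days until the counter wraps back to zero."""
    seconds_until_wrap = (2 ** bits) / tick_hz
    return seconds_until_wrap / SECONDS_PER_DAY


print(f"{wrap_days():.2f} days")  # prints "497.10 days"
```

Under these assumptions the wrap occurs at roughly 497.1 days, which is consistent with the uptime at which the instability is reported.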
Upgrade to firmware 188.8.131.52 or higher.
Release notes state, in the 'Issues Resolved in release 184.108.40.206' section:
After a system has been up for 497 days the ESS processes on each IOM and the
CM are very likely to terminate abnormally producing syslog messages similar
to the following:
semTake(12000) failed 3997700 !!
This is often immediately preceded by messages indicating an unexpected
spanning tree topology change. This can cause general network instability
while the modules are rebooting.
In lieu of installing this release that contains the fix for this issue the
problem can be worked around by scheduling a controlled system reset before
the system has been up for 497 days.
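For sites that cannot upgrade immediately, the workaround above amounts to tracking each system's boot time and resetting it before the 497-day mark. A minimal sketch of that bookkeeping follows. The 30-day safety margin and the function names are hypothetical, and how the boot time is obtained from the device is deliberately left out:

```python
from datetime import datetime, timedelta

LIMIT_DAYS = 497   # uptime at which the issue is reported to occur
MARGIN_DAYS = 30   # hypothetical safety margin; choose to fit your change window


def reset_deadline(boot_time: datetime, margin_days: int = MARGIN_DAYS) -> datetime:
    """Latest time by which a controlled reset should have been performed."""
    return boot_time + timedelta(days=LIMIT_DAYS - margin_days)


def reset_due(boot_time: datetime, now: datetime,
              margin_days: int = MARGIN_DAYS) -> bool:
    """True once the system has entered the safety margin before 497 days."""
    return now >= reset_deadline(boot_time, margin_days)


if __name__ == "__main__":
    boot = datetime(2020, 1, 1)  # example boot time, not real data
    print("Reset no later than:", reset_deadline(boot).date())
    print("Reset due now?", reset_due(boot, datetime.now()))
```

A check like this could be run periodically against an inventory of boot times so that resets are planned inside a maintenance window rather than forced by the crash.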