We recently upgraded bd8806 from 12.5 to 15.7, unfortunately first attempt to upgrade interrupted and second attempt was successful. After reboot the following error was noticed on the logs:
04/26/2016 09:57:29.77 MSM-B: CPU 0: Kernel thread was stuck for 2.74 seconds, jiffies: 301270004/26/2016 09:57:29.77 MSM-B: CPU 1: Kernel thread was stuck for 2.32 seconds, jiffies: 3012674
04/26/2016 09:57:29.77 MSM-B: CPU 1: soft watchdog expiration warning EPC 8016633c(__rcu_pending+0x0/0x94) at 2 seconds.
04/26/2016 09:57:29.77 MSM-B: CPU 0: soft watchdog expiration warning EPC 80105df0(cpu_idle+0x3c/0x80) at 2 seconds.
since then all my OSPF neighbor has been unstable even after removing MSM-B. After rollback to 12.5 OSPF became stable. Is this bug and I have to fresh-install EXOS on MSM-B or both MSM.
Note: OSPF neigbor is not with Switch. (NodeA[ospf]-----BD8806(L2vlan)------Router[ospf]).
debug on MSM-B
=============================================== MSM-B system dump information
===============================================
core_dump_info storage: 8/3072 used [empty]
failure: process crash
time: Tue Apr 26 01:23:10 2016
process hal
pid 1331
signal 10
$0 : z0=00000000 at=10001f00 v0=00408e3c v1=004043b0
$4 : a0=0041bc7c a1=004003a4 a2=00000399 a3=0040924c
$8 : t0=7fff727c t1=2aac8504 t2=00000080 t3=f0000000
$12: t4=000014c0 t5=ffffffff t6=00000000 t7=7fff7198
$16: s0=00000000 s1=2aad1b04 s2=00000022 s3=a1cfb68c
$20: s4=00000000 s5=00000000 s6=2aad9ef8 s7=050e7db4
$24: t8=2aada2a8 t9=2aab2d9c
$28: gp=2aae2000 sp=7fff7178 s8=2aab8000 ra=2aab3704
Hi : 00000399
Lo : 0000b704
epc : 2aab36d4 Tainted: P
Status: 00001f13
Cause : 00808008
7fff7178: 00000001 2aacd800 7fff7160 2aab7bd4 7fff7308 2aab7e24 2aae2000 2aacd478
7fff7198: 2aad15b8 00000000 00000000 00000001 000012a3 00000000 2aaca000 2c512f4d
7fff71b8: 0041ccbc 2c512ba8 00000000 2aaccd80 7fff71a8 2aab7bd4 2aab2d9c 0040924c
7fff71d8: 7fff727c 2aac8504 7fff732c 2c512f4d 00000000 00000000 00000001 2aad15b8
7fff71f8: 2aad1844 a1cfb68c 7fff7208 2aab3a24 00000001 2aacc308 7fff71f0 2aab7bd4
7fff7218: 7fff7270 2aaca230 00000000 00000000 00000001 00000000 00000000 2aad15b8
7fff7238: 2aae2000 2aacbc00 7fff7220 2aab7bd4 7fff7308 2aab7e24 00000001 2aacb878
7fff7258: ffffffff 2aad15b8 2aae2000 0000000d 2aad15b8 00000000 00000000 00000000
7fff7278: 2c512f18 0b7268a5 00000000 2aacb180 7fff7268 2aab7bd4 2aad1068 00000001
7fff7298: 00000000 2aacae00 2aae2000 2aab7bd4 00000037 2c52a110 2c513536 2aad15b8
7fff72b8: 2c5134c0 00000001 00000002 2c512ba8 7fff72d0 2aab5670 2c51223c 2aab7bd4
7fff72d8: 2aad15b8 2aad1844 00000000 00000000 00000001 00000000 2aae2000 2aadabc0
7fff72f8: 2aaa8645 2aabd1e0 ffffffff 00000000 2aae2000 00000000 2aae2000 00000001
7fff7318: 00000b50 2aada2a8 2aad2cd0 2aabd6f4 7fff73d4 2c512ba8 00000000 00000022
7fff7338: 2aae2000 2f746f6f 2aada2a8 00000001 2c512f18 00000001 00000000 2aaa895c
7fff7358: 0000001c 00000000 00000000 2aaca000 00000001 2aabe0d0 2aab543c 2aad15b8
log: ... 2 notice: (1008) check_node_data: wrong data CRC in data node at 0x00337620: read 0xd10c9294, calculated 0xbcb7794b.
log: <4>Data CRC 040e30d4 != calculated CRC 1b75f785 for node at 0033ec40
log: <4>Data CRC 040e30d4 != calculated CRC 1b75f785 for node at 0033ec40
Text segment map
0x00400000-0x005ba000 /exos/bin/hal
0x2b030000-0x2b049000 /exos/lib/libpibutil.so.0.0
0x2aaa8000-0x2aaca000 /lib/ld-2.13.so
0x2aadc000-0x2aadf000 /lib/libdl-2.13.so
0x2aaf0000-0x2aafa000 /exos/lib/libhal.so
0x2ab0c000-0x2ab1f000 /exos/lib/libcommon.so
0x2ab30000-0x2ab35000 /exos/lib/libcli.so
0x2ab46000-0x2ab9a000 /exos/lib/libvlan.so
0x2abb2000-0x2abcb000 /exos/lib/libcmbackend.so
0x2abdc000-0x2abdf000 /exos/lib/libipv6.so
0x2abf0000-0x2ac0a000 /exos/lib/librtmgrc.so
0x2ac1c000-0x2ac20000 /exos/lib/libsnmpclient.so
0x2ac30000-0x2ac43000 /exos/lib/libacl.so
0x2ac54000-0x2ac62000 /exos/lib/libfdb.so
0x2ac74000-0x2aceb000 /exos/lib/libaspen.so
0x2ad54000-0x2af77000 /exos/lib/libpib.so
0x2b05a000-0x2b091000 /exos/lib/libaspenshared.so
0x2b0aa000-0x2b119000 /exos/lib/libaspensm.so
0x2b27e000-0x2b28d000 /exos/lib/libaspensvc.so
0x2b2b0000-0x2b2b5000 /exos/lib/libaspenutil.so
0x2b2c6000-0x2b2cb000 /exos/lib/libsummitbcmicm.so
0x2b84c000-0x2bce1000 /exos/lib/libbcmplat.so
0x2c0de000-0x2c148000 /exos/lib/libcorediags.so
0x2c16c000-0x2c171000 /exos/lib/libaspendiags.so
0x2c182000-0x2c1e3000 /exos/lib/libstratadiag.so
0x2c206000-0x2c214000 /exos/lib/libaspenpoe.so
0x2c224000-0x2c22c000 /exos/lib/libpibdiag.so
0x2c23c000-0x2c254000 /lib/libpthread-2.13.so
0x2c2fe000-0x2c35b000 /exos/lib/libdispatch.so
0x2c374000-0x2c381000 /exos/lib/libwkninfo.so
0x2c394000-0x2c4fa000 /lib/libc-2.13.so
Build directory: /data3/release-manager/v15_7_1_4/aspen_msm
failure: process crash
time: Tue Apr 26 01:23:15 2016
process xmld
pid 1468
signal 11
$0 : z0=00000000 at=10001f00 v0=00000003 v1=ffffffff
$4 : a0=0000432b a1=00000025 a2=00000025 a3=00000000
$8 : t0=0000007a t1=2aac8504 t2=00000050 t3=f0000000