TL-SX3016F blocks traffic
Hello,
I had an issue with a TL-SX3016F.
This is my schema.
1 x TL-SX3016F
2 x Mikrotik CCR1036-8G-2S+ connected to TL-SX3016F via 10 Gbps DAC cable
First CCR1036-8G-2S+ is the main and run routerOs 6.48.6. Second CCR1036-8G-2S+ is a neighbor and run routerOs 6.49.18
All devices are installed in air-conditioned datacenter.
To the TL-SX3016F are connected several fiber links, both 10 Gbps and 1 Gbps, to other Mikrotik routers in other sites. All those Mikrotik routers had routerOs 6.48.6.
All Mikrotik router run OSPF.
System was up since 300 days and suddenly, on 25th August, OSPF went down on all routers at 10.30AM for 5 minutes and then another time at 12.00AM for 5 minutes.
I rebooted main CCR1036-8G-2S+, thinking about an OSPF problem, and I added some static backup routes on routers.
The same issue appears twice on 27h August at he same time (more o less) and traffic didn't pass even with the static routes.
I upgraded main CCR1036-8G-2S+ to routerOs 6.49.18 thinking about some bug or firmware mismatch.
On 28th August early morning I also upgraded all other neighbor Mikrotik routers to 6.49.18 to have all OSPF routers with same firmware.
On 28th August night (about 10.30 PM) OSPF and traffic goes down.
During the night, on 29th August at 1.00 AM I rebooted TL-SX3016F and since that time I had no more problems. Now I'm still monitoring system.
I honestly don't think it's a routerOs problem because even with the upgrade the issue occured.
I also don't think it's a problem of neighbor routers because all OSPF went down at the same time.
Maybe a main CCR1036-8G-2S+ issue? It could be but:
- a single reboot didn't help => issue remains
- a routerOs upgrade didn't help => issue remains
- a switch reboot seems help...
I add another point.
One year ago I had similar issue on the same system (with less neighbor routers but same schema).
Suddenly traffic stops passing many time in few days. Thinking about a switch problem I changed the switch and everything worked good until now.
Old switch was TL-SX3016F bought together with that now installed but with previous firmware version 1.0.
Due to this I thought about an hardware o firmware problem occurred with the old switch but now, one year after, with a similar problem I think that on both cases something happened to both switches. I don't know if firmware issue or if something happens after many days of uptime or other.
I hope you can help me.
Thanks
Best regards
Andrea