Bug report - ER8411 - One VLAN loses LAN>WAN communication
I have noticed a very strange bug with one particular VLAN on my network. This issue has persisted since at least ER8411 firmware 1.2.2 and is still present on 1.2.3, with OC200v1 firmware from January and the very latest March 10th beta.
This issue has persisted across factory resets of both gateway and controller, and complete rebuilds of the entire network from scratch. It does not matter whether the ER8411 is doing internal routing for this particular VLAN or I have it routed through an SVI on the LAN side switches. And it only affects one particular VLAN, with the same IP range each and every time!
Issue-
The VLAN with tag 8 and IP range 192.168.8.0/23 will lose internet connectivity at random and will never come back by itself. Internal routing still works. Clients can ping the gateway. They can ping and receive a response from an IP address on the internet. However, they cannot traverse the NAT and use the internet at all for normal browsing. Setting the DNS servers handed out by DHCP to either Cloudflare directly or the gateway's internal DNS proxy makes no difference. The problem affects both LAN and WiFi clients on this VLAN, regardless of which switch path they take from the gateway and core switch. Traceroutes fail at the gateway, yet clients can still ping an internet IP such as 8.8.8.8. Weird.
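For anyone wanting to rule out an address overlap: a quick sketch of what that /23 actually covers. The entries in other_vlans are made-up placeholders for illustration, not my real config, so swap in the actual ranges before drawing any conclusions.

```python
import ipaddress

# The affected VLAN range from this report.
vlan8 = ipaddress.ip_network("192.168.8.0/23")

print(vlan8.network_address)    # 192.168.8.0
print(vlan8.broadcast_address)  # 192.168.9.255
print(vlan8.num_addresses)      # 512

# A /23 is two adjacent /24s, so it also contains 192.168.9.0/24.
print([str(n) for n in vlan8.subnets(new_prefix=24)])


def overlapping(target, others):
    """Return the names of networks in `others` that overlap `target`."""
    return [name for name, net in others.items() if target.overlaps(net)]


# Hypothetical example ranges, NOT taken from the real network.
other_vlans = {
    "vlan9-example": ipaddress.ip_network("192.168.9.0/24"),
    "vlan10-example": ipaddress.ip_network("192.168.10.0/24"),
}

print(overlapping(vlan8, other_vlans))
```

If anything other than VLAN 8 shows up in that list with the real ranges, that would be worth mentioning in a reply.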
How I resolve it-
To fix it, all I need to do is assign the VLAN to another WAN in policy routing, then reassign it back to the WAN I actually want it to use. This restores its internet connectivity every time without needing to reboot anything. Very strange!
What is weird is that it only affects this one VLAN and IP range. None of the others, ever. All the other VLANs policy routed to the same WAN are unaffected. I cannot reliably replicate the problem, as it seemingly happens at random, and it is hard to monitor because this particular VLAN sees very infrequent use.
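Since the VLAN is rarely used, one way to catch the moment it breaks would be to leave a small watchdog running on a client in VLAN 8. This is only a rough sketch, not something the gateway or controller provides: the function names, the 1.1.1.1:443 TCP target, the 60 second interval and the Linux-style ping flags are all my own assumptions. It logs when ICMP still gets through but a TCP connection does not, which is the symptom described above.

```python
import socket
import subprocess
import time


def icmp_ok(host, timeout_s=2):
    """True if a single ICMP echo gets a reply (assumes Linux iputils ping flags)."""
    result = subprocess.run(
        ["ping", "-c", "1", "-W", str(timeout_s), host],
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode == 0


def tcp_ok(host, port, timeout_s=3.0):
    """True if a TCP connection to host:port can be opened within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout_s):
            return True
    except OSError:
        return False


def classify(icmp, tcp):
    """Turn the two probe results into a short state label."""
    if icmp and tcp:
        return "ok"
    if icmp:
        return "icmp-only"  # ping works, TCP does not: the failure seen on VLAN 8
    if tcp:
        return "tcp-only"
    return "down"


def monitor(ping_host="8.8.8.8", tcp_host="1.1.1.1", tcp_port=443,
            interval_s=60, rounds=None):
    """Probe every interval_s seconds; rounds=None means run until interrupted."""
    done = 0
    while rounds is None or done < rounds:
        state = classify(icmp_ok(ping_host), tcp_ok(tcp_host, tcp_port))
        print(time.strftime("%Y-%m-%d %H:%M:%S"), state, flush=True)
        done += 1
        if rounds is None or done < rounds:
            time.sleep(interval_s)
```

Calling monitor() and redirecting the output to a file should give a timestamp for the first "icmp-only" line, which can then be compared against the gateway and controller logs.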
I would welcome some ideas to test!
Things I have tried:
IDS/IPS disabled
Different DNS servers on that VLAN's DHCP
Client with static IP and statically set DNS
Deleted and recreated that one VLAN
Deleted and recreated all VLANs
Factory reset everything, readopted from existing configuration
Factory reset everything, including OC200, rebuilt everything from scratch
Disabled all gateway and switch ACLs
Thrown candies at the ER8411 when testing due to frustration with this
Shouting and Swearing at it
Threatening to replace it with a Cisco unit