ER706W freezes
ER706W freezes
Hi,
I have two sets of ER706W in two different locations.
One is connected to 4G cellular network (modem is in bridge mode) - let's call it Side Office,
Second is connected to fiber 1/1Gbps network (directly to ONT) - let's call it Main Office.
Devices are connected via IPsec Site-2-Site, where Main Office is in responder mode. I do use DynDNS since public IPs may change.
Both sites are managed by one OC200 controller, placed in Main Office, connected directly to router, powered separately. Cloud access is enabled.
Since (about) a month I'm facing weird "freezez" of both devices which looks for me totally random and can happen at night, when there is no traffic in the network at all. It happens to both but not at the same time, every few days.
The freezes, looks the same for both routers: the device in theory is working (LEDs are blinking) but router is not responding:
1. Can't attempt to access it's webgui (address not responding) - both
2. Can't connect to WiFi - any attempt end up with failed connection - both
3. Side Office on Controller's dashboard looks like it is totally disconnected (no graph)
4. When Main Office's router is frozen, shows as Offline Controller in cloud access
5. After router's reboot (point 4) Main Office on Controller's dashboard from that time shows as connected, but there was no traffic, PING line was totally flat and constantly on value 1mc (which is impossible, minimum ping is 6ms)
The only thing I can do to make those devices work then is to manually restart those (reconnect to power supply).
I do have some more configuration on Main Office router (few subnets, IPsec and OpenVPN server, DynDNS, some ACLs) but Side Office router has almost no specific config (IPsec, DynDNS only).
No device error is showed in logs, no massive CPU/memory usage/spikes, no traffic before/during those freezes.
The only thing that comes to my mind is the last update (1.1.2) - but I'm not 100% sure if the problems started after that upgrade or some time before.
What should I check in that case? Are there any additional logs somewhere available or only those I can see in controller's log section? Anyone faced the same issue by any chance?
Thanks for any tips.
Cheers!
- Copy Link
- Subscribe
- Bookmark
- Report Inappropriate Content
Hi @RaRu
Thanks for posting in our business forum.
RaRu wrote
Hi, Thanks for your reply.
How should I check that: Does the WAN still get an IP address?
Since I can't log into router, it has no connection to controller, ping from WAN do not response.
Only one site crashes at the time. One crash do not cause a crast on the other site.
Ping to the default VLAN 1 GW IP, does it respond? Looks like the router crashed and died?
- Copy Link
- Report Inappropriate Content
On Site's router there is only one, default LAN - 192.168.100.1/24 and router's IP address is 192.168.100.1 which responds.
- Copy Link
- Report Inappropriate Content
Hi @RaRu
Thanks for posting in our business forum.
RaRu wrote
On Site's router there is only one, default LAN - 192.168.100.1/24 and router's IP address is 192.168.100.1 which responds.
DHCP IP still assigned, not showing up as 169.254.X.Y. And pinging to the 192.168.100.1 works as well. That'd be a VPN problem in which tunnel fails.
Not the router crashes. As for the setup where you have a S2S VPN, once the VPN tunnel is down, it'll affect the controller and readings on it.
Is that possible to backup your config and let us test with it?
I am prone to believe that the issue is about the ISP. If you have spare units, you can also test this under the local network as it creates a stable network environment.
Besides, possible to Wireshark on the WAN to monitor what caused the disconnection? Which you might need a full capture and filter the several parameters:
1. DNS resolution on the WAN is used to detect the online status. I explained this part in a KB article, which you can find helpful in determining whether this is actually a WAN issue.
2. IPsec related parameters. IKEV, IPsec and other values.
- Copy Link
- Report Inappropriate Content
Hi,
The router is connected to the controller via Internet, not via VPN. I can manually disconnect S2S VPN on the device and it still reports to controller.
Moreover, when the problem occurs, not only the intranet is not working, but also access to internet. If only VPN fails, then (in theory) Internet access should still work as usual on that device, but it is not.
I don't have spare unit, no. The problem occurs once a week or so, at random. Gathering Wireshark logs for whole week is almost impossible.
The only thing I have observed that (so far) daily reboots of router prevents this from reoccurring (one router is getting rebooted daily and so far there's no issue, second one is not getting reboots and is "crashing").
I can export the config, yes. How would you like to receive it?
- Copy Link
- Report Inappropriate Content
Hi @RaRu
Thanks for posting in our business forum.
RaRu wrote
Hi,
The router is connected to the controller via Internet, not via VPN. I can manually disconnect S2S VPN on the device and it still reports to controller.
Moreover, when the problem occurs, not only the intranet is not working, but also access to internet. If only VPN fails, then (in theory) Internet access should still work as usual on that device, but it is not.
I don't have spare unit, no. The problem occurs once a week or so, at random. Gathering Wireshark logs for whole week is almost impossible.
The only thing I have observed that (so far) daily reboots of router prevents this from reoccurring (one router is getting rebooted daily and so far there's no issue, second one is not getting reboots and is "crashing").
I can export the config, yes. How would you like to receive it?
Cloud access, we need that.
What's the WAN2 module you use? Do you have a specific model number?
Or RJ45 to SFP? Can you confirm?
If you replug the WAN port, will it recover the WAN connection? If you know.
Will get you a ticket later.
- Copy Link
- Report Inappropriate Content
Hi @Clive_A
Thank you so much for taking the time to post the issue on TP-Link community!
To better assist you, I've created a support ticket via your registered email address, and escalated it to our support engineer to look into the issue. The ticket ID is TKID241254733, please check your email box and ensure the support email is well received. Thanks!
Once the issue is addressed or resolved, welcome to update this topic thread with your solution to help others who may encounter the same issue as you did.
Many thanks for your great cooperation and patience!
- Copy Link
- Report Inappropriate Content
Hi,
Thank you for your help.
I use simple RJ45 connection on WAN2.
Replug of the cable is next to be tested whenever the problem reoccurs.
- Copy Link
- Report Inappropriate Content
Hi,
Just wanted to sum it up since the ticket has been created in official TP-Link support:
I have this problem on both ER706W - in Main and SIte location, where two different ISPs are providing their services.
Main device is configured to reboot daily at 3AM CET to prevent loosing WAN connection on it's own at random. Site device don't have this setting enabled in order to force it to fail and gather some additional info.
List of last outages which I've noted down (but I've been informed by users that it started long time ago (like 2 or more months ago):
Main:
12.12.2024 11:39 CET
Site:
06.12.2024 00:01 CET
19.12.2024 03:23 CET
Here's a graph of network structure to make is easier to understand:
The VPN config is based on DynDNS since both sites have Public but dynamic IP. The DynDNS service is updated by router itself.
When the problem occurs, the LAN and WiFI networks are accessible (ping to router works) but there's no Internet connection. PING on WAN (from internet) doesn't work. Router is not reporting to controller.
In Site location I was able to test that Huawei router's reboot was enough to restore connectivity. Prior to reboot, the Huavei device was showing LAN port down as no client was connected to it. Next thing to test (when problem occurs): if simply WAN RJ45 reconnection is enough to restore the connectivity.
I would think it's Huawei/ISP's fault if noto having the same issue in Main location where ISP is different.
Last logs (just before the lost connection) are pasted in previous posts.
- Copy Link
- Report Inappropriate Content
Information
Helpful: 0
Views: 322
Replies: 18
Voters 0
No one has voted for it yet.