1
Votes

Cold Boot / Power Up / coldStart or Reset Log Entry

 
1
Votes

Cold Boot / Power Up / coldStart or Reset Log Entry

Cold Boot / Power Up / coldStart or Reset Log Entry
Cold Boot / Power Up / coldStart or Reset Log Entry
3 weeks ago - last edited 3 weeks ago
Tags: #Boot Log
Model: OC200  
Hardware Version: V2
Firmware Version: 5.15.24.21

This is a feature request for device (EAP, Switch, etc) boot logs (reset, cold boot, reboot) that would assist in Omada WiFi distribution troubleshooting.  

 

There are no boot up logs.. cold boot... coldStart or anything to indicate a missing gateway or Omada controller.  Makes it tricky to infer a reboot from evidence.  Things look like they rebooted, but not always easy to see.  No infrastructure (switches, gateway, or Controller) create any boot up / cold boot / reset logs. Would be very helpful for troubleshooting.  

 

This is a screen shot of a power outage where the entire Omada WiFi distribution system went down.  Normal operation is visible until the UPS runs out that is not announced... last gasp...  the system simply disappears.  Does Omada remotely monitor or use any keep-alives that could indicate the system has missed heartbeat?  None are in the log.  Once power is restored, we see IP addressing start for EAP's only (never any switches).  There is no entry for the OC200 achieving sanity and booted up, which I assume it must be to capture these logs?  Once the addressing for EAPs is completed, all of them appear in logs as  "connected", and then clients begin to appear.  This is easy to diagnose, since it is a power outage that took it all down and then restored upon power up.  The issue is with flaky EAPs or individual items that may cycle, reboot, and don't show so cleanly.  Especially with a functional/powered/sane OC200, boot logs for devices could be captured, but there are none.  

 

The screen shots that follow are excerpts from the log.  It seems power came back 2X as addresses were being issued to devices at 17h01, then the process restarted at 17h25:  

 

 

#1
Options
1 Reply
Re:Cold Boot / Power Up / coldStart or Reset Log Entry
2 weeks ago

Hi  @RF_Dude 

 

Thanks for posting here.

 

Although this feature seems helpful for troubleshooting, from a practical operations and product design perspective, its significance may indeed be limited, for the following reasons:  

1. Sporadic issues are difficult to reproduce with logs
   - Most random failures (e.g., automatic device reboots, momentary power loss) are caused by random hardware or environmental factors (e.g., power fluctuations, overheating). Logs can only record that a reboot occurred but cannot directly identify the root cause.  

 

2. Excessive logs may obscure critical information
   - Frequently logging device startup/heartbeat status can lead to log overload, potentially burying more important failure signals (e.g., network congestion, misconfigurations).  
   - For example, in large networks, hundreds of devices reporting heartbeat logs every minute would create unnecessary noise for administrators.  

 

3. Existing mechanisms can already infer issues indirectly
   - Through indirect logs such as device offline time, IP reassignment records, and client disconnections, it is usually possible to deduce whether a device has rebooted—without needing dedicated boot logs.  
   - If a device frequently reboots abnormally, the issue is more likely due to hardware failure or firmware bugs, requiring replacement or updates rather than log analysis.  

 

4. Logs cannot be saved if the controller is offline  
   - If the entire system loses power (e.g., UPS depletion as mentioned by the user), the controller (OC200) will also shut down. In this case, even if devices generate boot logs, they **cannot be uploaded to the controller**, rendering the feature useless.  

 

5. More suitable for enterprise-grade monitoring tools  
   - In professional environments, network health should rely on active monitoring tools (e.g., SNMP, Prometheus) for real-time device status tracking, rather than post-mortem log analysis.  

#2
Options