Multiple WS-8-150-AC Units Hanging
Posted: Fri Jul 05, 2019 3:34 pm
Hello folks. Got a bit of a head-scratcher over here. We have a particular tower site with WS-8-150-AC units freezing, requiring a power cycle to come back up. Average run time can be anywhere from as long as a few days, to as short as a few hours. Another suspect behavior is on-location firmware updates on these units also result in a hang, necessitating a power cycle every time.
Thinking it was a hardware issue, we swapped in a few new WS-8-150-AC units. All in turn exhibit the same behaviors. All units were running various firmware in the 1.5.0-1.5.2 range, but trying different firmware revisions doesn't seem to resolve the hangs. I'm thinking it may be environmental, not not sure how to narrow it down without more logging data. For what it's worth, other equipment (Ubiquiti EdgeRouter, a cheap Netgear switch, DigitalLoggers WebPower Switch, and various radios) don't seem to be bothered at all.
The site is outdoors, & runs warm (as the central California valley does), but nothing outlandish. Temps max at about 75-80c on the CPU/PHY in the afternoons, board tops out around 60-65c. Should be noted hangs have occurred any time of the day, even during cool periods. There doesn't seem to be a timing pattern.
I've enabled remote syslog, but unfortunately it doesn't appear the logs reveal any clues. All we see is some spanning-tree messages from the end of the prior reboot followed by the new boot sequence messages such as follows:
Questions:
a) Is there a way to bump up verbosity on the log messaging (specifically syslog)?
b) Forum searches on temperature seem to show ours within reasonable range, so we're ruling that out somewhat. Could that be a factor?
c) We're going to put a different UPS in at the site, but not sure its dirty power if all the other equipment seems fine. Still a possibilty.
Anything obvious we're missing? We have dozens of Netonix WISPSwitch units and they're all bulletproof except for this particular site.
Thinking it was a hardware issue, we swapped in a few new WS-8-150-AC units. All in turn exhibit the same behaviors. All units were running various firmware in the 1.5.0-1.5.2 range, but trying different firmware revisions doesn't seem to resolve the hangs. I'm thinking it may be environmental, not not sure how to narrow it down without more logging data. For what it's worth, other equipment (Ubiquiti EdgeRouter, a cheap Netgear switch, DigitalLoggers WebPower Switch, and various radios) don't seem to be bothered at all.
The site is outdoors, & runs warm (as the central California valley does), but nothing outlandish. Temps max at about 75-80c on the CPU/PHY in the afternoons, board tops out around 60-65c. Should be noted hangs have occurred any time of the day, even during cool periods. There doesn't seem to be a timing pattern.
I've enabled remote syslog, but unfortunately it doesn't appear the logs reveal any clues. All we see is some spanning-tree messages from the end of the prior reboot followed by the new boot sequence messages such as follows:
- Code: Select all
Jul 5 07:32:31 192.168.6.6 STP: msti 0 set port 4 to forwarding sw-01
Jul 5 07:32:31 192.168.6.6 STP: msti 0 set port 3 to forwarding sw-01
Jul 5 07:32:31 192.168.6.6 STP: msti 0 set port 2 to forwarding sw-01
Dec 31 16:00:20 192.168.6.6 STP: msti 0 set port 3 to discarding sb-dry-dsw-01
Dec 31 16:00:20 192.168.6.6 Port: link state changed to 'down' on port 2 sw-01
Dec 31 16:00:20 192.168.6.6 Port: link state changed to 'down' on port 6 sw-01
Dec 31 16:00:20 192.168.6.6 Port: link state changed to 'down' on port 3 sw-01
Questions:
a) Is there a way to bump up verbosity on the log messaging (specifically syslog)?
b) Forum searches on temperature seem to show ours within reasonable range, so we're ruling that out somewhat. Could that be a factor?
c) We're going to put a different UPS in at the site, but not sure its dirty power if all the other equipment seems fine. Still a possibilty.
Anything obvious we're missing? We have dozens of Netonix WISPSwitch units and they're all bulletproof except for this particular site.