What's new

Occasional hang

  • SNBForums Code of Conduct

    SNBForums is a community for everyone, no matter what their level of experience.

    Please be tolerant and patient of others, especially newcomers. We are all here to share and learn!

    The rules are simple: Be patient, be nice, be helpful or be gone!

Tim G

New Around Here
Symptoms are:
  • Lost WAN connection (LAN ok)
  • Core-2 pinned at 100% load (Core-1 normal)
  • Memory usage normal (30%), no errors in syslog
  • Web interface still responding, but not possible to make SSH connection
Setup is RT-AC68U with two USB disks shared via Samba only. The hang has happened with Merlin firmwares 378.53_0 and 378.54_2.

Without SSH access I don't know how to debug this further. Any thoughts?
Many thanks, -Tim
 
Yep, I have experienced this too... normally during the day when I'm at work and can't access it because the WAN connection is dead...
 
Symptoms are:
  • Lost WAN connection (LAN ok)
  • Core-2 pinned at 100% load (Core-1 normal)
  • Memory usage normal (30%), no errors in syslog
  • Web interface still responding, but not possible to make SSH connection
Setup is RT-AC68U with two USB disks shared via Samba only. The hang has happened with Merlin firmwares 378.53_0 and 378.54_2.

Without SSH access I don't know how to debug this further. Any thoughts?
Many thanks, -Tim

You will need SSH access to determine what process is crashed and taking 100% of one CPU core.
 
You will need SSH access to determine what process is crashed and taking 100% of one CPU core.
Any idea what could take out the SSH daemon and the WAN at the same time? I assume that even with only one CPU core available, all non-crashed processes should remain responsive. I suppose I can try to write a logging script that stores to disk.
 
Just to double check....since you have gui access, anything in the syslog?
 
Nope, only DHCP requests. The SSH connection attempt showed up too, but I couldn't actually get a prompt.
 
How are you trying access.....wired or wireless...windows or Linux or mac? Is it the same from multiple clients?
 
Wired Debian. Didn't try other clients before rebooting the router. It only happens ~once a week, so difficult to investigate. Both crashes occurred whilst connected to my LAN from outside via SSH. I have two external ports forwarded to the same internal port on the same machine - can this cause problems?
 
Any idea what could take out the SSH daemon and the WAN at the same time? I assume that even with only one CPU core available, all non-crashed processes should remain responsive. I suppose I can try to write a logging script that stores to disk.

Seriously high CPU/IO load that might prevent the sshd daemon from properly opening and establishing a connections perhaps.

Try opening a SSH connection, and leave it open on your computer. With any luck, by the time it hangs again, you will still be able to check with "top" what is loading the CPU.
 
After I last posted I set up automatic logging of 'top' to disk to try to find the cause of the problem. Since then my router has run for almost 2 weeks without problems, until today when it appears to have crashed and rebooted several times.

The 'top' logs don't seem to show anything suspicious:
https://drive.google.com/file/d/0B9_DGx19AWmoYnJxdjVibUo2T2s/view?usp=sharing

I'm not really sufficiently knowledgeable to read the system logs, but it seems very odd that there should be multiple reboots:
https://drive.google.com/file/d/0B9_DGx19AWmob09qc1lDcGowNnM/view?usp=sharing

Still running 378.54_2 for now.
 

Latest threads

Support SNBForums w/ Amazon

If you'd like to support SNBForums, just use this link and buy anything on Amazon. Thanks!

Sign Up For SNBForums Daily Digest

Get an update of what's new every day delivered to your mailbox. Sign up here!
Top