I think I have narrowed this down to the router, but I would like to submit my issue to you guys for your perusal.
This issue just cropped up recently (I can't recall what day exactly. I would have been at least on build 42. I am on build 43 now.) and it took me a while to figure out exactly what was happening. I thought it was spanning tree for a while, but it turns out that it wasn't. Time to explain.
So I have two switches. I replaced my 8 port netgear switch with a 16 port Cisco SG100-16. Both switches are dumb switches. I have a server that is doing DHCP and DNS duty for the network and both of these things are disabled in the router. I am going to try my best to draw out my network with text.
modem -> r o u t e r
/ |
DHCP/DNS server switch
Some of the clients are plugged into the router switch ports and some are connected to the switch. I have a total of 9 (including the switch) items that would be plugged in. those items would be spread between the router and the switch. I typically keep one router port open due to the placement of the router just in case I need to troubleshoot something.
That said, that means there are 3 things (including the switch) plugged in to the router. The rest is plugged in to the switch. Now, on the switch if I have 6 things plugged in all internet traffic stops (see my pings in the paste bin below). And by all traffic I mean I can't get to any of the LAN resources or any WAN resources.
If I unplug just one item from the switch (it doesn't matter what it is), all traffic resumes like normal. And I mean immediately. The moment item X is unplugged, everything resumes.
I have tried plugging things in to different switch ports on the switch and tried unplugging different items. I also switched back to my 8 port netgear switch, thinking the switch was the problem, but the behavior was the same.
I had initially thought it may have to do with spanning tree (which was enabled), but it turns out it wasn't because I disabled spanning tree and the behavior was the same.
I also tested by plugging in another device to the empty port on the router with everything else plugged (all 9 devices plugged in) in and the same thing happened. Further leading me to believe the issue is with the router, but if I am wrong, please correct me. My router is showing 11 clients connected now, most of which are wired (8 to be precise) and the rest are wireless.
So here are the pings, I have truncated them quite a bit because they were running for a long time, but this will help give you an idea. To be clear, we have pings from server (plugged into router directly) to the internet (google.com) and to a client (which is plugged in to the switch). We have pings from another server (which is plugged into the switch) to the internet (google.com) and to another client on the network (plugged into the router) and one client pinging the internet while on wifi.
Client pinging google while on wifi I plugged in the device at line 3. At line 4 you see how I start getting timeouts. I unplugged it at line 30. Then you see REALLY high times for the pings then it drops back to normal by line 38:
http://fpaste.org/109897/
Server connected to switch pinging google. The device was plugged in at line 3 then the ping didn't appear to do anything. It just stopped. Once I unplugged the device everything continued like normal. You can see how it goes from a 19ms time to 32253ms and other REALLY high times (I wasn't seeing this on my screen) until line 15 when the device was unplugged.
http://fpaste.org/109898/
You can see the same behavior here. This time it was the same server pinging another device on the LAN (this device is attached to the switch). You see really low ping times then they shoot WAY up (the super high times were not displayed on my screen until I unplugged the device) then they drop back down:
http://fpaste.org/109899/
In this paste you see the DHCP server (connected to the router) pinging the other server (connected to the switch) and we see the same behavior. LAN type ping times, they then shoot WAY up when another device is plugged in, then you see them drop back down once the device is unplugged.
http://fpaste.org/109900/
And finally the DHCP server (connected to the router) pinging google. You can see it shoot way up then drop back down when the device is unplugged.
http://fpaste.org/109901/
I have tried everything I can think of, but I just don't know what to make of my findings. I tried unplugging different devices to see if it was one specific device causing the issue (it was not device specific). I tried plugging in another device to the router to see if that would cause the issue (it did). The cables are all in good shape. The only thing I could think it could possibly be is the firmware or the router hardware itself. Is there any sort of debugging I can do to help figure this out? Would switching to the stock ASUS firmware potentially resolve things? Maybe DD-WRT? Tomato? I would rather not have to switch to dd-wrt or tomato if I can avoid it. At any rate, please comment with your thoughts.
This issue just cropped up recently (I can't recall what day exactly. I would have been at least on build 42. I am on build 43 now.) and it took me a while to figure out exactly what was happening. I thought it was spanning tree for a while, but it turns out that it wasn't. Time to explain.
So I have two switches. I replaced my 8 port netgear switch with a 16 port Cisco SG100-16. Both switches are dumb switches. I have a server that is doing DHCP and DNS duty for the network and both of these things are disabled in the router. I am going to try my best to draw out my network with text.
modem -> r o u t e r
/ |
DHCP/DNS server switch
Some of the clients are plugged into the router switch ports and some are connected to the switch. I have a total of 9 (including the switch) items that would be plugged in. those items would be spread between the router and the switch. I typically keep one router port open due to the placement of the router just in case I need to troubleshoot something.
That said, that means there are 3 things (including the switch) plugged in to the router. The rest is plugged in to the switch. Now, on the switch if I have 6 things plugged in all internet traffic stops (see my pings in the paste bin below). And by all traffic I mean I can't get to any of the LAN resources or any WAN resources.
If I unplug just one item from the switch (it doesn't matter what it is), all traffic resumes like normal. And I mean immediately. The moment item X is unplugged, everything resumes.
I have tried plugging things in to different switch ports on the switch and tried unplugging different items. I also switched back to my 8 port netgear switch, thinking the switch was the problem, but the behavior was the same.
I had initially thought it may have to do with spanning tree (which was enabled), but it turns out it wasn't because I disabled spanning tree and the behavior was the same.
I also tested by plugging in another device to the empty port on the router with everything else plugged (all 9 devices plugged in) in and the same thing happened. Further leading me to believe the issue is with the router, but if I am wrong, please correct me. My router is showing 11 clients connected now, most of which are wired (8 to be precise) and the rest are wireless.
So here are the pings, I have truncated them quite a bit because they were running for a long time, but this will help give you an idea. To be clear, we have pings from server (plugged into router directly) to the internet (google.com) and to a client (which is plugged in to the switch). We have pings from another server (which is plugged into the switch) to the internet (google.com) and to another client on the network (plugged into the router) and one client pinging the internet while on wifi.
Client pinging google while on wifi I plugged in the device at line 3. At line 4 you see how I start getting timeouts. I unplugged it at line 30. Then you see REALLY high times for the pings then it drops back to normal by line 38:
http://fpaste.org/109897/
Server connected to switch pinging google. The device was plugged in at line 3 then the ping didn't appear to do anything. It just stopped. Once I unplugged the device everything continued like normal. You can see how it goes from a 19ms time to 32253ms and other REALLY high times (I wasn't seeing this on my screen) until line 15 when the device was unplugged.
http://fpaste.org/109898/
You can see the same behavior here. This time it was the same server pinging another device on the LAN (this device is attached to the switch). You see really low ping times then they shoot WAY up (the super high times were not displayed on my screen until I unplugged the device) then they drop back down:
http://fpaste.org/109899/
In this paste you see the DHCP server (connected to the router) pinging the other server (connected to the switch) and we see the same behavior. LAN type ping times, they then shoot WAY up when another device is plugged in, then you see them drop back down once the device is unplugged.
http://fpaste.org/109900/
And finally the DHCP server (connected to the router) pinging google. You can see it shoot way up then drop back down when the device is unplugged.
http://fpaste.org/109901/
I have tried everything I can think of, but I just don't know what to make of my findings. I tried unplugging different devices to see if it was one specific device causing the issue (it was not device specific). I tried plugging in another device to the router to see if that would cause the issue (it did). The cables are all in good shape. The only thing I could think it could possibly be is the firmware or the router hardware itself. Is there any sort of debugging I can do to help figure this out? Would switching to the stock ASUS firmware potentially resolve things? Maybe DD-WRT? Tomato? I would rather not have to switch to dd-wrt or tomato if I can avoid it. At any rate, please comment with your thoughts.