Y
yasu
Guest
Hello,
I have an unstable DIY NAS on my hands and it's beyond my abilities to diagnose. Any tips would be highly appreciated. It's exhibiting sporadic and unpredictable behavior, but I'll do my best to summarize as succinctly as possible.
Here's the config (mostly lifted from another SNB forum poster):
- First off, I should mention that it ran fine for a week after initial construction while I installed Ubuntu Server, installed packages, modified config files, etc.
- Then I tried to kick off a Handbrake video encode to stress test the CPU; this made the system completely non-responsive after about 20 minutes
- Now, during various resets, cold boots, and yanking of non-essential bits off hardware, there was a lot of random behavior which can be summed up as: freezes arbitrarily. Most commonly, it won't boot at all: no beep, no bios output. It seems like yanking the IDE cable from the MOBO will often get it to at least show the BIOS again (but not necessarily make it to the OS). Which might make you think it's the system HDD or the DVD, but it's not, because with neither of those connected, but with the IDE cable still in the MOBO, it will still not boot. Only once it's yanked from the MOBO itself will it do something.
- Sometimes the freeze will happen later, like partway through the BIOS check or while Linux is starting, or even just sitting at the Ubuntu command prompt doing nothing. But once it has crashed once, it will usually freeze sometime during or before the BIOS load from then on.
- The day after the first crash, it ran fine all day long and I used the day to test out every possible RAM chip configuration with memtest86 and several successful(!) HandBrake encodes, so I don't think it's the RAM.
- On this successful day, I had the case wide open, so I thought it might be temperature related, so I added an intake fan and have monitored the temps more closely, but there doesn't seem to be any correlation.
So, that seems to leave MOBO, CPU, and power supply. I don't know how to test any of those things without buying unnecessary duplicates. MOBO is probably my best guess because of the IDE thing, but I'd hate to RMA it and be wrong.
Any ideas?
Sorry for the long post! Thanks!!
I have an unstable DIY NAS on my hands and it's beyond my abilities to diagnose. Any tips would be highly appreciated. It's exhibiting sporadic and unpredictable behavior, but I'll do my best to summarize as succinctly as possible.
Here's the config (mostly lifted from another SNB forum poster):
- Chenbro ES34069
- JetWay JNC81-LF
- AMD Athlon X2 4850e 2.5GHz Socket AM2 45W
- Kingston 4GB (2 x 2GB) 240-Pin DDR2 SDRAM DDR2 800 (PC2 6400)
- RAID5: 4x Western Digital Caviar Green WD10EADS 1TB
- System drive: Western Digital Scorpio WD800BEVE 80GB
- Panasonic SR-8178-B Slimline Tray-loading CD/DVD-ROM
- First off, I should mention that it ran fine for a week after initial construction while I installed Ubuntu Server, installed packages, modified config files, etc.
- Then I tried to kick off a Handbrake video encode to stress test the CPU; this made the system completely non-responsive after about 20 minutes
- Now, during various resets, cold boots, and yanking of non-essential bits off hardware, there was a lot of random behavior which can be summed up as: freezes arbitrarily. Most commonly, it won't boot at all: no beep, no bios output. It seems like yanking the IDE cable from the MOBO will often get it to at least show the BIOS again (but not necessarily make it to the OS). Which might make you think it's the system HDD or the DVD, but it's not, because with neither of those connected, but with the IDE cable still in the MOBO, it will still not boot. Only once it's yanked from the MOBO itself will it do something.
- Sometimes the freeze will happen later, like partway through the BIOS check or while Linux is starting, or even just sitting at the Ubuntu command prompt doing nothing. But once it has crashed once, it will usually freeze sometime during or before the BIOS load from then on.
- The day after the first crash, it ran fine all day long and I used the day to test out every possible RAM chip configuration with memtest86 and several successful(!) HandBrake encodes, so I don't think it's the RAM.
- On this successful day, I had the case wide open, so I thought it might be temperature related, so I added an intake fan and have monitored the temps more closely, but there doesn't seem to be any correlation.
So, that seems to leave MOBO, CPU, and power supply. I don't know how to test any of those things without buying unnecessary duplicates. MOBO is probably my best guess because of the IDE thing, but I'd hate to RMA it and be wrong.
Any ideas?
Sorry for the long post! Thanks!!