bubbles
01-18-2006, 03:27 PM
Here is specs:
Asus P4P800-Deluxe
P4 2.4Ghz
1GB DDR400
2U rackmount case
2x SATA backplanes
RocketRaid 1820A controller
8x 320GB WD SATA drives in RAID5
1x 80GB WD O/S drive
Seasonic 600w PSU
100mbit LAN with 5 to 50 users at any time.
Fedora Core 3
Highpoint drivers
The RAID5 is run as an encrypted loop device using LOOP-AES. The device is one big 2TB partion formatted with JFS (4k block size).
System was started off with random file corruption so swapped memory and it worked fine for 1 week then started freezing up more and more frequently until it wouldnt even boot unless it was left turned off for at least 12 hours, sometimes only the NIC would crash.
Changed to a different motherboard and all seemed ok for about a week then it started crashing again. It would come back online right after a reboot unlike before but would crash frequently.
Next changed the PSU from stock 460w to a Seasonic 600w. Now system was running fine for about a week until i did some stuff with the "dd" command and it froze up. Rebooted it and it came back online but now crashing ever 1 to 5 days. The crashes seem to get more and more frequent.
- motherboard swapped
- memory swapped
- psu swapped
- moved to another room with better A/C and different power.
I can't think of anything else to try?? There is really no software on it besides the pretty much default kernel, server daemon, loop-aes module and RR driver.
Monitoring it with "top" can't really see anything that triggers a crash.
ANY IDEAS?
Asus P4P800-Deluxe
P4 2.4Ghz
1GB DDR400
2U rackmount case
2x SATA backplanes
RocketRaid 1820A controller
8x 320GB WD SATA drives in RAID5
1x 80GB WD O/S drive
Seasonic 600w PSU
100mbit LAN with 5 to 50 users at any time.
Fedora Core 3
Highpoint drivers
The RAID5 is run as an encrypted loop device using LOOP-AES. The device is one big 2TB partion formatted with JFS (4k block size).
System was started off with random file corruption so swapped memory and it worked fine for 1 week then started freezing up more and more frequently until it wouldnt even boot unless it was left turned off for at least 12 hours, sometimes only the NIC would crash.
Changed to a different motherboard and all seemed ok for about a week then it started crashing again. It would come back online right after a reboot unlike before but would crash frequently.
Next changed the PSU from stock 460w to a Seasonic 600w. Now system was running fine for about a week until i did some stuff with the "dd" command and it froze up. Rebooted it and it came back online but now crashing ever 1 to 5 days. The crashes seem to get more and more frequent.
- motherboard swapped
- memory swapped
- psu swapped
- moved to another room with better A/C and different power.
I can't think of anything else to try?? There is really no software on it besides the pretty much default kernel, server daemon, loop-aes module and RR driver.
Monitoring it with "top" can't really see anything that triggers a crash.
ANY IDEAS?