So I'm hosting a new server for Minecraft and everything seems fine for several days to weeks. Randomly it will just "lose" internet at the host and require a power cycle to come back online. (Edit To go more indepth - the machine itself does not respond to SSH/any connections. When I use my IP-KVM (SpiderDuo) the machine does not respond to keyboard commands - for lack of a better word, its as if it is powered off, but the power management states it has power and doing a power-cycle makes it come back (after shutting down/coming back on)
Debian 7.0
100Mbps
3-6TB monthly usage
Update: below are the list of components used in this server (from newegg)
Antec 4U Case
Intel BOXDX79SI MOBO
i7 3960x CPU
http://www.newegg.com/Product/Product.aspx?Item=N82E16833106015
Low Profile GPU
32GB of G-Skill 1600 DDR3
Intel RAID SATA 8
I've attempted to gather logs but I don't see anything out of the ordinary, here's what I've tried so far.
I've run a memtest86+ for 90 hours at my house before shipping it off to a colo (oplink.net) There were zero errors and it went over the ram 6-10 times during this time.
I've attempted to go back tot he default kernel on Debian 7.0 thinking that my "dev" kernel may have had a leak or some other issue that isnt solved.
Edit 1: I'm using the standard Kernel (3.2) for Deb 7 and it is still having issues.
Edit 2: I have attempted to enable speed-stepping and c3/c5 in the bios - but it still has crashed/frozen
Edit 3: I am unable to use cpufreq-info - it tells me the CPU drivers are unknown.
Edit 4: I am currently testing the ram at 1333 instead of 1600.
Having to handle this remotely, I'm not sure where to go next, the datacenter guys have been great but arent paid to troubleshoot my machine, any thoughts?
Debian 7.0
100Mbps
3-6TB monthly usage
Update: below are the list of components used in this server (from newegg)
Antec 4U Case
Intel BOXDX79SI MOBO
i7 3960x CPU
http://www.newegg.com/Product/Product.aspx?Item=N82E16833106015
Low Profile GPU
32GB of G-Skill 1600 DDR3
Intel RAID SATA 8
I've attempted to gather logs but I don't see anything out of the ordinary, here's what I've tried so far.
I've run a memtest86+ for 90 hours at my house before shipping it off to a colo (oplink.net) There were zero errors and it went over the ram 6-10 times during this time.
I've attempted to go back tot he default kernel on Debian 7.0 thinking that my "dev" kernel may have had a leak or some other issue that isnt solved.
Edit 1: I'm using the standard Kernel (3.2) for Deb 7 and it is still having issues.
Edit 2: I have attempted to enable speed-stepping and c3/c5 in the bios - but it still has crashed/frozen
Edit 3: I am unable to use cpufreq-info - it tells me the CPU drivers are unknown.
Edit 4: I am currently testing the ram at 1333 instead of 1600.
Having to handle this remotely, I'm not sure where to go next, the datacenter guys have been great but arent paid to troubleshoot my machine, any thoughts?
Last edited: