TheAMAZINGNorad
04-28-2008, 01:23 AM
I must proclaim I am a linux newbie. My friend has been showing me the ropes, and I am trying to solve all of my problems on my own. This problem, however, just seems beyond me. He is out of town for a week, and I need to figure out what is going on!
The other night, I got a mail message alerting me to a failure in md1. I proceeded to cat /proc/mdstat and was shocked to find that sdc2 was followed by [F]! I rebooted the machine, ensuring that all the cables were in place and checked mdstat again. This is what I see.
md1 : active raid5 sdd1[2] sdb2[1] sda1[3]
937392384 blocks level 5, 256k chunk, algorithm 2 [4/3] [_UUU]
If I do fdisk - l it shows all the partitions and info on /dev/sdc. When I attempt to add the drive back in, mdadm /dev/md1 -a /dev/sdc2, it succeeds! Then when it starts to rebuild, it crashes every time around 37%. Once it "hangs," if I try to do fdisk -l /dev/sdc is no longer visible. If I run dmesg, I get some garble along the lines of this.
ata3.00:,exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
I used seatools and there are NO Smart errors on the drive. What is going on here? Is the drive really bad even when all the seagate tools say it is ok? I am at a loss of what to do here. I am thinking of low-level formating the drive, partitioning it like before, adding it back in, and seeing if the array will rebuild. Any thoughts?
The other night, I got a mail message alerting me to a failure in md1. I proceeded to cat /proc/mdstat and was shocked to find that sdc2 was followed by [F]! I rebooted the machine, ensuring that all the cables were in place and checked mdstat again. This is what I see.
md1 : active raid5 sdd1[2] sdb2[1] sda1[3]
937392384 blocks level 5, 256k chunk, algorithm 2 [4/3] [_UUU]
If I do fdisk - l it shows all the partitions and info on /dev/sdc. When I attempt to add the drive back in, mdadm /dev/md1 -a /dev/sdc2, it succeeds! Then when it starts to rebuild, it crashes every time around 37%. Once it "hangs," if I try to do fdisk -l /dev/sdc is no longer visible. If I run dmesg, I get some garble along the lines of this.
ata3.00:,exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
I used seatools and there are NO Smart errors on the drive. What is going on here? Is the drive really bad even when all the seagate tools say it is ok? I am at a loss of what to do here. I am thinking of low-level formating the drive, partitioning it like before, adding it back in, and seeing if the array will rebuild. Any thoughts?