Hello everyone,
So a weird issue has popped up that started a couple days ago. I'm running 4x WD RED 3TB drives in RAIDZ1. All of these disks are practically brand new.
I'm not suffering from any read or write errors on the disks.
Now, any time I add a large amount of data, I always end up with checksum errors. I can't figure out what's going on here, I've swapped in completely different drives, created a new zpool with them and the same exact thing happens. I highly doubt it's related to my disks.
I'm using the on board SATA controller (non RAID), and have been for the past 3 - 4 years without a problem. I've replaced the SATA cables just to rule that out.
I'm a bit out of my depth here, I've been working with computers for most of my life but have only recently started dealing with *nix systems. So, what's my next step? I'm not really sure how to diagnose this.
I replaced a WD Green drive with a WD Red so I'd finally have all NAS type HDD's in there, and that's when the issues started but I don't think it has anything to do with the new disk, like I said above, I'm experiencing the same issues regardless of the drives I use.
What prompted me to replace the WD Green in the first place is a few checksum errors popped up on that drive, so it gave me an excuse. The resilvering process failed the first time, saying there was a corrupted file that needed to be addressed. So I deleted said file (was an unimportant temp file) and restarted the resilvering. Second time around it was complaining about my Plex jail, so I just deleted that dataset entirely and started over for a third time. It finally worked after doing that plus a scrub right after.
Ever since it's been a nightmare getting this thing to function properly. I went as far as recreating my primary dataset entirely in order to migrate to ashift = 12 (4k sectors) for maximum performance, since my old drives were 512b sectors. It didn't help at all but at least I know there should be no issue there. In the 4 years of having this system, this is the first time I've run into this.
Oh and I have a fresh install of FreeNAS, just to rule any of that out. Put it on a brand new 16GB flash drive shortly after replacing the WD Green disk.
So a weird issue has popped up that started a couple days ago. I'm running 4x WD RED 3TB drives in RAIDZ1. All of these disks are practically brand new.
I'm not suffering from any read or write errors on the disks.
Now, any time I add a large amount of data, I always end up with checksum errors. I can't figure out what's going on here, I've swapped in completely different drives, created a new zpool with them and the same exact thing happens. I highly doubt it's related to my disks.
I'm using the on board SATA controller (non RAID), and have been for the past 3 - 4 years without a problem. I've replaced the SATA cables just to rule that out.
I'm a bit out of my depth here, I've been working with computers for most of my life but have only recently started dealing with *nix systems. So, what's my next step? I'm not really sure how to diagnose this.
Code:
brandonb@freenas:~ % zpool status
pool: Tesla
state: ONLINE
status: One or more devices has experienced an unrecoverable error. An
attempt was made to correct the error. Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
using 'zpool clear' or replace the device with 'zpool replace'.
see: http://illumos.org/msg/ZFS-8000-9P
scan: scrub repaired 256K in 0h12m with 0 errors on Tue Jun 30 02:19:14 2015
config:
NAME STATE READ WRITE CKSUM
Tesla ONLINE 0 0 0
raidz1-0 ONLINE 0 0 0
ada0p1 ONLINE 0 0 2
ada1p1 ONLINE 0 0 0
ada2p1 ONLINE 0 0 2
ada3p1 ONLINE 0 0 2
errors: No known data errors
pool: freenas-boot
state: ONLINE
scan: none requested
config:
NAME STATE READ WRITE CKSUM
freenas-boot ONLINE 0 0 0
da0p2 ONLINE 0 0 0
errors: No known data errors
brandonb@freenas:~ %
I replaced a WD Green drive with a WD Red so I'd finally have all NAS type HDD's in there, and that's when the issues started but I don't think it has anything to do with the new disk, like I said above, I'm experiencing the same issues regardless of the drives I use.
What prompted me to replace the WD Green in the first place is a few checksum errors popped up on that drive, so it gave me an excuse. The resilvering process failed the first time, saying there was a corrupted file that needed to be addressed. So I deleted said file (was an unimportant temp file) and restarted the resilvering. Second time around it was complaining about my Plex jail, so I just deleted that dataset entirely and started over for a third time. It finally worked after doing that plus a scrub right after.
Ever since it's been a nightmare getting this thing to function properly. I went as far as recreating my primary dataset entirely in order to migrate to ashift = 12 (4k sectors) for maximum performance, since my old drives were 512b sectors. It didn't help at all but at least I know there should be no issue there. In the 4 years of having this system, this is the first time I've run into this.
Oh and I have a fresh install of FreeNAS, just to rule any of that out. Put it on a brand new 16GB flash drive shortly after replacing the WD Green disk.
Last edited: