After months of problem-free operation, I'm getting the following error messages on boot:
CAM status: ATA Status Error
ATA status: 51 (DRDY SERV ERR), error: 84 (ICRC ABRT )
RES: 51 84 a8 91 3f 00 00 00 00 00 00
Retrying command
(repeated over and over, except that the address in the RES line changes each time)
I occasionally had similar issues when this box ran Linux. The issues would come and go, and multiple drives were sometimes affected, although SMART was always clean (and still is now). After doing a bit of searching, I came across suggestions that it might not even be the hard drive(s) that were the problem, but rather faulty SATA cables, or "noise" (interference) from the power supply, or a bad motherboard, or...
Anyway, what can I do to track down the source of the error? I'm on FreeNAS-9.1.1-RELEASE-x64 (a752d35), in case that makes a difference. I'm not worried about data loss, because I've been exporting snapshots to an external drive regularly, and if this goes down for a week or two while I get parts, it won't be a big deal. But I would like to fix this.
zpool status -v outputs the following:
CAM status: ATA Status Error
ATA status: 51 (DRDY SERV ERR), error: 84 (ICRC ABRT )
RES: 51 84 a8 91 3f 00 00 00 00 00 00
Retrying command
(repeated over and over, except that the address in the RES line changes each time)
I occasionally had similar issues when this box ran Linux. The issues would come and go, and multiple drives were sometimes affected, although SMART was always clean (and still is now). After doing a bit of searching, I came across suggestions that it might not even be the hard drive(s) that were the problem, but rather faulty SATA cables, or "noise" (interference) from the power supply, or a bad motherboard, or...
Anyway, what can I do to track down the source of the error? I'm on FreeNAS-9.1.1-RELEASE-x64 (a752d35), in case that makes a difference. I'm not worried about data loss, because I've been exporting snapshots to an external drive regularly, and if this goes down for a week or two while I get parts, it won't be a big deal. But I would like to fix this.
zpool status -v outputs the following:
Code:
pool: Primary state: ONLINE status: One or more devices has experienced an unrecoverable error. An attempt was made to correct the error. Applications are unaffected. action: Determine if the device needs to be replaced, and clear the errors using 'zpool clear' or replace the device with 'zpool replace'. see: http://illumos.org/msg/ZFS-8000-9P scan: resilvered 121M in 0h6m with 0 errors on Thu May 15 21:06:08 2014 config: NAME STATE READ WRITE CKSUM Primary ONLINE 0 0 0 raidz1-0 ONLINE 0 0 0 gptid/125bca84-5550-11e3-b667-485b39a7a747 ONLINE 0 0 0 gptid/12d05cf3-5550-11e3-b667-485b39a7a747 ONLINE 0 0 0 gptid/133f9e29-5550-11e3-b667-485b39a7a747 ONLINE 0 0 0 gptid/139fe25f-5550-11e3-b667-485b39a7a747 ONLINE 0 0 3