Pool state degraded

Victor87

Dabbler
Joined
Aug 28, 2022
Messages
16
Hello all

I need some help to understand why one of my Crucial MX500 2Tb is starting to fail. Have only few months and around 6Tb writing on it.

Attached are also the smart results



Thank you
 

Attachments

  • Screenshot_20230121_112828_com.android.email.jpg
    Screenshot_20230121_112828_com.android.email.jpg
    119.8 KB · Views: 148
  • Screenshot_20230121_112834_com.android.email.jpg
    Screenshot_20230121_112834_com.android.email.jpg
    110.8 KB · Views: 142
  • Screenshot_20230121_112904_com.android.email.jpg
    Screenshot_20230121_112904_com.android.email.jpg
    117.3 KB · Views: 145

NugentS

MVP
Joined
Apr 16, 2020
Messages
2,947
Take it out, put it in a windows machine and run crucial tests on it - see what happens
 

Victor87

Dabbler
Joined
Aug 28, 2022
Messages
16
Take it out, put it in a windows machine and run crucial tests on it - see what happens
Thanks for answering.
Is heavy... My server is at 2000 km far away.
But on smart result the ssd appears ok or do you see some fails? Possible to be some data corruption?
The data on this server is not so important. If it fails doesn't matter. My question is why is creating this failure because I use same ssd's on other server where are important stuff.
 

Attachments

  • smartctl -a dev sda.txt
    7.1 KB · Views: 193
  • smartctl -x dev sda.txt
    16.7 KB · Views: 180
Last edited:

Victor87

Dabbler
Joined
Aug 28, 2022
Messages
16
Take it out, put it in a windows machine and run crucial tests on it - see what happens
On the ends looks like truenas is reporting completely wrong the failure of my ssd.
What should I do next to fix the degraded pool error?
 

Attachments

  • IMG-20230122-WA0000.jpg
    IMG-20230122-WA0000.jpg
    210.3 KB · Views: 122
  • IMG-20230122-WA0001.jpg
    IMG-20230122-WA0001.jpg
    254.5 KB · Views: 125
  • IMG-20230122-WA0002.jpg
    IMG-20230122-WA0002.jpg
    176.5 KB · Views: 132

artlessknave

Wizard
Joined
Oct 29, 2016
Messages
1,506
sometimes drives just fail. this is why RAID(z) exists, to mitigate the known fact that drives just fail.

if ZFS reports a drive as returning bad data, smart is irelevant; the drive returns bad data. smart failures are merely one way to predict future drive failures, not the only way to know a drive is failing.
this can be caused by bad cables or other connections, or a bad storage controlller, and you will need to investigate all of these possibilities.
 

Victor87

Dabbler
Joined
Aug 28, 2022
Messages
16
sometimes drives just fail. this is why RAID(z) exists, to mitigate the known fact that drives just fail.

if ZFS reports a drive as returning bad data, smart is irelevant; the drive returns bad data. smart failures are merely one way to predict future drive failures, not the only way to know a drive is failing.
this can be caused by bad cables or other connections, or a bad storage controlller, and you will need to investigate all of these possibilities.
Hi

Everything will fail sooner or later :)
I don't use raidz there because is just an server lab to test and learn stuff on it. Sata cable was replaced today and another scrubbing doesn't fix the error. I will leave it like it is to see what's happens in the future. Everything still works great at the moment.
 

artlessknave

Wizard
Joined
Oct 29, 2016
Messages
1,506
a scrub doesnt fix errors, it detects them. zfs will automatically fix any errors it can *if* there is redundancy.

you can zpool clear to clear the errors and see if more occur
 

Victor87

Dabbler
Joined
Aug 28, 2022
Messages
16
a scrub doesnt fix errors, it detects them. zfs will automatically fix any errors it can *if* there is redundancy.

you can zpool clear to clear the errors and see if more occur
Error gone. Should I run new scrub?

Thank you very much for your advice
 

Attachments

  • Screenshot_20230122_221826_com.brave.browser.jpg
    Screenshot_20230122_221826_com.brave.browser.jpg
    21.7 KB · Views: 131
Last edited:

mvipe01

Dabbler
Joined
Feb 1, 2022
Messages
18
Weird things can happen to drives, I have had a Samsung drive give errors, act perfectly fine and pass all tests and then fail. You also have some unexpected power loss counts, I would guess that a potential power outage could also cause it to throw an error and drop out.
 

Victor87

Dabbler
Joined
Aug 28, 2022
Messages
16
Weird things can happen to drives, I have had a Samsung drive give errors, act perfectly fine and pass all tests and then fail. You also have some unexpected power loss counts, I would guess that a potential power outage could also cause it to throw an error and drop out.
I run 4 scrub test in the past days. Looks like everything is fine. No errors anymore on that ssd. Now I get only one on checksum for the second ssd. I believe is some file corruption. When I have time I will rebuild the pool again.
 
Top