all my drives suddenly DEGRADED after months of working fine

TheThomen

Dabbler
Joined
Jan 26, 2021
Messages
18

CRITICAL​

Pool Shelf-Z2 state is DEGRADED: One or more devices has experienced an error resulting in data corruption. Applications may be affected.
The following devices are not healthy:
  • Disk ST4000DM 000-1F2168 SM Z3019W0G is DEGRADED
  • Disk ST4000DM 000-1F2168 SA Z30194ST is DEGRADED
  • Disk ST4000DM 000-1F2168 SM Z304VQLY is DEGRADED
  • Disk ST4000DM 000-1F2168 SM Z30195TE is DEGRADED
  • Disk ST4000DM 000-1F2168 SM Z30125R5 is DEGRADED
  • Disk ST4000DM 000-1F2168 SM Z3012WRX is DEGRADED
  • Disk ST4000DM 000-1F2168 SM Z3019SPT is DEGRADED
  • Disk ST4000DM 000-1F2168 SA Z30194L4 is DEGRADED



I'm using an R320 with an LSI MegaRAID connected to a NetApp shelf with eight 4 TB drives. I'm not sure why, but the pool works fine and everything is running fine, yet the status shows all the drives as degraded. I tried a reboot, but no joy.
 

Samuel Tai

Never underestimate your own stupidity
Moderator
Joined
Apr 24, 2020
Messages
5,399
Is your MegaRAID flashed to IT mode? Or were you running in JBOD mode?

 

TheThomen

Dabbler
Joined
Jan 26, 2021
Messages
18

Hi Samuel,
1654106569468.png

I flashed it to IT mode about 6 months ago when it was in another system (that system never had this issue).
Please see the results in the image. The card is an LSI SAS 9200-8e.
 

TheThomen

Dabbler
Joined
Jan 26, 2021
Messages
18
This is my setup: an R320 with an E5-2450L, and the LSI connected to the shelf using 2 cables.
20220519_111559.jpg
 

Redcoat

MVP
Joined
Feb 18, 2014
Messages
2,925
Joined
Jun 2, 2019
Messages
591
Desktop DM-SMR drives + ZFS = disaster.

 

Samuel Tai

Never underestimate your own stupidity
Moderator
Joined
Apr 24, 2020
Messages
5,399
any ideas people?
That spec sheet @Redcoat linked to indicates this model changed from CMR (which is OK for ZFS) to SMR (which is bad for ZFS) after 13 May 2020. If you purchased your drives after then, odds are your pool is constructed solely of the SMR drives.
 

neofusion

Contributor
Joined
Apr 2, 2022
Messages
159
The document is listed as having been revised on 13 May 2020.
It does not necessarily mean that they started rolling out the SMR-drives at that point in time; that may have happened before or after that date.

In other words, you would need to test the drives to be sure even if you bought them before that date.
I am not sure how to reliably do that.
 

TheThomen

Dabbler
Joined
Jan 26, 2021
Messages
18
Gents, after checking, the drives were manufactured in 2015, with 16064 being the newest; the rest are 14xxx.
 

Samuel Tai

Never underestimate your own stupidity
Moderator
Joined
Apr 24, 2020
Messages
5,399
Yes, they're all SMR. They're also all desktop drives, which aren't suitable for NAS applications. Seagate's IronWolf line is their NAS line.
 

neofusion

Contributor
Joined
Apr 2, 2022
Messages
159
Desktop drives, sure, but what is your source as to whether they are SMR or not?
ST4000DM004 is SMR but I haven't been able to find anything showing that ST4000DM000 is, quite the opposite actually.
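
Since the question turns on the exact model suffix (DM000 versus DM004), it may help to read the model string straight from each drive. Below is a minimal sketch, assuming `smartctl` is available on the system and that the drives are SATA disks that print a "Device Model:" line. The function name `model_report` and the device paths in the example are hypothetical, not something taken from this thread.

```shell
#!/bin/sh
# Print the model string that each disk reports via `smartctl -i`.
# Device paths are passed as arguments; the ones in the example are assumed.
model_report() {
    for dev in "$@"; do
        # SATA drives list their model on a "Device Model:" line.
        model=$(smartctl -i "$dev" 2>/dev/null |
            sed -n 's/^Device Model:[[:space:]]*//p')
        # Fall back to "unknown" when no such line was found.
        printf '%s: %s\n' "$dev" "${model:-unknown}"
    done
}

# Example (run as root; adjust the device paths to your system):
# model_report /dev/da0 /dev/da1
```

This only reports what the drive says about itself. It does not prove whether a given model is SMR or CMR.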
 
Joined
Jun 2, 2019
Messages
591
Seagate's own FAQ post.


Screen Shot 2022-06-12 at 10.17.17 AM.png


Data sheet states TGMR, which from what I have read is a form of SMR


Screen Shot 2022-06-12 at 10.24.34 AM.png


You apparently chose to buy used drives that are out of warranty.

Screen Shot 2022-06-12 at 10.35.15 AM.png


Bottom line, you put used desktop SMR drives in a server application. Accept your error and buy enterprise class NAS CMR drives.
 
Joined
Oct 22, 2019
Messages
3,641
@TheThomen, are you using TrueNAS Core or SCALE?

If Core, do any of the drives report they support trim?

diskinfo -v ada0 | grep TRIM

Repeat the above for each drive, such as ada1, ada2, ada3, etc.

(You have to run the command as the root user or with sudo.)
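
To avoid typing the command once per drive, the same check can be wrapped in a loop. This is a rough sketch only: the function name `trim_report` is made up for illustration, and the device names in the example are assumptions (drives behind an HBA may show up under different names, such as da0).

```shell
#!/bin/sh
# Print the TRIM-related line from `diskinfo -v` for each named disk.
# Device names are passed as arguments; ada0..ada7 below are assumed names.
trim_report() {
    for disk in "$@"; do
        # Prefix each result with the device name so lines can be told apart.
        printf '%s: ' "$disk"
        # grep exits non-zero when no line matches; print a placeholder then.
        diskinfo -v "$disk" 2>/dev/null | grep TRIM || echo "no TRIM line found"
    done
}

# Example (run as root on TrueNAS Core):
# trim_report ada0 ada1 ada2 ada3 ada4 ada5 ada6 ada7
```

Each output line still needs to be read by eye, since the matching line says either Yes or No for TRIM support.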
 
Joined
Oct 22, 2019
Messages
3,641
Some SMR drives will report they support "TRIM".

Based on the fact that none of your drives report they support "TRIM", and based on the purchase date of your drives, it's looking unlikely that they are SMR drives.

So perhaps the culprit behind all the drives suddenly showing as degraded is the LSI MegaRAID controller card?
 

TheThomen

Dabbler
Joined
Jan 26, 2021
Messages
18
Like corrupt firmware? Should I reflash?
 

NugentS

MVP
Joined
Apr 16, 2020
Messages
2,947
Have you got decent cooling across the card?
Also, please post the output of zpool status. It might need a -v in there as well.
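
If the full output is long, something like the following could pull out just the device rows that show errors. It is a sketch under assumptions: it expects the usual NAME / STATE / READ / WRITE / CKSUM column layout, and the helper name `rows_with_errors` is made up for illustration. The pool name in the example comes from the alert in the first post.

```shell
#!/bin/sh
# Read `zpool status` output on stdin and print only the device rows whose
# READ, WRITE, or CKSUM counter is non-zero.
rows_with_errors() {
    awk 'NF >= 5 && $2 ~ /^(ONLINE|DEGRADED|FAULTED|OFFLINE|UNAVAIL|REMOVED)$/ &&
         ($3 != 0 || $4 != 0 || $5 != 0) { print $1, $2, $3, $4, $5 }'
}

# Example (run as root):
# zpool status -v Shelf-Z2 | rows_with_errors
```

The unfiltered output is still worth posting, since the status and action text above the device table often explains why a device was marked degraded.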
 