s.m.a.r.t test results help

jcizzo

Explorer
Joined
Jan 20, 2023
Messages
79
hey folks, i'm hoping the pros can shed some light on how to interpret these.

this morning i logged into my TNC nas and noticed an 'alerts' stating that one of my drives experienced an error... the exact error escapes me and i dismissed the alerts, not sure how to retrieve them..

i ran a smart test (short) and the results came back as 'success'. also, the dashboard shows the pool as being fine, all green.. i've replaced drives in the past, and along with the 'alert' the pool will show as degraded.

smart test results state the following:
remaining: 0
lifetime: 6779
Error: N/A

can someone instruct me on how to interpret this?

TNC 13 (latest stable)
mirrored boot drives (ssd)
xeon e3-1240L w/ 32Gig ECC.
supermicro x11ssh-f
media pool that shot me the alert is comprised of 5x4TB spinners, attached with an lsi 9200-based HBA in I.T. mode

Thanks!!
 

joeschmuck

Old Man
Moderator
Joined
May 28, 2011
Messages
10,996
hey folks, i'm hoping the pros can shed some light on how to interpret these.
Nope, sure can't.

the exact error escapes me
How would you propose we help?

Okay, I'm being a jerk, not a big jerk but a jerk nonetheless however you gave no real error information to work with.

Here are the questions which need an answer:
1) What drive or drives was the alarm for? In the GUI go to the Alerts Bell and click, you have the option to show the dismissed messages. Post those that pertain to this issue.
2) What was the alarm message? (A screen capture of step 1 would help)
3) Run the command smartctl -x /dev/driveident for each suspect drive (or all drives) where "driveident is ada? or da?" And place the data in code brackets. Please do not cut off any of the data as so many people unfortunately do thinking they know what data is pertinent. I'm not saying you would do that but some folks do and it just delays providing timely advice.
4) If your pool is showing as degraded then that may not be an actual drive failure so post the output in code brackets of zpool status -v

With this information we should be able to formulate some sort of answer or we may ask you a few more questions to narrow this issue down.

One thing we have been seeing a lot of is pools becoming degraded due to using SMR drives. If you have SMR drives, this is likely your problem, however that is a guess based on the fact the you said the word "degraded", but it's a pull out of my butt guess.

Let's hope it's an easy fix or non-issue.
 

jcizzo

Explorer
Joined
Jan 20, 2023
Messages
79
Nope, sure can't.


How would you propose we help?

Okay, I'm being a jerk, not a big jerk but a jerk nonetheless however you gave no real error information to work with.

Here are the questions which need an answer:
1) What drive or drives was the alarm for? In the GUI go to the Alerts Bell and click, you have the option to show the dismissed messages. Post those that pertain to this issue.
2) What was the alarm message? (A screen capture of step 1 would help)
3) Run the command smartctl -x /dev/driveident for each suspect drive (or all drives) where "driveident is ada? or da?" And place the data in code brackets. Please do not cut off any of the data as so many people unfortunately do thinking they know what data is pertinent. I'm not saying you would do that but some folks do and it just delays providing timely advice.
4) If your pool is showing as degraded then that may not be an actual drive failure so post the output in code brackets of zpool status -v

With this information we should be able to formulate some sort of answer or we may ask you a few more questions to narrow this issue down.

One thing we have been seeing a lot of is pools becoming degraded due to using SMR drives. If you have SMR drives, this is likely your problem, however that is a guess based on the fact the you said the word "degraded", but it's a pull out of my butt guess.

Let's hope it's an easy fix or non-issue.
the reason why i was questioning things is because, despite my pool not being labelled as 'degraded', i had the alarm but the option to show dismissed messages isn't there. i know the button but i must've deleted them instead of just hiding them.

there is only one drive showing the problem, it wasn't the whole pool. the pool was not/is not showing up as degraded, it was just the alarm.

regarding the command:
smartctl -x /dev/driveident

do i type in or does something else go in it's place? the driveident is da3, i know that means nothing to you. everything seems to be running fine, it was just that alert but ever since i 'dismissed' the alert, all has been clear.
 

joeschmuck

Old Man
Moderator
Joined
May 28, 2011
Messages
10,996
If the drive is da3, then the command would be smartctl -x /dev/da3
Just post the results. If you are not using an SSH application and plan to use the GUI, here is a much easier command to get the data out of the machine:smartctl -x /dev/da3 > /tmp/da3.txt and then to email it to yourself mail -s "subject line" your_email_address.com < /tmp/da3.txt

My example would be:
mail -s "da3 SMART Data" joeschmuck2023@hotmail.com < /tmp/da3.txt

The file you created called /tmp/da3.txt is in a temporary RAM directory and is deleted upon reboot so cleanup is very easy. Once you have the data in your email, it will be easy to cut/paste it into the forum page.
 
Top