SOLVED Error Replacing Disk

Thousandbuckle

Contributor
Joined
Jul 9, 2014
Messages
136
Hello all. I'm running TrueNAS-12.0-U7 on an 8-disk RAIDZ2 pool with WD Red 4TB drives. I had a failed disk and followed the instructions in the documentation: offline the disk, remove it, install the new disk, and run the replace with the force option checked. The first time I did this I got an error adding the drive, so I tried again, and now the drive does not even show up in the Member Disk list to choose from. I have rebooted the server, but that did not help. In the screenshot below you can see a "-" where there should be a disk number. I don't know what to do and could really use some help. The pool is degraded due to the loss of the one drive, and I would like to get back to my two-drive redundancy.

1644023124353.png
 

Thousandbuckle

One comment, which may or may not relate to the issue: the GPTID in the screenshot is the same GPTID that was showing before, while the failed drive was still running in the server. Is the GPTID tied to the controller port on the motherboard, or does it change when a new hard drive is added? It almost seems like the OS still sees the old drive and doesn't see the new one for some reason.
 

Thousandbuckle

Okay, so I'm not sure what is going on here, but when I run zpool status you can see the GPTID I highlighted (gptid/56eed232-6944-11e9-8416-0cc47aaace0c), which is the GPTID of the old failed drive I removed; it still shows as offline. If I run glabel status, I don't see that old GPTID from zpool status. This makes me think that the zpool still has an old GPTID entry (gptid/56eed232-6944-11e9-8416-0cc47aaace0c) that didn't get flushed or something? Not sure if that makes sense or not.

1644027846258.png
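
The comparison described above (labels in zpool status versus labels in glabel status) can be sketched in Python. As far as I understand it, a gptid is the GPT partition UUID stored on the disk itself, not something tied to a controller port, and the pool keeps the offlined member's label in its configuration until a replacement completes, so a label that appears in zpool status but not in glabel status is expected in this state. The function names and the abbreviated sample text below are hypothetical; only the 56eed232 label comes from this thread.

```python
import re

# Matches labels such as gptid/56eed232-6944-11e9-8416-0cc47aaace0c
GPTID_RE = re.compile(r"gptid/[0-9a-fA-F-]{36}")


def gptids_in(text: str) -> set:
    """Return every gptid/<uuid> label found in a blob of command output."""
    return set(GPTID_RE.findall(text))


def stale_pool_labels(zpool_status: str, glabel_status: str) -> set:
    """Labels the pool references that no attached disk currently provides."""
    return gptids_in(zpool_status) - gptids_in(glabel_status)


if __name__ == "__main__":
    # Hypothetical, heavily abbreviated command output for illustration only.
    sample_zpool = (
        "gptid/56eed232-6944-11e9-8416-0cc47aaace0c  OFFLINE\n"
        "gptid/11111111-2222-3333-4444-555555555555  ONLINE\n"
    )
    sample_glabel = "gptid/11111111-2222-3333-4444-555555555555  N/A  da0p2\n"
    for label in sorted(stale_pool_labels(sample_zpool, sample_glabel)):
        print("in pool config, but not on any attached disk:", label)
```

To use it on a real system, paste the text of zpool status and glabel status into the two arguments; the sketch only reads text and does not touch the pool.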
 

Samuel Tai

Never underestimate your own stupidity
Moderator
Joined
Apr 24, 2020
Messages
5,399
Did you pull the offline drive? Have you inserted the new drive?
 

Thousandbuckle

I just powered the server back up after checking the cables and connections to make sure nothing was loose. I can see the error message again now; it is posted below.
1644073260144.png


Error: Traceback (most recent call last):
  File "/usr/local/lib/python3.9/site-packages/middlewared/job.py", line 367, in run
    await self.future
  File "/usr/local/lib/python3.9/site-packages/middlewared/job.py", line 403, in __run_body
    rv = await self.method(*([self] + args))
  File "/usr/local/lib/python3.9/site-packages/middlewared/schema.py", line 975, in nf
    return await f(*args, **kwargs)
  File "/usr/local/lib/python3.9/site-packages/middlewared/plugins/pool_/replace_disk.py", line 93, in replace
    enc_disks = await self.middleware.call(
  File "/usr/local/lib/python3.9/site-packages/middlewared/main.py", line 1256, in call
    return await self._call(
  File "/usr/local/lib/python3.9/site-packages/middlewared/main.py", line 1213, in _call
    return await methodobj(*prepared_call.args)
  File "/usr/local/lib/python3.9/site-packages/middlewared/plugins/pool_/format_disks.py", line 56, in format_disks
    await asyncio_map(format_disk, disks.items(), limit=16)
  File "/usr/local/lib/python3.9/site-packages/middlewared/utils/asyncio_.py", line 16, in asyncio_map
    return await asyncio.gather(*futures)
  File "/usr/local/lib/python3.9/site-packages/middlewared/utils/asyncio_.py", line 13, in func
    return await real_func(arg)
  File "/usr/local/lib/python3.9/site-packages/middlewared/plugins/pool_/format_disks.py", line 29, in format_disk
    await self.middleware.call(
  File "/usr/local/lib/python3.9/site-packages/middlewared/main.py", line 1256, in call
    return await self._call(
  File "/usr/local/lib/python3.9/site-packages/middlewared/main.py", line 1224, in _call
    return await self.run_in_executor(prepared_call.executor, methodobj, *prepared_call.args)
  File "/usr/local/lib/python3.9/site-packages/middlewared/main.py", line 1128, in run_in_executor
    return await loop.run_in_executor(pool, functools.partial(method, *args, **kwargs))
  File "/usr/local/lib/python3.9/concurrent/futures/thread.py", line 52, in run
    result = self.fn(*self.args, **self.kwargs)
  File "/usr/local/lib/python3.9/site-packages/middlewared/plugins/disk_/format.py", line 25, in format
    raise CallError(f'Failed to wipe disk {disk}: {job.error}')
middlewared.service_exception.CallError: [EFAULT] Failed to wipe disk da7: [Errno 5] Input/output error
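
The useful part of the traceback is its last line. A minimal Python sketch for pulling out the device name and the errno from a line of that shape follows; the helper name parse_wipe_failure is hypothetical and not part of middlewared. Errno 5 is EIO, which generally means a read or write to the device itself failed, so it is worth checking the disk, cable, and controller before suspecting the pool configuration.

```python
import errno
import re

# Final line of the traceback above, copied verbatim.
LAST_LINE = (
    "middlewared.service_exception.CallError: [EFAULT] "
    "Failed to wipe disk da7: [Errno 5] Input/output error"
)


def parse_wipe_failure(line: str):
    """Return (disk, errno_number, errno_name), or None if the line does not match."""
    m = re.search(r"Failed to wipe disk ([^:\s]+): \[Errno (\d+)\]", line)
    if m is None:
        return None
    disk, code = m.group(1), int(m.group(2))
    # errno.errorcode maps numeric codes to symbolic names such as "EIO".
    return disk, code, errno.errorcode.get(code, "UNKNOWN")


if __name__ == "__main__":
    print(parse_wipe_failure(LAST_LINE))  # ('da7', 5, 'EIO')
```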
 

Samuel Tai

This error means the replacement disk already had a partition table. TrueNAS refuses to proceed in that case to prevent overwriting a disk in error.
Go to Storage->Disks and select the new disk, then click the Wipe button. A quick wipe will overwrite any existing partition table. You should be able to run the replacement procedure afterwards.
 

Thousandbuckle

Okay, it turned out to be a DOA hard drive, right out of the box. I put another drive in and it is behaving as it should.

This is the first time I have had a WD Red NAS drive that was dead out of the box.

Thank you for your feedback.
 

Samuel Tai

It happens to all spinning rust. Glad you're back to good.
 