RAID – ZPOOL Disk Fault – Where Am I?

I checked the status of one of my zpools today and found the following:

root@server: zpool status -v myPool


  pool: myPool
 state: ONLINE
status: One or more devices has experienced an unrecoverable error. An
        attempt was made to correct the error. Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
        using 'zpool clear' or replace the device with 'zpool replace'.
   see: http://www.sun.com/msg/ZFS-8000-9P
 scrub: resilver completed after 3h6m with 0 errors on Tue Sep 28 11:15:11 2010
config:

        NAME           STATE     READ WRITE CKSUM
        myPool         ONLINE       0     0     0
          raidz1       ONLINE       0     0     0
            c6t7d0     ONLINE       0     0     0
            c6t8d0     ONLINE       0     0     0
            spare      ONLINE       0     0     0
              c6t9d0   ONLINE      54     0     0
              c6t36d0  ONLINE       0     0     0
            c6t10d0    ONLINE       0     0     0
            c6t11d0    ONLINE       0     0     0
            c6t12d0    ONLINE       0     0     0
        spares
          c6t36d0      INUSE     currently in use
          c6t37d0      AVAIL
          c6t38d0      AVAIL
c6t38d0 AVAIL

errors: No known data errors

From what I can see, c6t9d0 has logged 54 errors (the READ column in the output above). The pool seems to have failed over automatically to the spare disk c6t36d0 and is now using it.
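
For anyone reading a similar status dump, here is a minimal sketch of pulling out the devices whose READ, WRITE, or CKSUM counters are nonzero. The helper name find_faulted is made up for illustration, and the sketch assumes the counters are plain integers in columns 3 to 5, as in the output above; lines that do not match that shape are skipped.

```shell
# find_faulted (hypothetical helper): read `zpool status` text on stdin and
# print each device whose READ, WRITE, or CKSUM counter is nonzero.
find_faulted() {
  awk 'NF >= 5 && $3 ~ /^[0-9]+$/ && $4 ~ /^[0-9]+$/ && $5 ~ /^[0-9]+$/ &&
       ($3 + $4 + $5) > 0 { print $1, "read=" $3, "write=" $4, "cksum=" $5 }'
}

# Sample lines copied from the status output above.
sample='NAME STATE READ WRITE CKSUM
myPool ONLINE 0 0 0
c6t9d0 ONLINE 54 0 0
c6t36d0 ONLINE 0 0 0'

printf '%s\n' "$sample" | find_faulted
# prints: c6t9d0 read=54 write=0 cksum=0
```

On a live system the same function could be fed directly, for example `zpool status myPool | find_faulted`.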

My question is: where does this leave me? Yes, the “action” text tells me to determine whether the disk needs to be replaced, but is this disk still in use? Can I replace or remove it?

Any explanation would be greatly appreciated, as I am very new to this :)

Update: I followed C10k Consulting's advice, namely detaching the disk:

zpool detach myPool c6t9d0

and adding it back as a spare:

zpool add myPool spare c6t9d0

All seems well. The new state of my zpool is:

root@server: zpool status -v myPool
  pool: myPool
 state: ONLINE
 scrub: resilver completed after 3h6m with 0 errors on Tue Sep 28 11:15:11 2010
config:

        NAME         STATE     READ WRITE CKSUM
        myPool       ONLINE       0     0     0
          raidz1     ONLINE       0     0     0
            c6t7d0   ONLINE       0     0     0
            c6t8d0   ONLINE       0     0     0
            c6t36d0  ONLINE       0     0     0
            c6t10d0  ONLINE       0     0     0
            c6t11d0  ONLINE       0     0     0
            c6t12d0  ONLINE       0     0     0
        spares
          c6t37d0    AVAIL
          c6t38d0    AVAIL
          c6t9d0     AVAIL

errors: No known data errors

Thanks for your help, C10k Consulting :)

zpool remove myPool c6t37d0

zpool replace myPool c6t9d0 c6t37d0

This frees one of your hot spares (c6t37d0) for use as a normal disk, then replaces the bad disk (c6t9d0) with the now-free disk (c6t37d0).
Once everything is happy, replace c6t9d0, then:

zpool add myPool spare c6t9d0

And you will be back to a happy setup, with 3 hot spares available.
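
The order of those three steps matters, so here is a small dry-run sketch that only prints the commands for review and does not touch the pool. The function name plan_swap and its argument order (pool, failing disk, spare to promote) are assumptions made for illustration; the zpool subcommands themselves are the ones described in this answer.

```shell
# plan_swap (hypothetical helper): print, but do not run, the spare-swap
# sequence. Arguments: pool name, failing disk, hot spare to promote.
plan_swap() {
  pool=$1
  bad=$2
  spare=$3
  echo "zpool remove $pool $spare"        # take the disk out of the spares list
  echo "zpool replace $pool $bad $spare"  # resilver onto it in place of the bad disk
  echo "zpool add $pool spare $bad"       # only after the bad disk has been swapped out
}

plan_swap myPool c6t9d0 c6t37d0
```

Printing first leaves room to check the device names against the `zpool status` output before running anything by hand.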

Or you can simply pull c6t9d0 and let c6t36d0 take its place:

zpool detach myPool c6t9d0

Then replace c6t9d0 and add it back as a spare.
