とうとう壊れた

原因はたぶんゆるゆるになってたSATAケーブルな気がするんだけど,raidzがDEGRADEDになったので交換+再構築を初体験.

こうなるまでの途中経過が面白くて,徐々に徐々にzpool statusのREADとCHKSUMのエラーの数が増えていってた.

 pool: share
state: ONLINE
status: One or more devices has experienced an unrecoverable error.  An
	attempt was made to correct the error.  Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
	using 'zpool clear' or replace the device with 'zpool replace'.
  see: http://www.sun.com/msg/ZFS-8000-9P
scrub: none requested
config:

	NAME        STATE     READ WRITE CKSUM
	share       ONLINE       0     0     0
	  raidz1    ONLINE       0     0     0
	    ad7     ONLINE       0     0     0
	    ad10    ONLINE      27     0    17
	    ad8     ONLINE       0     0     0

errors: No known data errors

そろそろやばいから換えのディスクを購入しようかなと思っていた矢先に,唯一エラーを増殖させていたディスクがREMOVEDされて,raidzがDEGRADEDに.

 pool: share
state: DEGRADED
status: The pool is formatted using an older on-disk format.  The pool can
	still be used, but some features are unavailable.
action: Upgrade the pool using 'zpool upgrade'.  Once this is done, the
	pool will no longer be accessible on older software versions.
scrub: none requested
config:

	NAME        STATE     READ WRITE CKSUM
	share       DEGRADED     0     0     0
	  raidz1    DEGRADED     0     0     0
	    ad7     ONLINE       0     0     0
	    ad10    REMOVED      0     0     0
	    ad8     ONLINE       0     0     0

errors: No known data errors

この状況でも,次にやるべき操作としてzfsをupgradeしましょうというソフトはどうかと思うのだが,それはさくっと無視して,電源とHDDを購入してきて入れ替え.最終的に1.5TB*3のraidzに組み換えることを狙っているので,samsungの1.5TB(HD154UI)を購入(5400rpm以外のHDDを購入する気は無い).手持ちにもう一つあるから,もうひとつ購入すると,バックアップを考えなければ1.5TB*3で3.0TBのraidzの作成が可能になる.電源は玄人志向のKRPW-V600Wを購入.

で,現在再構築中

  pool: share
 state: DEGRADED
status: One or more devices is currently being resilvered.  The pool will
	continue to function, possibly in a degraded state.
action: Wait for the resilver to complete.
 scrub: resilver in progress for 1h35m, 39.11% done, 2h28m to go
config:

	NAME           STATE     READ WRITE CKSUM
	share          DEGRADED     0     0     0
	  raidz1       DEGRADED     0     0     0
	    ad7        ONLINE       0     0     0  132M resilvered
	    replacing  DEGRADED     0     0     0
	      ad10     REMOVED      0     0     0
	      ad9      ONLINE       0     0     0  272G resilvered
	    ad8        ONLINE       0     0     0  132M resilvered

errors: No known data errors

だいたい,1.4TB使用した状態のraidzの再構築に4時間ほどかかる模様.

しかし,404 Blog Not Found:tips - zfs raidz修復の実際より倍くらい速いのが気になる.

追記

 scrub: resilver completed after 3h56m with 0 errors on Thu Jun 18 00:02:48 2009

結局,4時間を切って3時間56分で終了.すごい.ついでに,poolもupgrade.