Raid Zのディスクが異常になったので交換した

root からのメール

11/23

CurrentPendingSector

Device: /dev/sdd [SAT], 8 Currently unreadable (pending) sectors
OfflineUncorrectableSector

Device: /dev/sdd [SAT], 8 Offline uncorrectable sectors

11/24

FailedReadSmartData

Device: /dev/sdd [SAT], failed to read SMART Attribute Data

12/3

CurrentPendingSector

Device: /dev/sdd [SAT], 16 Currently unreadable (pending) sectors
OfflineUncorrectableSector

Device: /dev/sdd [SAT], 16 Offline uncorrectable sectors

12/4

FailedReadSmartData

Device: /dev/sdd [SAT], failed to read SMART Attribute Data

ZFS の状況確認

# zpool status
  pool: tank
 state: DEGRADED
status: One or more devices has been removed by the administrator.
        Sufficient replicas exist for the pool to continue functioning in a
        degraded state.
action: Online the device using 'zpool online' or replace the device with
        'zpool replace'.
  scan: resilvered 55.5G in 0 days 00:15:39 with 0 errors on Thu Dec  3 16:27:00 2020
config:

        NAME                                 STATE     READ WRITE CKSUM
        tank                                 DEGRADED     0     0     0
          raidz1-0                           DEGRADED     0     0     0
            ata-ST4000DM004-2CV104_ZFN15XX1  ONLINE       0     0     0
            ata-ST4000DM004-2CV104_ZFN15XX2  REMOVED      0     0     0
            ata-ST4000DM004-2CV104_ZFN15XX3  ONLINE       0     0     0
            ata-ST4000DM004-2CV104_ZFN15XX4  ONLINE       0     0     0
            ata-ST4000DM004-2CV104_ZFN15XXr  ONLINE       0     0     0

errors: No known data errors

隔離されてる。
2年半使っていて保証切れてるし、交換(4TB->8TB)することに。
まずは注文

SEAGATE 3.5インチ内蔵HDD8TB ST8000DM004 (8TB SATA) 代理店保証1年
価格:13816円(税込、送料別) (2020/12/9時点)楽天で購入

届いたら、電源を切り、HDD入れ替え
IDを確認

# ls -lh /dev/disk/by-id/

zpoolの状況を確認

$ zpool status
  pool: tank
 state: DEGRADED
status: One or more devices could not be used because the label is missing or
        invalid.  Sufficient replicas exist for the pool to continue
        functioning in a degraded state.
action: Replace the device using 'zpool replace'.
   see: http://zfsonlinux.org/msg/ZFS-8000-4J
  scan: resilvered 55.5G in 0 days 00:15:39 with 0 errors on Thu Dec  3 16:27:00 2020
config:

        NAME                                 STATE     READ WRITE CKSUM
        tank                                 DEGRADED     0     0     0
          raidz1-0                           DEGRADED     0     0     0
            ata-ST4000DM004-2CV104_ZFN15XX1  ONLINE       0     0     0
            9999999999999994100              UNAVAIL      0     0     0  was /dev/disk/by-id/ata-ST4000DM004-2CV104_ZFN15XX2-part1
            ata-ST4000DM004-2CV104_ZFN15XX3  ONLINE       0     0     0
            ata-ST4000DM004-2CV104_ZFN15XX4  ONLINE       0     0     0
            ata-ST4000DM004-2CV104_ZFN15XX5  ONLINE       0     0     0

errors: No known data errors

差し替え

# zpool replace tank ata-ST4000DM004-2CV104_ZFN15XX2 ata-ST8000DM004-2CX188_ZCTXXXXX

再同期中の様子

# zpool status
  pool: tank
 state: DEGRADED
status: One or more devices is currently being resilvered.  The pool will
        continue to function, possibly in a degraded state.
action: Wait for the resilver to complete.
  scan: resilver in progress since Tue Dec  8 17:23:26 2020
        186G scanned at 9.80G/s, 9.62M issued at 519K/s, 12.3T total
        0B resilvered, 0.00% done, no estimated completion time
config:

        NAME                                   STATE     READ WRITE CKSUM
        tank                                   DEGRADED     0     0     0
          raidz1-0                             DEGRADED     0     0     0
            ata-ST4000DM004-2CV104_ZFN15XX1    ONLINE       0     0     0
            replacing-1                        DEGRADED     0     0     0
              9999999999999994100              UNAVAIL      0     0     0  was /dev/disk/by-id/ata-ST4000DM004-2CV104_ZFN15XX2-part1
              ata-ST8000DM004-2CX188_ZCTXXXXX  ONLINE       0     0     0
            ata-ST4000DM004-2CV104_ZFN15XX3    ONLINE       0     0     0
            ata-ST4000DM004-2CV104_ZFN15XX4    ONLINE       0     0     0
            ata-ST4000DM004-2CV104_ZFN15XX5    ONLINE       0     0     0

errors: No known data errors

今現在、22時間経っていますが、半分くらいの時間は書き込みも読み込みも行っており、
こんな感じで半分が進んだ程度。

$ zpool status
  pool: tank
 state: DEGRADED
status: One or more devices is currently being resilvered.  The pool will
        continue to function, possibly in a degraded state.
action: Wait for the resilver to complete.
  scan: resilver in progress since Tue Dec  8 17:23:26 2020
        8.79T scanned at 116M/s, 8.37T issued at 110M/s, 12.3T total
        1.67T resilvered, 68.11% done, 0 days 10:22:24 to go

その後、再同期完了後にpoolを確認すると、エラーの表示が

$ zpool status -x
  pool: tank
 state: ONLINE
status: One or more devices has experienced an unrecoverable error.  An
        attempt was made to correct the error.  Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
        using 'zpool clear' or replace the device with 'zpool replace'.
   see: http://zfsonlinux.org/msg/ZFS-8000-9P
  scan: resilvered 2.43T in 1 days 07:34:55 with 0 errors on Thu Dec 10 00:58:21 2020

エラーをclearすることに

$ sudo zpool clear tank
$ zpool status -x
all pools are healthy

一回再起動しとこう。。

投稿者:

Kenzo

サイト管理人です。 お仕事募集中です。 システム開発を生業としています。 どうぞ宜しくお願い致します。

コメントを残す

メールアドレスが公開されることはありません。 が付いている欄は必須項目です

このサイトはスパムを低減するために Akismet を使っています。コメントデータの処理方法の詳細はこちらをご覧ください