HDD ERROR

Some errors showed up, so I'm writing down what happened.

Dec 11 08:10:09 fsrv1 kernel: ahcich1: Timeout on slot 30 port 0
Dec 11 08:10:09 fsrv1 kernel: ahcich1: is 00000000 cs 00000000 ss c0000000 rs c0000000 tfd 40 serr 00000000 cmd 0000df17
Dec 11 08:10:09 fsrv1 kernel: (ada1:ahcich1:0:0:0): READ_FPDMA_QUEUED. ACB: 60 10 88 72 80 40 91 00 00 00 00 00
Dec 11 08:10:09 fsrv1 kernel: (ada1:ahcich1:0:0:0): CAM status: Command timeout
Dec 11 08:10:09 fsrv1 kernel: (ada1:ahcich1:0:0:0): Retrying command, 3 more tries remain
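
To see whether timeouts like the ones above are a one-off or keep piling up on the same channel, a small filter over the syslog output can help. This is only a sketch: the function name `count_ahci_timeouts` is made up for this note, and it assumes the log lines have the same layout as the ones pasted above.

```shell
#!/bin/sh
# Count "Timeout on slot" events per AHCI channel.
# Reads syslog-style lines on stdin; the field positions assume the
# "Mon DD HH:MM:SS host kernel: ahcichN: ..." layout shown in this post.
count_ahci_timeouts() {
    awk '/kernel: ahcich[0-9]+: Timeout on slot/ {
             ch = $6              # e.g. "ahcich1:"
             sub(/:$/, "", ch)    # drop the trailing colon
             n[ch]++
         }
         END { for (c in n) print c, n[c] }' | sort
}

# Demo with two lines taken from the log above.
printf '%s\n' \
    'Dec 11 08:10:09 fsrv1 kernel: ahcich1: Timeout on slot 30 port 0' \
    'Dec 11 08:10:09 fsrv1 kernel: (ada1:ahcich1:0:0:0): CAM status: Command timeout' \
    | count_ahci_timeouts
```

On a real box this would be fed from the log file, for example `count_ahci_timeouts < /var/log/messages` (the path is an assumption and depends on the syslog setup).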

I kicked off a self-test, forgot about it, and came back to this lol

Dec 12 04:18:52 fsrv1 kernel: ahcich1: AHCI reset: device not ready after 31000ms (tfd = 00000080)
Dec 12 04:19:14 fsrv1 kernel: ada1 at ahcich1 bus 0 scbus1 target 0 lun 0
Dec 12 04:19:14 fsrv1 kernel: ada1: <WDC WD20EZRX-00D8PB0 80.00A80> s/n WD-WMC4N0EASTST detached
Dec 12 04:19:15 fsrv1 kernel: (ada1:ahcich1:0:0:0): Periph destroyed
Dec 12 04:19:15 fsrv1 ZFS[3780]: vdev I/O failure, zpool=$zdata path=$/dev/gpt/WD-WMC4N0EASTST offset=$270336 size=$8192 error=$6
Dec 12 04:19:15 fsrv1 ZFS[3781]: vdev I/O failure, zpool=$zdata path=$/dev/gpt/WD-WMC4N0EASTST offset=$2000398327808 size=$8192 error=$6
Dec 12 04:19:15 fsrv1 ZFS[3782]: vdev I/O failure, zpool=$zdata path=$/dev/gpt/WD-WMC4N0EASTST offset=$2000398589952 size=$8192 error=$6
Dec 12 04:19:15 fsrv1 ZFS[3783]: vdev probe failure, zpool=$zdata path=$/dev/gpt/WD-WMC4N0EASTST
Dec 12 04:19:15 fsrv1 ZFS[3784]: vdev state changed, pool_guid=$12285572291750150482 vdev_guid=$3779805168453648840
Dec 12 04:19:15 fsrv1 ZFS[3785]: vdev is removed, pool_guid=$12285572291750150482 vdev_guid=$3779805168453648840
Dec 12 04:19:15 fsrv1 ZFS[3786]: vdev state changed, pool_guid=$12285572291750150482 vdev_guid=$3779805168453648840
Dec 12 04:19:20 fsrv1 kernel: (aprobe0:ahcich1:0:0:0): NOP FLUSHQUEUE. ACB: 00 00 00 00 00 00 00 00 00 00 00 00
Dec 12 04:19:20 fsrv1 kernel: (aprobe0:ahcich1:0:0:0): CAM status: ATA Status Error
Dec 12 04:19:20 fsrv1 kernel: (aprobe0:ahcich1:0:0:0): ATA status: d1 (BSY DRDY SERV ERR), error: 04 (ABRT )
Dec 12 04:19:20 fsrv1 kernel: (aprobe0:ahcich1:0:0:0): RES: d1 04 ff ff ff ff ff ff ff ff ff
Dec 12 04:19:20 fsrv1 kernel: (aprobe0:ahcich1:0:0:0): Error 5, Retries exhausted
Dec 12 04:19:20 fsrv1 kernel: (aprobe0:ahcich1:0:0:0): NOP FLUSHQUEUE. ACB: 00 00 00 00 00 00 00 00 00 00 00 00
Dec 12 04:19:20 fsrv1 kernel: (aprobe0:ahcich1:0:0:0): CAM status: ATA Status Error
Dec 12 04:19:20 fsrv1 kernel: (aprobe0:ahcich1:0:0:0): ATA status: d1 (BSY DRDY SERV ERR), error: 04 (ABRT )
Dec 12 04:19:20 fsrv1 kernel: (aprobe0:ahcich1:0:0:0): RES: d1 04 ff ff ff ff ff ff ff ff ff
Dec 12 04:19:20 fsrv1 kernel: (aprobe0:ahcich1:0:0:0): Error 5, Retries exhausted
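
The kernel already spells out the ATA status byte (`d1` as `BSY DRDY SERV ERR`), but the raw `tfd` values in the AHCI lines are not decoded. A rough decoder, assuming the usual ATA status bit layout and using the same names the kernel prints; `decode_ata_status` is a helper invented for this note:

```shell
#!/bin/sh
# Decode an ATA status byte given as two hex digits (no 0x prefix).
# Bit names follow the labels seen in the kernel messages above.
decode_ata_status() {
    val=$((0x$1))
    out=""
    for pair in 128:BSY 64:DRDY 32:DF 16:SERV 8:DRQ 4:CORR 2:IDX 1:ERR; do
        bit=${pair%%:*}
        name=${pair#*:}
        if [ $((val & bit)) -ne 0 ]; then
            out="${out:+$out }$name"
        fi
    done
    echo "$out"
}

decode_ata_status d1   # status byte from the aprobe0 lines
decode_ata_status 80   # low byte of tfd = 00000080 in the AHCI reset line
```

The first call reproduces the kernel's own decoding, and the second shows the drive was still reporting BSY when the reset gave up after 31000ms.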

First, check ada3 to see how things were laid out.

<fsrv1: 124> gpart show ada3
=>        40  3907029088  ada3  GPT  (1.8T)
          40  3907029088     1  freebsd-zfs  (1.8T)

Create the partition.

fsrv1# gpart create -s GPT ada1
ada1 created
fsrv1# gpart add -l ZFL694RP -t freebsd-zfs /dev/ada1
ada1p1 added
fsrv1# gpart show ada1
=>        40  3907029088  ada1  GPT  (1.8T)
          40  3907029088     1  freebsd-zfs  (1.8T)
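
A quick sanity check on the numbers gpart printed: the partition size in bytes and whether the start sector sits on a 4 KiB boundary. This assumes gpart is counting 512-byte logical sectors on this disk, which the output above does not actually state; the helper names are made up for this note.

```shell
#!/bin/sh
# Assumption: gpart counts 512-byte logical sectors on this disk.
SECTOR_BYTES=512

# Partition size in bytes for a given sector count.
part_bytes() {
    echo $(( $1 * $SECTOR_BYTES ))
}

# Prints "aligned" if the given start sector falls on a 4 KiB boundary.
check_4k_alignment() {
    if [ $(( $1 * $SECTOR_BYTES % 4096 )) -eq 0 ]; then
        echo aligned
    else
        echo misaligned
    fi
}

part_bytes 3907029088     # size column of ada1p1
check_4k_alignment 40     # start column of ada1p1
```

Under that assumption the partition is 2000398893056 bytes, which is about 1.8 TiB and matches the `(1.8T)` gpart shows, and a start at sector 40 is 4 KiB aligned.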

Check the zpool status.

fsrv1# zpool status zdata
  pool: zdata
 state: DEGRADED
status: One or more devices has been removed by the administrator.
        Sufficient replicas exist for the pool to continue functioning in a
        degraded state.
action: Online the device using 'zpool online' or replace the device with
        'zpool replace'.
  scan: resilvered 382G in 0 days 02:25:26 with 0 errors on Sun Mar 24 19:32:27 2019
config:

        NAME                     STATE     READ WRITE CKSUM
        zdata                    DEGRADED     0     0     0
          mirror-0               DEGRADED     0     0     0
            gpt/WD-WMAZA9411575  ONLINE       0     0     0
            3779805168453648840  REMOVED      0     0     0  was /dev/gpt/WD-WMC4N0EASTST

errors: No known data errors
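
For a cron job or a quick glance, the config table can be boiled down to just the entries that are not ONLINE. A minimal sketch, assuming the `zpool status` layout shown above; `unhealthy_vdevs` is a hypothetical helper name and the list of states is not exhaustive.

```shell
#!/bin/sh
# Print "name state" for every row of the config table whose state
# is not ONLINE. Reads `zpool status` output on stdin.
# NF >= 5 skips the "state: DEGRADED" header line, which has only two fields.
unhealthy_vdevs() {
    awk 'NF >= 5 && $2 ~ /^(DEGRADED|FAULTED|OFFLINE|REMOVED|UNAVAIL)$/ {
             print $1, $2
         }'
}

# Demo on the table from this post.
unhealthy_vdevs <<'EOF'
 state: DEGRADED
        NAME                     STATE     READ WRITE CKSUM
        zdata                    DEGRADED     0     0     0
          mirror-0               DEGRADED     0     0     0
            gpt/WD-WMAZA9411575  ONLINE       0     0     0
            3779805168453648840  REMOVED      0     0     0  was /dev/gpt/WD-WMC4N0EASTST
EOF
```

Live usage would be `zpool status zdata | unhealthy_vdevs`. For a plain yes/no answer, `zpool status -x` only reports pools that have problems.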

Replace it.

fsrv1# zpool replace -f zdata 3779805168453648840 /dev/gpt/ZFL694RP

This took a little while lol

fsrv1# zpool status zdata
  pool: zdata
 state: DEGRADED
status: One or more devices is currently being resilvered.  The pool will
        continue to function, possibly in a degraded state.
action: Wait for the resilver to complete.
  scan: resilver in progress since Fri Dec 16 22:02:50 2022
        35.6G scanned at 335M/s, 1.21M issued at 11.3K/s, 689G total
        0 resilvered, 0.00% done, no estimated completion time
config:

        NAME                       STATE     READ WRITE CKSUM
        zdata                      DEGRADED     0     0     0
          mirror-0                 DEGRADED     0     0     0
            gpt/WD-WMAZA9411575    ONLINE       0     0     0
            replacing-1            REMOVED      0     0     0
              3779805168453648840  REMOVED      0     0     0  was /dev/gpt/WD-WMC4N0EASTST
              gpt/ZFL694RP         ONLINE       0     0     0

errors: No known data errors
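
Since the resilver takes hours, it is handy to pull just the percentage out of the status output instead of rereading the whole thing. A small sketch that assumes the `N% done` wording of the scan section above (other ZFS versions may word it differently); `resilver_progress` is a name made up for this note.

```shell
#!/bin/sh
# Extract the "NN.NN%" figure from the scan section of `zpool status`.
# Reads the status output on stdin and prints the first percentage found
# on a line containing "% done".
resilver_progress() {
    awk '/% done/ {
             for (i = 1; i <= NF; i++)
                 if ($i ~ /%$/) { print $i; exit }
         }'
}

# Demo on the line from the status output above.
echo '        0 resilvered, 0.00% done, no estimated completion time' \
    | resilver_progress
```

Something like `zpool status zdata | resilver_progress` in a loop with `sleep 600` would give a rough progress log.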