3 pgs not deep-scrubbed in time异常处理

本文转载自查看原文 2020-11-10 14:51 1629 ceph

　　查测试环境ceph集群时发现集群状态出来 “3 pgs not deep-scrubbed in time” 这个告警信息，特此将处理过程记录一下。

查看集群状态信息

[root@ceph-p-001 ~]# ceph -s 
  cluster:
    id:     f0f53ab6-36bf-48f0-98dd-4fad46e31991
    health: HEALTH_WARN 3 pgs not deep-scrubbed in time
 
  services:
    mon: 3 daemons, quorum ceph-p-001,ceph-v-003,ceph-p-002 (age 2w)
    mgr: node2(active, since 4M), standbys: node1
    mds: cephfs:1 {0=node2=up:active} 1 up:standby
    osd: 12 osds: 12 up (since 4M), 12 in (since 5M)
    rgw: 3 daemons active (node1, node2, node3)
 
  data:
    pools:   10 pools, 640 pgs
    objects: 1.17M objects, 617 GiB
    usage:   1.3 TiB used, 21 TiB / 22 TiB avail
    pgs:     640 active+clean
 
  io:
    client:   2.3 KiB/s rd, 352 KiB/s wr, 2 op/s rd, 37 op/s wr

　　发现出现异常warn信息，虽然不影响整个集群正常使用，但强迫症患者还是忍不了，下面是过程。

查看具体报错信息

[root@ceph-p-001 ~]# ceph health detail 
HEALTH_WARN 3 pgs not deep-scrubbed in time
PG_NOT_DEEP_SCRUBBED 3 pgs not deep-scrubbed in time
    pg 12.1b not deep-scrubbed since 2020-10-29 03:52:31.523550
    pg 4.d not deep-scrubbed since 2020-10-29 05:30:15.630028
    pg 9.39 not deep-scrubbed since 2020-10-29 05:01:41.849331

　　这是由于部分PG没有deep-scrubbed造成，手动对异常PG进行deep-scrubb清理及可

[root@ceph-p-001 ~]# ceph pg deep-scrub 12.1b
instructing pg 12.1b on osd.11 to deep-scrub

[root@ceph-p-001 ~]# ceph pg deep-scrub 4.d
instructing pg 4.d on osd.9 to deep-scrub

[root@ceph-p-001 ~]# ceph pg deep-scrub 9.39
instructing pg 9.39 on osd.11 to deep-scrub

处理后在查看集群状态发现出现了一个active+clean+scrubbing+deep进程

[root@ceph-p-001 ~]# ceph -s 
  cluster:
    id:     f0f53ab6-36bf-48f0-98dd-4fad46e31991
    health: HEALTH_WARN
            1 pgs not deep-scrubbed in time
 
  services:
    mon: 3 daemons, quorum ceph-p-001,ceph-v-003,ceph-p-002 (age 2w)
    mgr: node2(active, since 4M), standbys: node1
    mds: cephfs:1 {0=node2=up:active} 1 up:standby
    osd: 12 osds: 12 up (since 4M), 12 in (since 5M)
    rgw: 3 daemons active (node1, node2, node3)
 
  data:
    pools:   10 pools, 640 pgs
    objects: 1.17M objects, 618 GiB
    usage:   1.3 TiB used, 21 TiB / 22 TiB avail
    pgs:     639 active+clean
             1   active+clean+scrubbing+deep
 
  io:
    client:   3.2 KiB/s rd, 408 KiB/s wr, 3 op/s rd, 43 op/s wr

ceph集群详细信息中看到部分PG恢复正常

[root@ceph-p-001 ~]# ceph health detail 
HEALTH_WARN 1 pgs not deep-scrubbed in time
PG_NOT_DEEP_SCRUBBED 1 pgs not deep-scrubbed in time
    pg 4.d not deep-scrubbed since 2020-10-29 05:30:15.630028

一段时间后集群完成恢复正常

[root@ceph-p-001 ~]# ceph -s 
  cluster:
    id:     f0f53ab6-36bf-48f0-98dd-4fad46e31991
    health: HEALTH_OK
 
  services:
    mon: 3 daemons, quorum ceph-p-001,ceph-v-003,ceph-p-002 (age 2w)
    mgr: node2(active, since 4M), standbys: node1
    mds: cephfs:1 {0=node2=up:active} 1 up:standby
    osd: 12 osds: 12 up (since 4M), 12 in (since 5M)
    rgw: 3 daemons active (node1, node2, node3)
 
  data:
    pools:   10 pools, 640 pgs
    objects: 1.17M objects, 618 GiB
    usage:   1.3 TiB used, 21 TiB / 22 TiB avail
    pgs:     640 active+clean
 
  io:
    client:   2.4 KiB/s rd, 1.1 MiB/s wr, 3 op/s rd, 62 op/s wr

免责声明！

本站转载的文章为个人学习借鉴使用，本站对版权不负任何法律责任。如果侵犯了您的隐私权益，请联系本站邮箱yoyou2525@163.com删除。

猜您在找 ceph pgs not deep-scrubbed in time 处理异常处理和全局异常处理 python之异常处理 HttpClient异常处理 Lua异常处理 Django 异常处理 Python之异常处理 NodeJS之异常处理 AMD异常处理 python的异常处理