Notes on a Rook error when deploying a pod: "an operation with the given Volume ID already exists"


Scenario

While deploying NGINX Service Mesh, the following error appeared: provisioning the PVC for the pod failed.

   # Inspecting the pod's logs shows the following error
   - failed to provision volume with StorageClass "rook-ceph-block": rpc error: code = Aborted desc = an operation with the given Volume ID pvc-1f72de31-2bd3-41c5-8845-cccad96b671e already exists
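To trace this message further, it can help to pull the volume ID out of the error text and search the provisioner logs for it. The following is only a sketch: `extract_volume_id` and `find_in_provisioner_logs` are hypothetical helper names, and the sketch assumes the RBD provisioner runs as the `csi-rbdplugin-provisioner` deployment in the `rook-ceph` namespace, as in the restart commands later in this post.

```shell
# Hypothetical helper: extract the "pvc-<uuid>" volume ID from an error line.
extract_volume_id() {
  printf '%s\n' "$1" \
    | grep -oE 'pvc-[0-9a-f]{8}(-[0-9a-f]{4}){3}-[0-9a-f]{12}' \
    | head -n 1
}

# Hypothetical helper: search the RBD provisioner logs for a volume ID.
# Defined here but not called, since it needs access to the cluster.
find_in_provisioner_logs() {
  kubectl -n rook-ceph logs deployment/csi-rbdplugin-provisioner \
    --all-containers=true | grep "$1"
}

msg='failed to provision volume with StorageClass "rook-ceph-block": rpc error: code = Aborted desc = an operation with the given Volume ID pvc-1f72de31-2bd3-41c5-8845-cccad96b671e already exists'
vol_id=$(extract_volume_id "$msg")
echo "$vol_id"
```

With cluster access, `find_in_provisioner_logs "$vol_id"` would show which earlier request is still holding the lock on that volume ID.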

At this point the Ceph cluster itself reports healthy:

sh-4.4$ ceph status
  cluster:
    id:     f5bfad59-071a-49ef-9a0a-da9e011e028a
    health: HEALTH_OK
 
  services:
    mon: 3 daemons, quorum a,c,d (age 33m)
    mgr: a(active, since 45m), standbys: b
    osd: 3 osds: 3 up (since 45m), 3 in (since 46m)
 
  data:
    pools:   2 pools, 33 pgs
    objects: 28 objects, 25 MiB
    usage:   174 MiB used, 50 GiB / 50 GiB avail
    pgs:     33 active+clean
 
  io:
    client:   5.3 KiB/s wr, 0 op/s rd, 0 op/s wr
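The health check above can also be scripted. This is a minimal sketch: `ceph_health_ok` is a hypothetical helper that reads `ceph status` text on stdin, and the usage in the last comment assumes the Rook toolbox is deployed as `rook-ceph-tools`.

```shell
# Hypothetical helper: succeeds when the `ceph status` text on stdin
# contains a "health: HEALTH_OK" line.
ceph_health_ok() {
  grep -qE '^[[:space:]]*health:[[:space:]]*HEALTH_OK'
}

# Sample taken from the status output shown above.
sample='  cluster:
    id:     f5bfad59-071a-49ef-9a0a-da9e011e028a
    health: HEALTH_OK'

if printf '%s\n' "$sample" | ceph_health_ok; then
  echo "ceph healthy"
fi

# Against a live cluster (assumes the rook-ceph-tools toolbox deployment):
#   kubectl -n rook-ceph exec deploy/rook-ceph-tools -- ceph status | ceph_health_ok
```

A healthy cluster combined with a failing provisioner points at the CSI layer rather than at Ceph itself, which matches the conclusion below.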

This is a Rook bug; restarting the Rook-related pods resolves it.

The official documentation describes a fix for this problem.


Reference link

# Restart the CSI pods as the official documentation suggests
kubectl rollout restart daemonset csi-cephfsplugin -n rook-ceph
kubectl rollout restart deployment csi-cephfsplugin-provisioner -n rook-ceph
kubectl rollout restart deployment csi-rbdplugin-provisioner -n rook-ceph

The cluster returned to normal and PVCs could be provisioned again.
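The restart steps can be wrapped so that each restart waits for its rollout to finish before moving on. This is a sketch, not the procedure from the official documentation: `restart_and_wait` and `restart_all_csi` are hypothetical helpers, the 120-second timeout is an arbitrary assumption, and the `KUBECTL` variable exists only so the commands can be previewed without touching the cluster.

```shell
# Override KUBECTL (e.g. KUBECTL="echo kubectl") to preview the commands.
KUBECTL=${KUBECTL:-kubectl}
NS=rook-ceph

# Hypothetical helper: restart one workload, then wait for the rollout.
# $1 = resource kind (daemonset or deployment), $2 = resource name
restart_and_wait() {
  $KUBECTL rollout restart "$1" "$2" -n "$NS" &&
    $KUBECTL rollout status "$1" "$2" -n "$NS" --timeout=120s
}

# Hypothetical helper: restart the same CSI workloads as in this post.
restart_all_csi() {
  restart_and_wait daemonset csi-cephfsplugin &&
    restart_and_wait deployment csi-cephfsplugin-provisioner &&
    restart_and_wait deployment csi-rbdplugin-provisioner
}

# Dry run: print the commands instead of executing them.
#   KUBECTL="echo kubectl" restart_all_csi
# Real run:
#   restart_all_csi
```

After a real run, `kubectl get pvc` in the application's namespace should show the claim as `Bound` once provisioning succeeds.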