Replies: 5 comments
-
When you see the error …
-
Yes, that's true, there is a watcher:
But the RBD image is not mapped or mounted on this server. Is there anything else that could be creating the watcher?
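For reference, one way to find out where the watcher comes from (a sketch using the pool/image names from later in this thread; the `rados` step assumes you look up the image id first, and `<image-id>` is a placeholder):

```sh
# List watchers on the image; the output includes the client address,
# which tells you which host currently holds the watch.
rbd status k8s-dev/csi-vol-dea80a4e-85b6-46e2-9d08-ffba10ef7825

# Lower-level alternative: watchers sit on the image header object,
# named rbd_header.<image-id>; get the id from `rbd info` first.
rbd info k8s-dev/csi-vol-dea80a4e-85b6-46e2-9d08-ffba10ef7825 | grep -w id
rados -p k8s-dev listwatchers rbd_header.<image-id>
```

The client address in the watcher list is usually the fastest way to tell whether the watch belongs to a worker node, `rbd-mirror`, or something else entirely.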
-
Is it possible that the RBD image is mirrored to another location? A process like …
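A quick way to check that theory (a sketch, reusing the pool/image names from this thread):

```sh
# Per-image mirroring state; reports an error or "disabled"
# if the image is not mirrored.
rbd mirror image status k8s-dev/csi-vol-dea80a4e-85b6-46e2-9d08-ffba10ef7825

# Pool-wide mirroring configuration, including configured peers.
rbd mirror pool info k8s-dev
```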
-
I also encountered the same problem, but sometimes it automatically mounts back after a period of time, and sometimes it keeps getting stuck. I want to investigate whether the problem is in the Ceph cluster or in the CSI plugin.
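To split the two, it may help to look at both sides at the moment the mount gets stuck (a sketch; pool, image, namespace, and pod names are placeholders):

```sh
# Ceph side: is there a stale watcher on the image when the mount fails?
rbd status <pool>/<csi-vol-image>

# CSI side: what does the nodeplugin on the affected worker log
# around the failure?
kubectl logs -n <namespace> <csi-rbdplugin-pod> -c csi-rbdplugin --since=10m
```

If the watcher disappears on its own after a short while and the mount then succeeds, that points at a stale watch timing out in Ceph rather than at the CSI plugin.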
-
How to solve it?
-
**Describe the bug**
We have a Proxmox cluster with Ceph, and k8s running on top of it.
We are using the csi-rbd driver to provision PVCs.
It all seems to work: a pod is started, the PVC is created, an RBD image is created, everything gets mounted, and the app inside the pod is able to use the volume. Until the pod is deleted and recreated. From that moment it won't start, as the PVC is not ready for the pod (see the `FailedMount` error under **Steps to reproduce** below).
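The quickest way to see the exact reason the pod is stuck is its event list (the pod name is a placeholder):

```sh
# The FailedMount event with the full error message appears in the
# Events section at the bottom of the output.
kubectl describe pod <pod-name>
```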
I think I was able to verify that the volume is really not mounted anywhere. I got a shell inside each `ceph-csi-rbd-nodeplugin` pod on each worker node, and from the `csi-rbdplugin` container I looked for the rbd device with something like `rbd device ls | grep csi-vol-dea80a4e-85b6-46e2-9d08-ffba10ef7825`. I also double-checked directly on the worker node OS by looking at all mount entries. Nothing found.
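The node-side checks looked roughly like this (a sketch; the grep pattern is the volume name from above):

```sh
# Inside the csi-rbdplugin container: is the image mapped here?
rbd device ls | grep csi-vol-dea80a4e-85b6-46e2-9d08-ffba10ef7825

# Directly on the worker node OS: any rbd block devices or mounts left?
lsblk | grep rbd
grep rbd /proc/mounts
```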
I also tried to mount the device manually. I picked a `ceph-csi-rbd-nodeplugin` container, prepared a ceph config and keyring, and I was able to map the image with `rbd -c /tmp/ceph.conf -k /tmp/ceph.keyring --id k8s-dev --pool k8s-dev map csi-vol-dea80a4e-85b6-46e2-9d08-ffba10ef7825` and then mount the block device on the host with `mount /dev/rbd7 /mnt/`. That worked; now the rbd image is actually being used :)

But I can't find out why k8s thinks it can't be mounted. I need some help here, please.
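One caveat with the manual test above: the map itself registers a watcher on the image, so it has to be undone before retrying the pod, otherwise the "still being used" error becomes self-inflicted (device name taken from the step above):

```sh
# Undo the manual test mount; as long as /dev/rbd7 stays mapped,
# the kernel client keeps a watch on the image.
umount /mnt
rbd unmap /dev/rbd7
```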
Is there a chance that `storage.kubernetes.io/csiProvisionerIdentity` could play any role in all of this? Does it change when the `ceph-csi-rbd-provisioner` is redeployed? Because I noticed that the `csiProvisionerIdentity` is different among persistent volumes, and I did a couple of redeploys of the provisioner.
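To compare the recorded value across volumes, it can be read straight off the PV object (a sketch; the PV name is taken from the error message below):

```sh
# Dump the CSI volume attributes, including csiProvisionerIdentity,
# that were recorded on the PV at provisioning time.
kubectl get pv pvc-7e9588ad-19d1-4099-bd49-30fd2b9f0d42 -o jsonpath='{.spec.csi.volumeAttributes}'
```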
**Environment details**

- Mounter used for mounting PVC (for cephFS it's `fuse` or `kernel`; for rbd it's `krbd` or `rbd-nbd`):

**Steps to reproduce**
Delete and recreate the pod. The new pod is stuck in the `Init` phase because of `FailedMount`:

`MountVolume.MountDevice failed for volume "pvc-7e9588ad-19d1-4099-bd49-30fd2b9f0d42" : rpc error: code = Internal desc = rbd image k8s-dev/csi-vol-dea80a4e-85b6-46e2-9d08-ffba10ef7825 is still being used`
Nothing interesting in the logs unfortunately, or at least I have not noticed anything.