-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Lost lease while inactive #114
Comments
Issuing lvmlockctl -i -d shows the kill_vg=1 on all nodes. Issuing a drop and lock start on just one node will force a kill event in less than 30 seconds. issuing a drop on all nodes and then starting worked. |
FROM LEO: 2020-10-05 12:02:31 1465913 [21551]: <<<<< RAID lock dump: raid_renew_lock <<<<< 2020-10-22 14:05:17 2942079 [21551]: ilm_failure_handler: kill_path=/usr/sbin/lvmlockctl So we can see the lockspace thread (PID=21551) renewed the lock and But the thread slept time point is "2020-10-05" and its waken up time I don't see drive firmware is relevant to this issue based on the log;
|
FROM LEO: I went through the sanlock lock manager and my conclusion is sanlock Rather than a self-correcting solution after the error happens, I |
From Leo: In the DLM daemon, it registers callbacks into corosync [1]: static cpg_model_v1_data_t cpg_callbacks = { So the callback function confchg_cb() will call the function In the kernel side, DLM driver provides callbacks dlm_ls_start() and So suggest below direction to move forward:
If use Corosync, we don't need to use watchdog anymore. |
Some following up discussion: https://listman.redhat.com/archives/lvm-devel/2021-February/msg00077.html And on the mailing list, there have two old patches to enable automatic deactivate VG/LVs: |
For the automatic failure handling, two patch sets have been merged:
So now the latest repository has supported automatic failure handling. |
Node lost VG lease for unknown reason on 1 of 4 nodes in cluster.
Broadcast message from systemd-journald@node1 (Tue 2020-10-13 09:21:52 CDT):
lvmlockctl[25120]: Lost access to sanlock lease storage in VG sbvg_datalake.
Broadcast message from systemd-journald@node1 (Tue 2020-10-13 09:21:52 CDT):
lvmlockctl[25120]: Immediately deactivate LVs in VG sbvg_datalake.
Broadcast message from systemd-journald@node1 (Tue 2020-10-13 09:21:52 CDT):
lvmlockctl[25120]: Once VG is unused, run lvmlockctl --drop sbvg_datalake.
The text was updated successfully, but these errors were encountered: