-
-
Notifications
You must be signed in to change notification settings - Fork 366
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug]: One control plane node stuck waiting for MicroOS #1484
Comments
@heysarver Please try rebooting the node with hcloud, see if it fixes it. |
Rebooting solved it but it's still an issue. I added another worker pool and had the same results, all but 1 came up ok and a reboot of that fixed it again. |
@heysarver Remove the kured-ttl setting. Remove also the autoscaler version (the default set value is needed). terraform init -upgrade Plan B Make sure the underlying image is good, rebuild it if needed, with the packer command. Debug cloud-init and what could be happening on boot, ask https://claude.ai for the exact commands and give it the logs. |
@JWDobken please create a new issue with all the details. |
rebuilding the image seemed to have solved my issue, thank you. |
@mysticaltech I've started using it already and have hit my limits on a new account so I'll have to wait to try, but sounds reasonable. |
I can confirm this was my issue with kured_options lock-ttl set to 30m. When I made a new cluster to confirm, I also had to manually open the firewall ports for the nginx ingress load balancer with this config. Any ideas on that or should I open a new issue? |
@heysarver Please reframe the issue, I'm not understanding clearly the issues you are still facing. |
@mysticaltech I'm having to add rules for the destination nginx-ingress ports manually to the firewall after creating, otherwise all the targets are unhealthy. This causes terraform state to get out of sync. |
@heysarver Please open a new issue with the full working kube.tf apart from private info, and steps to reproduce please. |
Description
I'm trying to deploy a cluster with 3 or 5 control nodes, both have the same result. N-1 nodes come up successfully but after several terraform destroy and apply plans there's always 1 control node that is stuck in "Waiting for MicroOS to become available..." until terraform times out.
Kube.tf file
Screenshots
Failed Node:
Platform
MacOS, Terraform Cloud
The text was updated successfully, but these errors were encountered: