Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Final EKS Auto Mode testing #178

Draft
wants to merge 8 commits into
base: main
Choose a base branch
from
Draft

Final EKS Auto Mode testing #178

wants to merge 8 commits into from

Conversation

Eli1123
Copy link
Collaborator

@Eli1123 Eli1123 commented Feb 13, 2025

Currently was able to have it deploy to my environment and work properly utilizing EKS auto for the management cluster and resolved the EBS volumes issues with changes to the postgres.yaml file and adding the keycloak.yaml file.

Passing to Hari to test changes locally to confirm.

@Eli1123 Eli1123 requested a review from hmuthusamy February 13, 2025 18:56
# # security group that Karpenter should utilize with the following tag
# # (i.e. - at most, only one security group should have this tag in your account)
# "karpenter.sh/discovery" = local.name
"eks.amazonaws.com/discovery" = "modern-engineering"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

keep local.name instead of modern-engineering.

Copy link
Contributor

@hmuthusamy hmuthusamy left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Tested with new 1.32 version locally. works fine. tested multiple times. Time to test in TE once merged @shapirov103 @elamaran11

@elamaran11
Copy link
Contributor

QQ, We are using Auto Mode only on Management cluster or also on Child clusters?

@elamaran11
Copy link
Contributor

I will do a code review tomorrow.

@hmuthusamy
Copy link
Contributor

QQ, We are using Auto Mode only on Management cluster or also on Child clusters?

Only Management cluster. Child clusters are using blueprints still

Copy link
Contributor

@elamaran11 elamaran11 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@Eli1123 @hmuthusamy Firstly great work, Im good with the changes, i dont see any issues on the changes. What im concerned on is the below which we need to prove that there is no issues before we can merge:

  1. Auto Mode runs only on BottleRocket, did we check if each any every workload, argocd app that was deployed on Management cluster works fine and shows Green in Argo UI?.
  2. Did all the pods come up fine in management cluster or we see any issues with any pods. Are there any pods that cannot work on BR, we need to double check this
  3. Did any one do apple to apple comparison of TE in Workshop vs the deployment run with Auto Mode in your environment. Do you see any difference in behavior
  4. Did we have anyone run the workshop like run end to end with your auto mode environment. This is mandatory for merge.

This is a breaking change, so we need to have thorough review, before we can merge.

@shapirov103 Please check from your end too.

@Eli1123
Copy link
Collaborator Author

Eli1123 commented Feb 20, 2025

@elamaran11 Just chatted with Hari and a brief review found some possible issues to work through. We plan to meet up tomorrow to trouble shoot more as his proper wifi should return later today, and he will run end to end one more time as well to see if he replicates my possible issues.

@Eli1123
Copy link
Collaborator Author

Eli1123 commented Mar 6, 2025

Hari identified the initial issue to be related to load balancer security group issues with EKS Auto mode. Currently I've been trouble shooting how to resolve that and then see if any other errors surface with auto mode.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants