Skip to content

Stability fixes, and related logging for slowdowns in dispatcher task processing #12676

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 9 commits into from
Aug 17, 2022

Conversation

AlanCoding
Copy link
Member

SUMMARY

Things this does

  • Add logging of things which might contribute to an instance's failure (task processing delays, for example)
  • Provision extra dispatch workers to prevent delays in running the heartbeat
  • Be extra sure to not reap running tasks, passing excluded_uuids into the waiting job reaper
  • Add lots of custom error messages - like distinguishing a control process SIGTERM from execution process
ISSUE TYPE
  • Bug, Docs Fix or other nominal change
COMPONENT NAME
  • API

@AlanCoding AlanCoding requested a review from shanemcd August 17, 2022 16:05
@AlanCoding AlanCoding marked this pull request as ready for review August 17, 2022 16:58
@AlanCoding AlanCoding requested a review from kdelee August 17, 2022 17:02
Copy link
Member

@kdelee kdelee left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

one day maybe we can fix DISPATCHER_DB_DOWNTOWN_TOLLERANCE 😮‍💨

@AlanCoding AlanCoding merged commit 85beb9e into ansible:devel Aug 17, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants