Add provisioning retries to beaker plugin to overcome aborted beaker jobs #3214

Open
happz opened this issue Sep 12, 2024 · 0 comments
Labels
plugin | mrack The beaker provision plugin step | provision Stuff related to the provision step

Comments

Collaborator

happz commented Sep 12, 2024

As discussed with several users of the beaker plugin, a single failed Beaker job ends provisioning, even though a new, retried job might succeed. tmt lacks the ability that beaker-jobwatch has to deal with unsuccessful reservations:

  • submit a new job when the current one aborts
  • there must be a limit on retries, controlled by a key with a sane default
  • if possible, new jobs should avoid machines on which the aborted jobs crashed; this might be optional, controlled by a flag-like key
  • it probably makes less sense to retry a job that spent its whole lifetime in the queue and never ran
  • clear logging of job details and task results would be nice, even if only in debug/verbose mode
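
To make the points above a bit more concrete, here is a rough sketch of what the retry loop could look like. It is not tied to the actual plugin or mrack code: `JobResult`, `ProvisionError`, `provision_with_retries`, the `submit_job` callable, and the `max_retries` / `avoid_failed_hosts` names are all made up for illustration, and the status strings are assumptions rather than real Beaker values.

```python
import logging
from dataclasses import dataclass
from typing import Callable, FrozenSet, Optional, Set

logger = logging.getLogger("beaker-retry-sketch")


@dataclass
class JobResult:
    """Hypothetical summary of one finished Beaker job."""
    job_id: str
    status: str               # assumed values: "completed" or "aborted"
    hostname: Optional[str]   # machine the job ran on, if any
    started: bool             # False if the job never left the queue


class ProvisionError(Exception):
    """Raised when provisioning gives up."""


def provision_with_retries(
    submit_job: Callable[[FrozenSet[str]], JobResult],
    max_retries: int = 3,
    avoid_failed_hosts: bool = True,
) -> JobResult:
    """Submit a job, resubmitting after an abort up to max_retries times.

    submit_job is a hypothetical callable: it receives the set of hostnames
    to avoid, submits one job, waits for it, and returns a JobResult.
    """
    excluded: Set[str] = set()

    # One initial attempt plus up to max_retries resubmissions.
    for attempt in range(1, max_retries + 2):
        result = submit_job(frozenset(excluded))
        logger.debug(
            "attempt %d: job %s ended as %s on %s",
            attempt, result.job_id, result.status, result.hostname,
        )

        if result.status == "completed":
            return result

        # Only aborted jobs are retried; anything else is a hard failure.
        if result.status != "aborted":
            raise ProvisionError(
                f"job {result.job_id} ended as {result.status}")

        # A job that never left the queue is unlikely to fare better on retry.
        if not result.started:
            raise ProvisionError(
                f"job {result.job_id} aborted without ever running")

        # Optionally steer the next job away from the machine that crashed.
        if avoid_failed_hosts and result.hostname:
            excluded.add(result.hostname)

    raise ProvisionError(
        f"provisioning failed after {max_retries + 1} attempts")
```

In this sketch `max_retries` counts resubmissions only, so the default of 3 means at most four jobs in total. Whether a job that aborted in the queue should be retried at all is left as a hard stop here, which may be too strict in practice.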
@happz happz added step | provision Stuff related to the provision step plugin | mrack The beaker provision plugin labels Sep 12, 2024