Proposal: Automatically suspend CronWorkflow for repeated errors in child workflows #5659
Comments
@alexec I can work on this if you think this is a reasonable feature to have. We currently have a hacky workaround and it would be good to implement the logic in the controller itself.
I think this is a good idea
I created a quick draft PR in #5662 and left some comments so that we can discuss pros and cons of different designs in several places.
Addressed in #12305
@terrytangyuan per #12696 (comment), #12305's So it seems like this is not entirely addressed in #12305, a separate
This feels like just a difference in the words used, not a difference in function. You could patch the phase away from We don't have a CLI command to patch back to
No, the functional difference was very intentional per #12696 (comment), as linked above. I literally asked about the difference and suggested a rename initially.
If you put it back to
The UI also (intentionally) treats it differently
Re-opening per my above comments, a
Summary
What change needs making?
Automatically suspend a CronWorkflow for repeated errors in its workflows.
Use Cases
When would you use this?
When repeated errors are observed in the child workflows, future child workflows will likely fail as well. In this case, the controller could automatically suspend the CronWorkflow so that it stops submitting new child workflows and avoids wasting computational resources.
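As a rough illustration of this behavior, here is a minimal Go sketch of the streak-counting logic. Everything in it is an assumption made for illustration: `CronState` and `RecordResult` are hypothetical names, not types or functions from the Argo Workflows codebase, and the limit is modeled as an integer. The issue declares `suspendFailedJobsLimit` as `bool`, but describes it as a maximum number of consecutive failures, so the sketch treats it as a count. It also assumes that a successful run resets the streak, that suspension happens once the streak reaches the limit, and that a limit of zero or less disables the feature.

```go
package main

// CronState is a hypothetical, simplified stand-in for the parts of a
// CronWorkflow this sketch needs. It is not the real Argo Workflows type.
type CronState struct {
	Suspend             bool // true once the cron should stop submitting children
	ConsecutiveFailures int  // length of the current failure streak
}

// RecordResult updates the failure streak after a child workflow finishes.
// A success resets the streak; a failure extends it. Once the streak reaches
// limit, the cron is marked as suspended. A limit of zero or less disables
// automatic suspension (an assumption of this sketch).
func (c *CronState) RecordResult(failed bool, limit int) {
	if !failed {
		c.ConsecutiveFailures = 0
		return
	}
	c.ConsecutiveFailures++
	if limit > 0 && c.ConsecutiveFailures >= limit {
		c.Suspend = true
	}
}
```

With a limit of 3, two failures followed by a success leave the cron running with the streak reset, and three failures in a row after that suspend it. Whether a real implementation should also clear the counter when a user manually resumes the CronWorkflow is a design question this sketch leaves open.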
Implementation-wise, we could add a field, suspendFailedJobsLimit bool (probably needs a better name), to CronWorkflow to specify the maximum number of consecutive failed workflows before we suspend the corresponding CronWorkflow.
Message from the maintainers:
Impacted by this bug? Give it a 👍. We prioritise the issues with the most 👍.