-
Notifications
You must be signed in to change notification settings - Fork 1.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
KEP-4443: Add more granular Job failure reason for PodFailurePolicy #4479
Conversation
danielvegamyhre
commented
Feb 5, 2024
- One-line PR description: Configurable Job failure reason for PodFailurePolicy
- Issue link: Add more granular failure reason for Job PodFailurePolicy #4443
- Other comments:
56d4b81
to
0e39c11
Compare
@soltysh for sig-apps lead review |
0e39c11
to
02bf4dc
Compare
keps/sig-apps/4443-configurable-pod-failure-policy-reasons/README.md
Outdated
Show resolved
Hide resolved
keps/sig-apps/4443-configurable-pod-failure-policy-reasons/README.md
Outdated
Show resolved
Hide resolved
keps/sig-apps/4443-configurable-pod-failure-policy-reasons/README.md
Outdated
Show resolved
Hide resolved
keps/sig-apps/4443-configurable-pod-failure-policy-reasons/README.md
Outdated
Show resolved
Hide resolved
keps/sig-apps/4443-configurable-pod-failure-policy-reasons/README.md
Outdated
Show resolved
Hide resolved
keps/sig-apps/4443-configurable-pod-failure-policy-reasons/README.md
Outdated
Show resolved
Hide resolved
keps/sig-apps/4443-configurable-pod-failure-policy-reasons/README.md
Outdated
Show resolved
Hide resolved
keps/sig-apps/4443-configurable-pod-failure-policy-reasons/README.md
Outdated
Show resolved
Hide resolved
keps/sig-apps/4443-configurable-pod-failure-policy-reasons/README.md
Outdated
Show resolved
Hide resolved
keps/sig-apps/4443-configurable-pod-failure-policy-reasons/README.md
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You need a prod-readiness file for alpha I believe.
keps/sig-apps/prod-readiness/4443.yaml
keps/sig-apps/4443-configurable-pod-failure-policy-reasons/kep.yaml
Outdated
Show resolved
Hide resolved
de03300
to
ac89cc7
Compare
ac89cc7
to
5cdd49e
Compare
@wojtek-t would you mind doing a PRR review for this KEP? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There are still some missing bits that needs to be filled in before we can accept this KEP.
## Summary | ||
|
||
This KEP proposes to extend the Job API by adding an optional `Name` field to `PodFailurePolicyRule`. If unset, it would default to the index of | ||
the rule in the `podFailurePolicy.rules` slice. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
PodFailurePolicy reason was rejected quite some time ago, as potentially overbloating condition's reason field. Index or the short name, which would be validated to fit in the condition's reason was preferred.
keps/sig-apps/4443-configurable-pod-failure-policy-reasons/README.md
Outdated
Show resolved
Hide resolved
keps/sig-apps/4443-configurable-pod-failure-policy-reasons/README.md
Outdated
Show resolved
Hide resolved
CRI or CNI may require updating that component before the kubelet. | ||
--> | ||
|
||
N/A. This feature doesn't require coordination between control plane components, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Still holds.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Clarifying the default is the last bit to get this approved.
## Summary | ||
|
||
This KEP proposes to extend the Job API by adding an optional `Name` field to `PodFailurePolicyRule`. If unset, it would default to the index of | ||
the rule in the `podFailurePolicy.rules` slice. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The above although readable to human is hard to parse. The idea is to be able to use the reason being in your example PodFailurePolicy_0
and parse it in higher level controller.
## Design Details | ||
|
||
#### Defaulting | ||
If unset, the `Name` field will default to the index of the pod failure policy rule in the `Rules` slice. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm of the opinion we should not default, leave it empty, which is acceptable in which case the resulting Reason
will contain the index. Iow. no defaulting happening, and that should be explicitly mentioned here.
keps/sig-apps/4443-configurable-pod-failure-policy-reasons/README.md
Outdated
Show resolved
Hide resolved
Updated the defaulting section to specify there will be no defaulting, and if |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
One point from Filip, but that's not blocking as long as it gets implemented.
/lgtm
/approve
from sig-apps pov
/hold
for prr review
/assign @jpbetz
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/lgtm
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/approve
For PRR
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: danielvegamyhre, jpbetz, soltysh The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
/hold cancel Thanks a lot all for the great feedback, I think we got to a very good conclusion here! kudos to the community! |