You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We use ray on Kubernetes using the kuberay project. We have a sanity test that runs a simple job via the job submission API the workload succeeds however the metric unintentional_worker_failures_total is also incremented.
That metric should not however be incremented. The definition of the metric reads Number of worker failures that are not intentional.
I asked about it on the slack channel and was told to file an issue.
What happened + What you expected to happen
We use ray on Kubernetes using the
kuberay
project. We have a sanity test that runs a simple job via the job submission API the workload succeeds however the metricunintentional_worker_failures_total
is also incremented.That metric should not however be incremented. The definition of the metric reads Number of worker failures that are not intentional.
I asked about it on the slack channel and was told to file an issue.
Versions / Dependencies
2.6.1
Reproduction script
Issue Severity
Low: It annoys or frustrates me.
The text was updated successfully, but these errors were encountered: