[Feature] Modify Job provider to support any kind of Kubernetes CRDs #1214
Closed
Description
/kind feature
After migrating to the new Trial Template design (#1202), we want to extend current Job provider to support any kind of Kubernetes CRDs that follow Trial job patterns (e.g Argo template: #1081).
Currently, Job provider supports only batch
Jobs and Kubeflow Jobs.
We can extend Trial Template API with the custom settings to define:
- How to get succeeded state of the Job.
- How to properly mutate metrics collector on training pod.
- How to specify
sidecar.istio.io/inject: false
annotation. (That can be done by user in advance).
Maybe we should define something more, this needs to be investigated.
Let's discuss about all required changes in this issue.