-
Notifications
You must be signed in to change notification settings - Fork 1.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Load samples returns an error in 0.4 Kubeflow #603
Comments
Thanks Jeremy, how did you do the deployment? Did you deploy to a clean cluster or upgrading a existing cluster? |
Same here!
|
@hongye-sun On surface the error message looks relevant to the metrics table. do you have any idea on this failure? meanwhile I'll create a cluster too and dive into the issue |
It seems the issue was due to the DB initialized twice during both k8s job to sample loading and api server startup. We can remove the DB initialization from the job to load samples |
I'll send a PR shortly |
Cool
…On Wed, Jan 2, 2019 at 2:47 PM IronPan ***@***.***> wrote:
I'll send a PR shortly
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#603 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AHXSRPdaJqzmrh-Kk8kMpf5ApgaJoOFTks5u_TcRgaJpZM4Zkxk1>
.
|
Interestingly I wasn't able to reproduce it from a GKE cluster. Does it fail all the time? |
The source code shows it would handles gracefully if the foreign key already exist. It align with what I observed. Is there any cluster you can share with me to investigate? @jlewi |
I am now on v0.4.0-rc.3 and the issue went away (0 restarts) |
Nice I'll close this one |
/close |
@IronPan: Closing this issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
1 similar comment
@IronPan: Closing this issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
* Create a new Kubeflow cluster to run auto-deploy v2. * kubeflow#595 we are creating a new version of the infrastructure to auto-deploy Kubeflow from master and release branches * This PR contains the GCP and Kustomize manifests for setting up a new Kubeflow cluster where this app will run. * We can't use the existing kubeflow-testing cluster because that doesn't have ISTIO or workload identity which we want for the auto-deploy app. * Remove OWNERs files
* Fix for the bug kubeflow#602 Clears some verbiage for kserve/kserve#602 * remove references to installing KNative through manifests Few people have reported that causes confusion. Either you get KNative and Istio through Kubeflow, or through their respective installation sites * Update Istio versions * fiz istio version
I deployed Kubeflow from the 0.4 release branch
The ml pipelines container is in an error state. Logs for the pod show
Related to: kubeflow/kubeflow#2098 0.4 release
/cc @IronPan
The text was updated successfully, but these errors were encountered: