-
Notifications
You must be signed in to change notification settings - Fork 89
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Apps CD scripts need to be updated to work with kustomize refactor #631
Comments
Issue-Label Bot is automatically applying the labels:
Please mark this comment with 👍 or 👎 to give our bot feedback! |
@Jeffwan @terrytangyuan FYI; another engprod issue that we will want to fix in order to automatically update the manifests with newly built images. |
Issue-Label Bot is automatically applying the labels:
Please mark this comment with 👍 or 👎 to give our bot feedback! |
@jlewi, Hi I'm Kevin Su. |
/assign |
Thanks @pingsutw this would be a huge help! In case its helpful: Here's an example of an auto generated PR to update the manifest for the jupyter webapp The file that is getting updated is We also need to update this file: |
@pingsutw How's this going? |
Issue-Label Bot is automatically applying the labels:
Please mark this comment with 👍 or 👎 to give our bot feedback! |
@jlewi Sorry for the reply. |
@pingsutw you can check And see if updated PRs are being created. |
@pingsutw How's this going? |
sorry @jlewi, I thought this PR is already done in #673, So I work on other kubeflow issues recently. If I understand correctly, https://github.com/kubeflow/testing/blob/master/apps-cd/applications.yaml will update docker tag in https://github.com/kubeflow/manifests So I'm not sure which manifest also needs to update and be added in application.yaml. |
Thanks @pingsutw ; we will need to create updates for most of the applications you can get a list here I went through the file and compared to the manifests; looks like we just need a couple more to
Also it looks like the unittests are failing for the updated PRs Could you try cloning one of the auto-update PRs locally and running the unittests to see why they are failing? |
@jlewi gotcha, it's very clear to me. |
Hi @pingsutw! @jlewi Do you know, how we can trigger kubeflow-bot to run |
Hi, I just test branch kubeflow/manifests#1253 as well. git clone https://github.com/kubeflow/manifests.git
cd manifests
git fetch origin pull/1253/head:test
git checkout test
cd tests
make generate-changed-only I found that $ git diff
iff --git a/tests/go.mod b/tests/go.mod
index 899b522..7155915 100644
--- a/tests/go.mod
+++ b/tests/go.mod
@@ -3,7 +3,6 @@ module github.com/kubeflow/manifests/tests
go 1.13
require (
- github.com/PuerkitoBio/purell v1.1.1 // indirect
github.com/ghodss/yaml v1.0.1-0.20190212211648-25d852aebe32
github.com/gogo/protobuf v1.2.2-0.20190730201129-28a6bbf47e48 // indirect
github.com/golang/protobuf v1.3.2 // indirect
diff --git a/tests/stacks/examples/alice/test_data/expected/apps_v1_deployment_pytorch-operator.yaml b/tests/stacks/examples/alice/test_data/expected/apps_v1_deployment_pytorch-operator.yaml
index 8897df4..a8ad07d 100644
--- a/tests/stacks/examples/alice/test_data/expected/apps_v1_deployment_pytorch-operator.yaml
+++ b/tests/stacks/examples/alice/test_data/expected/apps_v1_deployment_pytorch-operator.yaml
@@ -40,6 +40,6 @@ spec:
valueFrom:
fieldRef:
fieldPath: metadata.name
- image: gcr.io/kubeflow-images-public/pytorch-operator:vmaster-g047cf0f
+ image: gcr.io/kubeflow-images-public/pytorch-operator:vmaster-gd596e904
name: pytorch-operator
serviceAccountName: pytorch-operator
diff --git a/tests/stacks/generic/test_data/expected/apps_v1_deployment_pytorch-operator.yaml b/tests/stacks/generic/test_data/expected/apps_v1_deployment_pytorch-operator.yaml
@andreyvelich Did you have the same problem? |
@pingsutw I think you just need to run You can run After updating test files in |
@pingsutw Do you know how we can manually update branch: https://github.com/kubeflow-bot/manifests/tree/update_tf_operator_vmaster-ga2ae7bff ? |
I did it as well. If I use
Sorry, I don't know how to update that branch |
@pingsutw @andreyvelich we don't want to manually update the branch. I think we need to fix the update code to run the proper command.
So you can change that to Because the |
It looks to me like the expected values for the tests. Is trying to change the image to So we need to update the expected value for the test which is in this file
@pingsutw I'm a little confused by your diff because based on the diff it looks like
So it looks like |
@pingsutw i wonder if for some reason
You might want to look into getting viewer access to the stackdriver logs so you can see what happened. |
Sorry, I'm wrong. I didn't notice the path is
yap, there is a file that isn't getting included in the commit. |
@jlewi Yes, you are right. How we can check that this script (https://github.com/kubeflow/testing/blob/master/py/kubeflow/testing/cd/create_manifests_pr.py) was properly run? |
You would need to get access to the GCP release infrastructure. The GCP project it is running in is here: I think automation of access control to the release infra is a bit wanting. So you could add yourself there and then ping me to sync the groups after the PR is merged. |
Sure, thanks. I created the PR: kubeflow/internal-acls#266 |
I tried to dig in a little bit. Having a bit of trouble locating the runs The Tekton dashboard is crashing:
Youngest pipeline is a 141 days. I'm going to try GC'ing the old runs to see if that helps. |
Looks like they are currently running in the
|
Clean up
|
Here are the logs for one of the pods creating the PR It looks like |
* The scripts to generate the tests now depend on the jinja2 library but its not in the container. * Add some docs for debugging. * Related to kubeflow#631
@pingsutw We will also want to create a v1.1.0 config testing/apps-cd/applications.yaml Line 173 in e72b7f5
This should point at the v1.1-branches of the applications and the v1.1-branch of the manifests /cc @andreyvelich |
* The scripts to generate the tests now depend on the jinja2 library but its not in the container. * Add some docs for debugging. * Related to kubeflow#631 * Catch FilenotFoundErrors * The problem is that on master the location of some of the kustomize manifests changes (e.g. v3 versions) but for the v1.0 branches these paths won't exist. So we should just catch these errors and continue. * Update the docker image used by the Tekton pipeline because we need jinja2.
* The scripts to generate the tests now depend on the jinja2 library but its not in the container. * Add some docs for debugging. * Related to #631 * Catch FilenotFoundErrors * The problem is that on master the location of some of the kustomize manifests changes (e.g. v3 versions) but for the v1.0 branches these paths won't exist. So we should just catch these errors and continue. * Update the docker image used by the Tekton pipeline because we need jinja2.
Design Doc: http://bit.ly/kf_kustomize_v3
We will likely refactor the kustomize packages to create a more kustomize native experience.
To do that we will need to update our apps CD scripts to specify the locations of the new images.
https://github.com/kubeflow/testing/blob/master/apps-cd/applications.yaml
We should likely keep the locations of the new and old packages so they stay in sync until we get rid of the old packages.
The text was updated successfully, but these errors were encountered: