WIP: cluster-api - boot nodes without state store access #8198

Closed

justinsb
Member

@justinsb justinsb commented Dec 24, 2019

This is the other half of the initial cluster-api spike (the first half is #8151).

Instead of requiring nodes to read from the state store (S3 or GCS) in nodeup, we spin up a gRPC server in kops-controller that serves the artifacts. Currently kops-controller still loads them from S3/GCS, but this means the node doesn't need state store permissions. We're also getting closer to having no S3/GCS dependency, if kops-controller can generate more of these artifacts on the fly (it's only the kubelet & kube-proxy keypairs).

What this now allows is dynamic creation of InstanceGroups via the k8s API.

We were previously building it in development using bazel, but then using go build for the release. That is risky; better to just use bazel everywhere.
We want to start re-using it for cluster-api support.
This is the first step to supporting cluster-api; starting with GCE for now.

Slight problem at the moment: kubernetes/kubernetes#86509
This allows us to create an InstanceGroup without needing to upload to the state store (S3/GCS).
If we're not going to use the docker images, we don't need to preload them; this saves a bit off the boot time.
@k8s-ci-robot added the following labels on Dec 24, 2019: do-not-merge/work-in-progress (Indicates that a PR should not merge because it is a work in progress.); cncf-cla: yes (Indicates the PR's author has signed the CNCF CLA.); size/XXL (Denotes a PR that changes 1000+ lines, ignoring generated files.)
@k8s-ci-robot
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: justinsb

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot added the approved label (Indicates a PR has been approved by an approver from all required OWNERS files.) on Dec 24, 2019
Contents: fi.NewBytesResource(pkiutil.EncodePrivateKeyPEM(serverKey)),
Type: nodetasks.FileType_File,

// TODO: Because we run as an unprivileged user (1000), we can't use hostPaths with mode 600
Member

Perhaps specify the file Owner?

}

func (b *Builder) BuildMachineDeployment(clusterObj *kopsv1alpha2.Cluster, igObj *kopsv1alpha2.InstanceGroup) ([]*unstructured.Unstructured, error) {
cloudup.AlphaAllowGCE.SetEnabled(true)
Contributor

I guess it is necessary to enable this builder to run GCE. But I wonder if we can move it somewhere else, where it does not bloat an unrelated context.

@k8s-ci-robot added the needs-rebase label (Indicates a PR cannot be merged because it has merge conflicts with HEAD.) on Feb 15, 2020
@k8s-ci-robot
Contributor

@justinsb: PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@k8s-ci-robot
Contributor

@justinsb: The following tests failed, say /retest to rerun all failed tests:

| Test name | Commit | Details | Rerun command |
| --- | --- | --- | --- |
| pull-kops-verify-packages | e7015f9 | link | /test pull-kops-verify-packages |
| pull-kops-verify-boilerplate | e7015f9 | link | /test pull-kops-verify-boilerplate |
| pull-kops-bazel-test | e7015f9 | link | /test pull-kops-bazel-test |
| pull-kops-verify-staticcheck | e7015f9 | link | /test pull-kops-verify-staticcheck |
| pull-kops-e2e-cni-cilium | e7015f9 | link | /test pull-kops-e2e-cni-cilium |
| pull-kops-e2e-cni-flannel | e7015f9 | link | /test pull-kops-e2e-cni-flannel |
| pull-kops-e2e-cni-weave | e7015f9 | link | /test pull-kops-e2e-cni-weave |
| pull-kops-e2e-cni-amazonvpc | e7015f9 | link | /test pull-kops-e2e-cni-amazonvpc |
| pull-kops-e2e-cni-kuberouter | e7015f9 | link | /test pull-kops-e2e-cni-kuberouter |
| pull-kops-verify-cloudformation | e7015f9 | link | /test pull-kops-verify-cloudformation |
| pull-kops-verify-hashes | e7015f9 | link | /test pull-kops-verify-hashes |
| pull-kops-e2e-k8s-containerd | e7015f9 | link | /test pull-kops-e2e-k8s-containerd |


Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

@olemarkus
Member

This PR is partially superseded by #10469, but it also adds support for MachineDeployment resources.

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle stale

@k8s-ci-robot added the lifecycle/stale label (Denotes an issue or PR has remained open with no activity and has become stale.) on May 26, 2021
@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle rotten

@k8s-ci-robot added the lifecycle/rotten label (Denotes an issue or PR that has aged beyond stale and will be auto-closed.) and removed the lifecycle/stale label (Denotes an issue or PR has remained open with no activity and has become stale.) on Jul 6, 2021
@k8s-triage-robot

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-contributor-experience at kubernetes/community.
/close

@k8s-ci-robot
Contributor

@k8s-triage-robot: Closed this PR.


Labels
approved (Indicates a PR has been approved by an approver from all required OWNERS files.)
cncf-cla: yes (Indicates the PR's author has signed the CNCF CLA.)
do-not-merge/work-in-progress (Indicates that a PR should not merge because it is a work in progress.)
lifecycle/rotten (Denotes an issue or PR that has aged beyond stale and will be auto-closed.)
needs-rebase (Indicates a PR cannot be merged because it has merge conflicts with HEAD.)
size/XXL (Denotes a PR that changes 1000+ lines, ignoring generated files.)
7 participants