v0.4.1 - 2020-09-15

This is a patch release which includes mainly bug fixes.

NOTE: Please read the upgrading guidelines here.

Changes in v0.4.1

Component updates

Dex: convert to Helm chart and update to v2.25.0 (#962).

Features

feat: add severity labels to MetalLB alerts (#925).

Bug fixes

Override memory limits of rook operator to 512Mi (#938).
Fix envoy grafana dashboard errors (#969).
MetalLB: Fix regressions of tolerations and nodeSelectors (#927).
Fix controlplane components update order (#937).
component/metallb: Fix controller tolerations (#931).
Increased the node-ready and cluster-ping timeouts (#952).

Docs

docs: fix etcd version upgrade sed expression (#921).
docs: fix rook version update command (#930).

Development

Fix output of convertNodeSelector in rook (#945).
httpbin: convert to Helm chart (#965).
FLUO: convert to Helm chart (#935).
Makefile: Don't build before linting and add new target lint-bin (#901).

v0.4.0 - 2020-09-07

We're happy to announce the release of Lokomotive v0.4.0 (Darjeeling Himalayan).

This release packs new features, bug fixes, code optimizations, a better user interface, the latest versions of components, security hardening, and much more.

Changes in v0.4.0

Kubernetes Updates

Update Kubernetes version to v1.18.8 (#861).

Platform updates

AKS

Update Kubernetes version to 1.17.9 (#849).

AWS

AWS: Add support for custom taints and labels (#832).

New Components

Add component experimental-istio-operator (#686).
Add component experimental-linkerd (#690).

Component updates

Update etcd to v3.4.13 (#838).
Update Calico to v3.15.2 (#841).
Update Grafana to 7.1.4 and chart version 5.5.5 (#842).
Update Velero chart to 1.4.2 (#830).
Update ExternalDNS chart to 3.3.0 (#845).
Update Amazon Elastic Block Store (EBS) CSI driver to v0.6.0 (#856).
Update Cluster Autoscaler to v2 version 1.0.2 (#859).
Update cert-manager to v0.16.1 (#847).
Update OpenEBS to v1.12.0 (#781).
Update MetalLB to v0.1.0-789-g85b7a46a (#885).
Update Rook to v1.4.2 (#879).
Use new bootkube image at version v0.14.0-helm-7047a87 (#775), later updated to v0.14.0-helm-ec64535 as a part of (#704).
Update Prometheus operator to 0.41.0 and chart version 9.3.0 (#757).
Update Contour to v1.7.0 (#771).

Terraform Providers Updates

Update all Terraform providers to latest versions (#835).

UX

Add autocomplete for bash and zsh in lokoctl (#880).

Run the following command to start using auto-completion for lokoctl:
```
source <(lokoctl completion bash)
```
Add kubeconfig fallback to Terraform state (#701).

Features

Add label lokomotive.kinvolk.io/name: <namespace_name> to all namespaces (#646).
Add admission webhook to lokomotive, which disables automounting default service account token (#704).
[Breaking Change] Kubelet now joins the cluster using TLS bootstrapping. Add enable_tls_bootstrap = false to disable it (#618).
Add csi_plugin_node_selector and csi_plugin_toleration for rook-ceph's CSI plugin (#892).

Docs

Setting up third party OAuth for Grafana (#542).
Upgrading bootstrap kubelet (#592).
Upgrading etcd (#802).
How to add custom monitoring resources? (#554).
Kubernetes storage with Rook Ceph on Packet cloud (#494).

Bug fixes

aws: Add check for multiple worker pools with same LB ports (#889).
packet: ignore changes to plan and user_data on controller nodes (#907).
Introduce platform.PostApplyHook interface and implement it for AKS cluster (#886).
aws-ebs-csi-driver: add NetworkPolicy allowing access to metadata (#865).
pkg/components/cluster-autoscaler: fix checking device uniqueness (#768).

Development

Replace use of github.com/pkg/errors.Wrapf with fmt (#831, #877).
Refactor assets handling (#807).
cli/cmd: improve --kubeconfig-file flag help message formatting (#818).
Use host's /etc/hosts entries for bootkube (#409).
Refactor Terraform executor (#794).
Pass kubeconfig content around rather than a file path (#631).

Upgrading from v0.3.0

Lokoctl Host binary upgrades

terraform-provider-ct

Update the ct Terraform provider to v0.6.1. Find the install instructions here.

Disable TLS Bootstrap

In this release we introduced TLS bootstrapping and we enable it by default. To avoid cluster recreation, disable it by adding the following attribute to the cluster ... block:

cluster "packet" {
  enable_tls_bootstrap = false
...

Cluster upgrade steps

Go to your cluster's directory and run the following command:

lokoctl cluster apply --skip-components -v

The update process typically takes about 10 minutes. After the update, running lokoctl health should result in an output similar to the following.

Node                     Ready    Reason          Message

lokomotive-controller-0  True     KubeletReady    kubelet is posting ready status
lokomotive-1-worker-0    True     KubeletReady    kubelet is posting ready status
lokomotive-1-worker-1    True     KubeletReady    kubelet is posting ready status
lokomotive-1-worker-2    True     KubeletReady    kubelet is posting ready status
Name      Status    Message              Error

etcd-0    True      {"health":"true"}
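
For scripted upgrades, the health check can be automated. The following is a minimal sketch, assuming the lokoctl health output keeps the two-table layout shown above (node table first, then the etcd table). The function name not_ready_nodes is hypothetical and not part of lokoctl.

```shell
# Hypothetical helper: print the names of nodes whose Ready column is not "True".
# Assumes the node name is in column 1 and the Ready value in column 2, and that
# the node table ends where the "Name  Status ..." header of the etcd table begins.
not_ready_nodes() {
  awk '
    $1 == "Node" && $2 == "Ready"  { in_nodes = 1; next }  # header of the node table
    $1 == "Name" && $2 == "Status" { in_nodes = 0 }        # header of the etcd table
    in_nodes && NF >= 2 && $2 != "True" { print $1 }
  '
}
```

For example, `lokoctl health | not_ready_nodes` prints nothing when every node reports Ready as True.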

Cluster nodes component upgrade (optional)

Manually upgrade etcd following the steps mentioned in the doc here.
Manually upgrade the kubelet running on the nodes, by following the steps mentioned in the doc here.

Manual Cluster Changes

The latest version of MetalLB changes the labels of the ingress nodes. Label all the nodes that have an ASN set with the new labels:

kubectl label $(kubectl get nodes -o name -l metallb.universe.tf/my-asn) \
  metallb.lokomotive.io/my-asn=65000 metallb.lokomotive.io/peer-asn=65530

Find the peer address of each node and assign it to the new label:

for node in $(kubectl get nodes -o name -l metallb.universe.tf/peer-address); do
  peer_ip=$(kubectl get $node -o jsonpath='{.metadata.labels.metallb\.universe\.tf/peer-address}')
  kubectl label $node metallb.lokomotive.io/peer-address=$peer_ip
done

Now it is safe to update:

lokoctl component apply metallb

Ceph Upgrade steps

These steps are curated from the upgrade doc provided by rook: https://rook.io/docs/rook/master/ceph-upgrade.html.

Keep note of the CSI images:

kubectl --namespace rook get pod -o \
  jsonpath='{range .items[*]}{range .spec.containers[*]}{.image}{"\n"}' \
  -l 'app in (csi-rbdplugin,csi-rbdplugin-provisioner,csi-cephfsplugin,csi-cephfsplugin-provisioner)' | \
  sort | uniq

Ensure autoscale is on

In the toolbox pod, ensure that the output of the command ceph osd pool autoscale-status | grep replicapool says on (in the last column) and not warn. If it says warn, run the command ceph osd pool set replicapool pg_autoscale_mode on to set it to on. This avoids the issue described in rook/rook#5608.
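
The last-column check can also be scripted. This is a minimal sketch, assuming the pool name is the first column of the autoscale-status output and the autoscale mode is the last column, as described above. The function name pool_autoscale_mode is hypothetical, and awk is assumed to be available wherever the function runs.

```shell
# Hypothetical helper: print the autoscale mode (last column) for one pool.
# Reads `ceph osd pool autoscale-status` output on stdin; the pool name is
# passed as the first argument.
pool_autoscale_mode() {
  awk -v pool="$1" '$1 == pool { print $NF }'
}
```

For example, `ceph osd pool autoscale-status | pool_autoscale_mode replicapool` should print on once autoscaling is enabled for the pool.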

Read more about the toolbox pod here: https://github.com/kinvolk/lokomotive/blob/v0.4.0/docs/how-to-guides/rook-ceph-storage.md#enable-and-access-toolbox.

NOTE: If you see the error [errno 5] RADOS I/O error (error connecting to the cluster) in the toolbox pod, pin the toolbox pod image to a specific version using this command: kubectl -n rook set image deploy rook-ceph-tools rook-ceph-tools=rook/ceph:v1.3.2.

Ceph Status

Run the following in the toolbox pod:
```
watch ceph status
```
Ensure that the output reports health as HEALTH_OK. Compare the output with the expected status explained here: https://rook.io/docs/rook/master/ceph-upgrade.html#status-output.

Pods in rook namespace:

Watch the status of the pods in the rook namespace in another terminal window. Running this is enough:
```
watch kubectl -n rook get pods -o wide
```

Watch for the rook version update

Run the following command to keep an eye on the rook version update as it rolls out to all the components:

watch --exec kubectl -n rook get deployments -l rook_cluster=rook -o jsonpath='{range .items[*]}{.metadata.name}{"  \treq/upd/avl: "}{.spec.replicas}{"/"}{.status.updatedReplicas}{"/"}{.status.readyReplicas}{"  \trook-version="}{.metadata.labels.rook-version}{"\n"}{end}'

You should see that rook-version slowly changes to v1.4.2.
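
To list only the deployments that have not reached the target version yet, the output of the command above can be filtered. This is a minimal sketch, assuming each output line ends with a rook-version=<version> field as produced by the jsonpath template above. The function name pending_rook_upgrades is hypothetical.

```shell
# Hypothetical helper: print the names of deployments whose last field is not
# "rook-version=<target>". Reads the jsonpath output shown above on stdin;
# the target version is passed as the first argument.
pending_rook_upgrades() {
  awk -v want="rook-version=$1" 'NF > 0 && $NF != want { print $1 }'
}
```

Run the kubectl command above without the leading watch --exec and pipe it into `pending_rook_upgrades v1.4.2`. An empty result means every deployment carries the new rook-version label.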

Watch for the Ceph version update

Run the following command to keep an eye on the Ceph version update as the new pods come up:

watch --exec kubectl -n rook get deployments -l rook_cluster=rook -o jsonpath='{range .items[*]}{.metadata.name}{"  \treq/upd/avl: "}{.spec.replicas}{"/"}{.status.updatedReplicas}{"/"}{.status.readyReplicas}{"  \tceph-version="}{.metadata.labels.ceph-version}{"\n"}{end}'

You should see that ceph-version slowly changes to 15.

Keep an eye on the events in the rook namespace
```
kubectl -n rook get events -w
```
Ceph Dashboard

Keep the dashboard open in one window, although it can be more of a hassle than a help: it keeps reloading and logs you out automatically. See how to access the dashboard here: https://github.com/kinvolk/lokomotive/blob/v0.4.0/docs/how-to-guides/rook-ceph-storage.md#access-the-ceph-dashboard.

Grafana dashboards

Keep an eye on the Grafana dashboard, but note that the data there lags behind. The most reliable state of the system comes from the watch running inside the toolbox pod.

Run updates

kubectl apply -f https://raw.githubusercontent.com/kinvolk/lokomotive/v0.4.0/assets/charts/components/rook/templates/resources.yaml
lokoctl component apply rook rook-ceph

Verify that the CSI images are updated:

kubectl --namespace rook get pod -o jsonpath='{range .items[*]}{range .spec.containers[*]}{.image}{"\n"}' -l 'app in (csi-rbdplugin,csi-rbdplugin-provisioner,csi-cephfsplugin,csi-cephfsplugin-provisioner)' | sort | uniq
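
If the image list noted before the update was saved to a file, the two lists can be compared mechanically. This is a minimal sketch, assuming the output of the command was redirected to one file before the update and to another after it. The function name new_images and the file names in the usage note are hypothetical.

```shell
# Hypothetical helper: print the images that appear in the "after" list but
# not in the "before" list. Both files must be sorted, which the `sort | uniq`
# at the end of the command above already guarantees.
new_images() {
  comm -13 "$1" "$2"   # suppress lines unique to file 1 and lines common to both
}
```

For example, `new_images csi-images-before.txt csi-images-after.txt` should list the updated CSI images. An empty result suggests the images did not change.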

Final checks:

Once everything is up to date, run the following commands in the toolbox pod:
```
ceph status
ceph osd status
ceph df
rados df
```

OpenEBS

OpenEBS control plane components and data plane components work independently. Even after the OpenEBS Control Plane components have been upgraded to 1.12.0, the Storage Pools and Volumes (both jiva and cStor) will continue to work with older versions.

Upgrade functionality is still under active development. It is highly recommended to schedule a downtime for the application using the OpenEBS PV while performing this upgrade. Also, make sure you have taken a backup of the data before starting the below upgrade procedure. - OpenEBS documentation

Upgrade the component by running the following steps:

lokoctl component apply openebs-operator openebs-storage-class

Upgrade cStor Pools

Extract the SPC name using the following command and replace it in the subsequent YAML file:

$ kubectl get spc
NAME                          AGE
cstor-pool-openebs-replica1   24h

The Job spec for upgrading cStor pools is:

# This is an example YAML for upgrading cstor SPC.
# Some of the values below need to be changed to
# match your openebs installation. The fields are
# indicated with VERIFY
---
apiVersion: batch/v1
kind: Job
metadata:
  # VERIFY that you have provided a unique name for this upgrade job.
  # The name can be any valid K8s string for name. This example uses
  # the following convention: cstor-spc-<flattened-from-to-versions>
  name: cstor-spc-11101120

  # VERIFY the value of namespace is same as the namespace where openebs components
  # are installed. You can verify using the command:
  # `kubectl get pods -n <openebs-namespace> -l openebs.io/component-name=maya-apiserver`
  # The above command should return status of the openebs-apiserver.
  namespace: openebs
spec:
  backoffLimit: 4
  template:
    spec:
      # VERIFY the value of serviceAccountName is pointing to service account
      # created within openebs namespace. Find the non-default account
      # by running `kubectl get sa -n <openebs-namespace>`
      serviceAccountName: openebs-operator
      containers:
      - name:  upgrade
        args:
        - "cstor-spc"

        # --from-version is the current version of the pool
        - "--from-version=1.11.0"

        # --to-version is the desired upgrade version
        - "--to-version=1.12.0"

        # Bulk upgrade is supported from 1.9
        # To make use of it, please provide the list of SPCs
        # as mentioned below
        - "cstor-pool-openebs-replica1"
        # For upgrades older than 1.9.0, use
        # '--spc-name=<spc_name>' format as
        # below commented line
        # - "--spc-name=cstor-sparse-pool"

        #Following are optional parameters
        #Log Level
        - "--v=4"
        #DO NOT CHANGE BELOW PARAMETERS
        env:
        - name: OPENEBS_NAMESPACE
          valueFrom:
            fieldRef:
              fieldPath: metadata.namespace
        tty: true

        # the image version should be same as the --to-version mentioned above
        # in the args of the Job
        image: quay.io/openebs/m-upgrade:1.12.0
        imagePullPolicy: Always
      restartPolicy: OnFailure
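
The naming convention mentioned in the Job metadata above (cstor-spc-<flattened-from-to-versions>) can be sketched as follows. This is only an illustration, assuming that "flattened" means removing the dots from both versions and concatenating them, which matches the example name cstor-spc-11101120. The function name flatten_versions is hypothetical.

```shell
# Hypothetical helper: strip the dots from the from/to versions and join them,
# e.g. to build a unique suffix for the upgrade Job name.
flatten_versions() {
  printf '%s%s\n' \
    "$(printf '%s' "$1" | tr -d '.')" \
    "$(printf '%s' "$2" | tr -d '.')"
}
```

For example, `echo "cstor-spc-$(flatten_versions 1.11.0 1.12.0)"` yields the Job name used in the spec above.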

Apply the Job manifest using kubectl. Check the logs of the pod started by the Job:

$ kubectl logs -n openebs cstor-spc-11101120-dc7kx
..
..
..
I0903 12:25:00.397066       1 spc_upgrade.go:102] Upgrade Successful for spc cstor-pool-openebs-replica1
I0903 12:25:00.397091       1 cstor_spc.go:120] Successfully upgraded storagePoolClaim{cstor-pool-openebs-replica1} from 1.11.0 to 1.12.0

Upgrade cStor volumes

Extract the cStor volume names using the following command and replace them in the subsequent YAML file:

$ kubectl get cstorvolumes -A
NAMESPACE   NAME                                       STATUS    AGE   CAPACITY
openebs     pvc-3415af20-db82-42cf-99e0-5d0f2809c657   Healthy   72m   50Gi
openebs     pvc-c3d0b587-5da9-457b-9d0e-23331ade7f3d   Healthy   77m   50Gi
openebs     pvc-e115f3f9-1666-4680-a932-d05bfd049087   Healthy   77m   100Gi

Create a Kubernetes Job spec for upgrading the cstor volume. An example spec is as follows:

# This is an example YAML for upgrading cstor volume.
# Some of the values below need to be changed to
# match your openebs installation. The fields are
# indicated with VERIFY
---
apiVersion: batch/v1
kind: Job
metadata:
  # VERIFY that you have provided a unique name for this upgrade job.
  # The name can be any valid K8s string for name. This example uses
  # the following convention: cstor-vol-<flattened-from-to-versions>
  name: cstor-vol-11101120

  # VERIFY the value of namespace is same as the namespace where openebs components
  # are installed. You can verify using the command:
  # `kubectl get pods -n <openebs-namespace> -l openebs.io/component-name=maya-apiserver`
  # The above command should return the status of the openebs-apiserver.
  namespace: openebs

spec:
  backoffLimit: 4
  template:
    spec:
      # VERIFY the value of serviceAccountName is pointing to service account
      # created within openebs namespace. Find the non-default account
      # by running `kubectl get sa -n <openebs-namespace>`
      serviceAccountName: openebs-operator
      containers:
      - name:  upgrade
        args:
        - "cstor-volume"

        # --from-version is the current version of the volume
        - "--from-version=1.11.0"

        # --to-version is the desired upgrade version
        - "--to-version=1.12.0"

        # Bulk upgrade is supported from 1.9
        # To make use of it, please provide the list of cstor volumes
        # as mentioned below
        - "pvc-3415af20-db82-42cf-99e0-5d0f2809c657"
        - "pvc-c3d0b587-5da9-457b-9d0e-23331ade7f3d"
        - "pvc-e115f3f9-1666-4680-a932-d05bfd049087"
        # For upgrades older than 1.9.0, use
        # '--pv-name=<pv_name>' format as
        # below commented line
        # - "--pv-name=pvc-c630f6d5-afd2-11e9-8e79-42010a800065"

        #Following are optional parameters
        #Log Level
        - "--v=4"
        #DO NOT CHANGE BELOW PARAMETERS
        env:
        - name: OPENEBS_NAMESPACE
          valueFrom:
            fieldRef:
              fieldPath: metadata.namespace
        tty: true

        # the image version should be same as the --to-version mentioned above
        # in the args of the job
        image: quay.io/openebs/m-upgrade:1.12.0
        imagePullPolicy: Always
      restartPolicy: OnFailure
---

Apply the Job manifest using kubectl. Check the logs of the pod started by the Job:

$ kubectl logs -n openebs cstor-vol-11101120-8b2h9
..
..
..
I0903 12:41:41.984635       1 cstor_volume_upgrade.go:609] Upgrade Successful for cstor volume pvc-e115f3f9-1666-4680-a932-d05bfd049087
I0903 12:41:41.994013       1 cstor_volume.go:119] Successfully upgraded cstorVolume{pvc-e115f3f9-1666-4680-a932-d05bfd049087} from 1.11.0 to 1.12.0

Verify that all the volumes are updated to the latest version by running the following command:

$ kubectl get cstorvolume -A -o jsonpath='{.items[*].versionDetails.status.current}'
1.12.0 1.12.0 1.12.0
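
When there are many volumes, the version list is easier to check with a small helper than by eye. This is a minimal sketch that takes the expected version followed by the versions printed by the command above. The function name all_at_version is hypothetical.

```shell
# Hypothetical helper: succeed only if at least one version was reported and
# every reported version equals the expected one.
all_at_version() {
  expected="$1"
  shift
  [ "$#" -gt 0 ] || return 1            # no versions reported: treat as failure
  for current in "$@"; do
    [ "$current" = "$expected" ] || return 1
  done
  return 0
}
```

For example, `all_at_version 1.12.0 $(kubectl get cstorvolume -A -o jsonpath='{.items[*].versionDetails.status.current}') && echo "all volumes upgraded"` prints the message only when every volume reports 1.12.0.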

Upgrade other components

Other components are safe to upgrade by running the following command:

lokoctl component apply <component name>

v0.3.0 - 2020-07-31

We're happy to announce the release of Lokomotive v0.3.0 (Coast Starlight).

This release packs new features and bugfixes. Some of the highlights are:

Kubernetes 1.18.6
For Lokomotive clusters running on top of AKS, Kubernetes 1.16.10 is installed.
Component updates

Changes in v0.3.0

Kubernetes updates

Update Kubernetes to v1.18.6 (#726).

Platform updates

Packet

Update default machine type from t1.small.x86 to c3.small.x86, since t1.small.x86 machines are EOL and no longer available in new Packet projects (#612).

WARNING: If you haven't explicitly defined the controller_type and/or worker_pool.node_type configuration options, upgrading to this release will replace your controller and/or worker nodes with c3.small.x86 machines thereby losing all your cluster data. To avoid this, set these configuration options to the desired values.

Make sure that the below attributes are explicitly defined in your cluster configuration. This only applies to machine type t1.small.x86.
```
cluster "packet" {
  .
  .
  controller_type = "t1.small.x86"
  .
  .
  worker_pool "pool-name" {
    .
    node_type = "t1.small.x86"
    .
  }
}
```

AKS

Update Kubernetes version to 1.16.10 (#712).

Component updates

openebs: update to 1.11.0 (#673).
calico: update to v3.15.0 (#652).

UX

prometheus-operator: Organize Prometheus related attributes under a prometheus block in the configuration (#710).
Use prometheus.ingress.host to expose Prometheus instead of prometheus_external_url (#710).
contour: Remove ingress_hosts from contour configuration (#635).

Features

Add enable_toolbox attribute to rook-ceph component (#649). This allows managing and configuring Ceph using toolbox pod.
Add Prometheus feature external_labels for federated clusters to Prometheus operator component. This helps to identify metrics queried from different clusters. (#710).

Docs

Add Type column to Attribute reference table in configuration references (#651).
Update contour configuration reference for usage with AWS (#674).
Add documentation related to the usage of clc_snippets for Packet and AWS (#657).
Improve documentation on using remote backends (#670).
How-to guide for setting up monitoring on Lokomotive (#480).
Add codespell section in development documentation (#700).
Include a demo GIF in the readme (#636).

Bugfixes

Remove contour ingress workaround (due to an upstream issue) for ExternalDNS (#635).

Development

Do not show Helm release values in terraform output (#627).
Remove Terraform provider aliases from platforms code (#617).

Miscellaneous

Following flatcar/Flatcar#123, Flatcar 2513.1.0 for ARM contains the dig binary so the workaround is no longer needed (#703).
Improve error message for wait-for-dns output (#735).
Add codespell to enable spell check on all PRs (#661).

Upgrading from v0.2.1

Configuration syntax changes

There have been some minor changes to the configuration of the following components:

contour
prometheus-operator

Please make sure the new configuration structure is in place before the upgrade.

Contour component

The optional ingress_hosts attribute has been removed.

old:

component "contour" {
  .
  .
  ingress_hosts = ["*.example.lokomotive-k8s.net"]
}

new:

component "contour" {
  .
  .
}

Prometheus-operator component

Prometheus specific attributes are now under a prometheus block.
A new optional prometheus.ingress sub-block is introduced to expose Prometheus over ingress.
The prometheus_external_url attribute is removed. The host is now configured under prometheus.ingress.host. Remove the URL scheme (e.g. https://) and URI (e.g. /prometheus) when configuring it. A URI is no longer supported and the protocol is always HTTPS.
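
The scheme and URI stripping described above can be sketched with plain shell parameter expansion. This is only an illustration for deriving the new host value from an old URL. The function name url_to_ingress_host is hypothetical, and a port in the URL, if present, is left untouched.

```shell
# Hypothetical helper: reduce an old external URL value to the bare host
# expected by prometheus.ingress.host.
url_to_ingress_host() {
  host="${1#*://}"     # drop the scheme, e.g. "https://"
  host="${host%%/*}"   # drop the URI, e.g. "/prometheus"
  printf '%s\n' "$host"
}
```

For example, `url_to_ingress_host "https://prometheus.example.lokomotive-k8s.net/prometheus"` prints prometheus.example.lokomotive-k8s.net.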

old:

component "prometheus-operator" {
  .
  .
  prometheus_metrics_retention = "14d"
  prometheus_external_url      = "https://prometheus.example.lokomotive-k8s.net"
  prometheus_storage_size      = "50GiB"
  prometheus_node_selector = {
    "kubernetes.io/hostname" = "worker3"
  }
  .
  .
}

new:

component "prometheus-operator" {
  .
  .
  prometheus {
    metrics_retention = "14d"
    storage_size      = "50GiB"
    node_selector = {
      "kubernetes.io/hostname" = "worker3"
    }

    ingress {
      host = "prometheus.example.lokomotive-k8s.net"
    }
    .
    .
  }
  .
  .
}

Check out the new syntax in the Prometheus Operator configuration reference for details.

Upgrade steps

Go to your cluster's directory and run the following command.

lokoctl cluster apply

The update process typically takes about 10 minutes. After the update, running lokoctl health should result in an output similar to the following.

Node                     Ready    Reason          Message

lokomotive-controller-0  True     KubeletReady    kubelet is posting ready status
lokomotive-1-worker-0    True     KubeletReady    kubelet is posting ready status
lokomotive-1-worker-1    True     KubeletReady    kubelet is posting ready status
lokomotive-1-worker-2    True     KubeletReady    kubelet is posting ready status
Name      Status    Message              Error

etcd-0    True      {"health":"true"}

Post upgrade steps

OpenEBS

OpenEBS control plane components and data plane components work independently. Even after the OpenEBS Control Plane components have been upgraded to 1.11.0, the Storage Pools and Volumes (both jiva and cStor) will continue to work with older versions.

Upgrade functionality is still under active development. It is highly recommended to schedule a downtime for the application using the OpenEBS PV while performing this upgrade. Also, make sure you have taken a backup of the data before starting the below upgrade procedure. - OpenEBS documentation

Upgrade cStor Pools

Extract the SPC name using kubectl get spc:

NAME                          AGE
cstor-pool-openebs-replica1   24h

The Job spec for upgrading cStor pools is:

#This is an example YAML for upgrading cstor SPC.
#Some of the values below need to be changed to
#match your openebs installation. The fields are
#indicated with VERIFY
---
apiVersion: batch/v1
kind: Job
metadata:
  #VERIFY that you have provided a unique name for this upgrade job.
  #The name can be any valid K8s string for name. This example uses
  #the following convention: cstor-spc-<flattened-from-to-versions>
  name: cstor-spc-1001120

  #VERIFY the value of namespace is same as the namespace where openebs components
  # are installed. You can verify using the command:
  # `kubectl get pods -n <openebs-namespace> -l openebs.io/component-name=maya-apiserver`
  # The above command should return status of the openebs-apiserver.
  namespace: openebs
spec:
  backoffLimit: 4
  template:
    spec:
      #VERIFY the value of serviceAccountName is pointing to service account
      # created within openebs namespace. Find the non-default account
      # by running `kubectl get sa -n <openebs-namespace>`
      serviceAccountName: openebs-operator
      containers:
      - name:  upgrade
        args:
        - "cstor-spc"

        # --from-version is the current version of the pool
        - "--from-version=1.10.0"

        # --to-version is the desired upgrade version
        - "--to-version=1.11.0"

        # Bulk upgrade is supported from 1.9
        # To make use of it, please provide the list of SPCs
        # as mentioned below
        - "cstor-pool-name"
        # For upgrades older than 1.9.0, use
        # '--spc-name=<spc_name>' format as
        # below commented line
        # - "--spc-name=cstor-sparse-pool"

        #Following are optional parameters
        #Log Level
        - "--v=4"
        #DO NOT CHANGE BELOW PARAMETERS
        env:
        - name: OPENEBS_NAMESPACE
          valueFrom:
            fieldRef:
              fieldPath: metadata.namespace
        tty: true

        # the image version should be same as the --to-version mentioned above
        # in the args of the job
        image: quay.io/openebs/m-upgrade:1.11.0
        imagePullPolicy: Always
      restartPolicy: OnFailure

Apply the Job manifest using kubectl. Check the logs of the pod started by the Job:

$ kubectl logs -n openebs cstor-spc-1001120-dc7kx
..
..
..
I0728 15:15:41.321450       1 spc_upgrade.go:102] Upgrade Successful for spc cstor-pool-openebs-replica1
I0728 15:15:41.321473       1 cstor_spc.go:120] Successfully upgraded storagePoolClaim{cstor-pool-openebs-replica1} from 1.10.0 to 1.11.0

Upgrade cStor volumes

Extract the PV name using kubectl get pv:

$ kubectl get pv
NAME                                       CAPACITY   ACCESS MODES   RECLAIM POLICY   STATUS   CLAIM                                                             STORAGECLASS       REASON   AGE
pvc-b69260c4-5cc1-4461-b762-851fa53629d9   50Gi       RWO            Delete           Bound    monitoring/data-alertmanager-prometheus-operator-alertmanager-0   openebs-replica1            24h
pvc-da29e4fe-1841-4da9-a8f6-4e3c92943cbb   50Gi       RWO            Delete           Bound    monitoring/data-prometheus-prometheus-operator-prometheus-0       openebs-replica1            24h

Create a Kubernetes Job spec for upgrading the cstor volume. An example spec is as follows:

#This is an example YAML for upgrading cstor volume.
#Some of the values below need to be changed to
#match your openebs installation. The fields are
#indicated with VERIFY
---
apiVersion: batch/v1
kind: Job
metadata:
  #VERIFY that you have provided a unique name for this upgrade job.
  #The name can be any valid K8s string for name. This example uses
  #the following convention: cstor-vol-<flattened-from-to-versions>
  name: cstor-vol-1001120

  #VERIFY the value of namespace is same as the namespace where openebs components
  # are installed. You can verify using the command:
  # `kubectl get pods -n <openebs-namespace> -l openebs.io/component-name=maya-apiserver`
  # The above command should return status of the openebs-apiserver.
  namespace: openebs

spec:
  backoffLimit: 4
  template:
    spec:
      #VERIFY the value of serviceAccountName is pointing to service account
      # created within openebs namespace. Find the non-default account
      # by running `kubectl get sa -n <openebs-namespace>`
      serviceAccountName: openebs-operator
      containers:
      - name:  upgrade
        args:
        - "cstor-volume"

        # --from-version is the current version of the volume
        - "--from-version=1.10.0"

        # --to-version is the desired upgrade version
        - "--to-version=1.11.0"

        # Bulk upgrade is supported from 1.9
        # To make use of it, please provide the list of PVs
        # as mentioned below
        - "pvc-b69260c4-5cc1-4461-b762-851fa53629d9"
        - "pvc-da29e4fe-1841-4da9-a8f6-4e3c92943cbb"
        # For upgrades older than 1.9.0, use
        # '--pv-name=<pv_name>' format as
        # below commented line
        # - "--pv-name=pvc-c630f6d5-afd2-11e9-8e79-42010a800065"

        #Following are optional parameters
        #Log Level
        - "--v=4"
        #DO NOT CHANGE BELOW PARAMETERS
        env:
        - name: OPENEBS_NAMESPACE
          valueFrom:
            fieldRef:
              fieldPath: metadata.namespace
        tty: true

        # the image version should be same as the --to-version mentioned above
        # in the args of the job
        image: quay.io/openebs/m-upgrade:1.11.0
        imagePullPolicy: Always
      restartPolicy: OnFailure
---

Apply the Job manifest using kubectl. Check the logs of the pod started by the Job:

$ kubectl logs -n openebs cstor-vol-1001120-8b2h9
..
..
..
I0728 15:19:48.496031       1 cstor_volume_upgrade.go:609] Upgrade Successful for cstor volume pvc-da29e4fe-1841-4da9-a8f6-4e3c92943cbb
I0728 15:19:48.502876       1 cstor_volume.go:119] Successfully upgraded cstorVolume{pvc-da29e4fe-1841-4da9-a8f6-4e3c92943cbb} from 1.10.0 to 1.11.0

v0.2.1 - 2020-06-24

This is a patch release to fix AKS platform deployments.

Changes in v0.2.1

Kubernetes updates

Updated Kubernetes version on AKS platform to 1.16.9 (#626). This fixes deploying AKS clusters, as the previously used version is not available anymore.

Security

Updated golang.org/x/text dependency to v0.3.3 (#648) to address CVE-2020-14040.

Bugfixes

Fixes example configuration for AKS platform (#626). Contour component configuration syntax changed and those files had not been updated.

Misc

Bootkube Docker images are now pulled using Docker protocol, as quay.io plans to deprecate pulling images using ACI (#656).

Development

AKS platform is now being tested for every pull request and master branch changes in the CI.
Added script for finding available component updates in upstream repositories (#375).

v0.2.0 - 2020-06-19

We're happy to announce Lokomotive v0.2.0 (Bernina Express).

This release includes a ton of new features, changes and bugfixes. Here are some highlights:

Kubernetes v1.18.3.
Many component updates.
AKS platform support.
Cloudflare DNS support.
Monitoring dashboards fixes.
Dynamic provisioning of Persistent Volumes on AWS.
Security improvements.

Check the full list of changes for more details.

Upgrading from v0.1.0

Prerequisites

All platforms

The Calico component has a new CRD that needs to be applied manually.

kubectl apply -f https://raw.githubusercontent.com/kinvolk/lokomotive/v0.2.0/assets/lokomotive-kubernetes/bootkube/resources/charts/calico/crds/kubecontrollersconfigurations.yaml

Some component objects changed apiVersion so they need to be labeled and annotated manually to be able to upgrade them.

Dex

kubectl -n dex label ingress dex app.kubernetes.io/managed-by=Helm
kubectl -n dex annotate ingress dex meta.helm.sh/release-name=dex
kubectl -n dex annotate ingress dex meta.helm.sh/release-namespace=dex

Gangway

kubectl -n gangway label ingress gangway app.kubernetes.io/managed-by=Helm
kubectl -n gangway annotate ingress gangway meta.helm.sh/release-name=gangway
kubectl -n gangway annotate ingress gangway meta.helm.sh/release-namespace=gangway

Metrics Server

kubectl -n kube-system label rolebinding metrics-server-auth-reader app.kubernetes.io/managed-by=Helm
kubectl -n kube-system annotate rolebinding metrics-server-auth-reader meta.helm.sh/release-namespace=kube-system
kubectl -n kube-system annotate rolebinding metrics-server-auth-reader meta.helm.sh/release-name=metrics-server

httpbin

kubectl -n httpbin label ingress httpbin app.kubernetes.io/managed-by=Helm
kubectl -n httpbin annotate ingress httpbin meta.helm.sh/release-namespace=httpbin
kubectl -n httpbin annotate ingress httpbin meta.helm.sh/release-name=httpbin

AWS

You need to remove an asset we've updated from your assets directory:

rm $ASSETS_DIRECTORY/lokomotive-kubernetes/aws/flatcar-linux/kubernetes/workers.tf

Upgrading

lokocfg syntax changes

Before upgrading, make sure your lokocfg configuration follows the new v0.2.0 syntax. Here we describe the changes.

DNS for the Packet platform

The DNS configuration syntax for the Packet platform has been simplified.

Here's an example for the Route 53 provider.

Old:

dns {
    zone = "<DNS_ZONE>"
    provider {
        route53 {
            zone_id = "<ZONE_ID>"
        }
    }
}

New:

dns {
    zone     = "<DNS_ZONE>"
    provider = "route53"
}

Check out the new syntax in the Packet configuration reference for details.

External DNS component

The owner_id field is now required.

Prometheus Operator component

There is a specific block for Grafana now.

Here's an example of the changed syntax.

Old:

component "prometheus-operator" {
    namespace              = "<NAMESPACE>"
    grafana_admin_password = "<GRAFANA_PASSWORD>"
    etcd_endpoints         = ["<ETCD_IP>"]
}

New:

component "prometheus-operator" {
    namespace = "<NAMESPACE>"
    grafana {
        admin_password = "<GRAFANA_PASSWORD>"
    }
    # etcd_endpoints is not needed anymore
}

Check out the new syntax in the Prometheus Operator configuration reference for details.
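
Before upgrading, it can help to scan a configuration for the old syntax described above. The sketch below is a heuristic based on plain `grep`, not a parser: it flags a nested `provider {` block (old DNS syntax) and the `grafana_admin_password` and `etcd_endpoints` fields (old Prometheus Operator syntax). The function name `check_old_syntax` is hypothetical, and the configuration files to scan are assumed to be passed as arguments.

```shell
#!/bin/sh
# Hypothetical helper: report lines that look like pre-v0.2.0 lokocfg syntax.
# Returns 0 if nothing suspicious is found, 1 otherwise.
# Usage: check_old_syntax <file>...
check_old_syntax() {
    # Old DNS syntax used a nested "provider { ... }" block; the old
    # Prometheus Operator syntax used the two top-level fields below.
    # Commented-out lines do not match because of the leading anchor.
    pattern='^[[:space:]]*(provider[[:space:]]*\{|grafana_admin_password[[:space:]]*=|etcd_endpoints[[:space:]]*=)'
    if grep -nE "$pattern" "$@"; then
        return 1
    fi
    return 0
}
```

A non-zero return only means a line deserves a look; the configuration references remain the authority on what is valid.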

Upgrade

Go to your cluster's directory and run the following command.

lokoctl cluster apply

The update process typically takes about 10 minutes. After the update, running lokoctl health should produce output similar to the following.

Node                     Ready    Reason          Message

lokomotive-controller-0  True     KubeletReady    kubelet is posting ready status
lokomotive-1-worker-0    True     KubeletReady    kubelet is posting ready status
lokomotive-1-worker-1    True     KubeletReady    kubelet is posting ready status
lokomotive-1-worker-2    True     KubeletReady    kubelet is posting ready status
Name      Status    Message              Error

etcd-0    True      {"health":"true"}

If you have the cert-manager component installed, you will get an error on the first update and need to do a second one. Run the following to upgrade your components again.

lokoctl component apply
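
Put together, the steps in this section could be scripted roughly as follows. This is a sketch under assumptions: the function name `upgrade_cluster` is hypothetical, `lokoctl` is assumed to be on `PATH`, and any failure of the first apply is treated as the cert-manager case described above, which may hide unrelated errors.

```shell
#!/bin/sh
# Hypothetical wrapper around the commands from this section.
# Usage: upgrade_cluster <cluster-directory>
upgrade_cluster() (
    # The function body is a subshell, so the cd does not leak to the caller.
    cd "$1" || exit 1
    if ! lokoctl cluster apply; then
        # With cert-manager installed the first update is expected to fail;
        # apply the components a second time.
        lokoctl component apply || exit 1
    fi
    # Print node and etcd status for a manual check.
    lokoctl health
)
```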

Changes in v0.2.0

Kubernetes updates

Update Kubernetes to v1.18.3 (#459).

Component updates

openebs: update to 1.10.0 (#528).
dex: update to v2.24.0 (#525).
contour: update to v1.5.0 (#524).
cert-manager: update to v0.15.1 (#522).
calico: update to v3.14.1 (#415).
metrics-server: update to 0.3.6 (#343).
external-dns: update to 2.21.2 (#340).
rook: update to v1.3.1 (#300).
etcd: Update to v3.4.9 (#521).

New platforms

Add AKS platform support (#219).

Bugfixes

Handle OS interrupts in lokoctl to fix leaking terraform process (#483).
Fix self-hosted Kubelet on bare metal platform (#436). It wasn't working.
grafana: remove cluster label in kubelet dashboard (#474). This fixes missing information in the Kubelet Grafana dashboard.
Rook Ceph: Fix dashboard templating (#476). Some graphs were not showing information.
pod-checkpointer: update to pod-checkpointer image (#498). Fixes communication between the pod checkpointer and the kubelet.
Fix AWS worker pool handling (#367). Remove invisible worker pool of size 0 and fix NLB listener wiring to fix ingress.
Fix rendering of ingress_hosts in Contour component (#417). Fixes having a wildcard subdomain as ingress for Contour.
kube-apiserver: fix TLS handshake errors on Packet (#297). Removes harmless error message.
calico-host-protection: fix node name of HostEndpoint objects (#201). Fixes GlobalNetworkPolicies for nodes.

Features

aws: add the AWS EBS CSI driver (#423). This allows dynamic provisioning of Persistent Volumes on AWS.
grafana: provide root_url in grafana.ini conf (#547). So Grafana exposes its URL and not localhost.
packet: add Cloudflare DNS support (#422).
Monitor etcd by default (#493). It wasn't being monitored before.
Add variable grafana_ingress_host to expose Grafana (#468). Allows exposing Grafana through Ingress.
Add ability to provide OIDC configuration (#182). Allows configuring the API server to use OIDC for authentication. Previously this was a manual operation.
Parameterise ClusterIssuer for Dex, Gangway, HTTPBin (#482). Allows using a different cluster issuer.
grafana: enable piechart plugin for the Prometheus Operator chart (#469). Pie chart graphs weren't showing.
Add a knob to disable self hosted kubelet (#425).
rook-ceph: add StorageClass config (#402). This allows setting up rook-ceph as the default storage class.
Add monitoring config and variable to rook component (#405). This allows monitoring rook.
packet: add support for hardware reservations (#299).
Add support for lokoctl component delete (#268).
bootkube: add calico-kube-controllers (#283).
metallb: add AlertManager rules (#140).
Label service-monitors so that they are discovered by Prometheus (#200). This makes sure all components are discovered by Prometheus.
external-dns: expose owner_id (#207). Otherwise several clusters in the same DNS Zone will interact badly with each other.
contour: add Alertmanager rules (#193).
contour: add nodeAffinity and tolerations (#386). This allows using ingress in a subset of cluster nodes.
prometheus-operator: add storage class & size options (#387).
grafana: add secret_env variable (#541). This allows users to provide arbitrary key-value pairs that will be exposed as environment variables inside the Grafana pod.
rook-ceph: allow volume resizing (#640). This enables the PVs created by the storage class to be resized on the fly.

Security

Block access to metadata servers for all components by default (#464). Most components don't need it and it is a security risk.
packet: disable syncing allowed SSH keys on nodes (#471). This way nodes aren't accessible to every SSH key authorized in the Packet user and project keys.
packet: tighten up node bootstrap iptables rules (#202). So nodes are better protected during bootstrap.
PSP: Rename restricted to zz-minimal (#293). So PSPs apply in the right order.
kubelet: don't automount service account token (#306). The kubelet doesn't need it: it reaches the API server with credentials mounted using HostPath.
prometheus-operator: add seccomp annotations to kube-state-metrics (#288). This reduces the attack surface by blocking unneeded syscalls.
prometheus operator: add seccomp annotations to PSP (#294). So Prometheus Operator pods have seccomp enabled.
Binding improvements (#194). This makes the kubelet, kube-proxy and calico-node processes listen on the Host internal IP.

UX

Add --confirm flag to delete component without asking for confirmation (#568).
Add error message for missing ipxe_script_url (#540).
Show logs when terraform fails in lokoctl cluster apply/destroy (#323).
cli/cmd: rename --kubeconfig flag to --kubeconfig-file (#602). This is because cobra/viper consider the KUBECONFIG environment variable and the --kubeconfig flag the same and this can cause surprising behavior.

Docs

docs: make Packet quickstart quick (#332).
docs: document Route 53 and S3+DynamoDB permissions (#561).
docs/quickstart/aws: Fix flatcar-linux-update-operator link (#552).
docs: Add detailed contributing guidelines (#404).
docs: Add instructions to run conformance tests (#236).
docs/quickstarts: add reference to usage docs and PSP note (#233).
docs: clarify values for ssh_pubkeys (#230).
docs/quickstarts: fix kubeconfig path (#229).
docs/prometheus-operator: clarify alertmanager config indentation (#199).
quickstart-docs: Add ssh-agent instructions (#325).
docs: provide alternate way of declaring alertmanager config (#570).
examples: make Flatcar channels explicit (#565).
docs/aws: document TLS handshake errors in kube-apiserver (#599).

Misc

Update terraform-provider-ct to v0.5.0 and mention it in the docs (#281).
Update broken links (#569).
Fix example configs and typos (#535).
docs/httpbin: Fix table (#510).
Add missing bracket in Prometheus Operator docs (#490).
docs: Update the component deleting steps (#481).
Fix broken bare metal config link (#473).
Remove period (.) from flag descriptions (#574).
Several fixes to make updates from v0.1.0 smooth (#638, #639, #642).
baremetal quickstart: Add double quotes (#633).
pkg/components/util: improvements (#605).
New internal package for helper functions (#588).
Remove vars from assets that were unused by tmpl file (#620).
keys: iago's key is kinvolk.io, not gmail 🤓 (#616).

v0.1.0 - 2020-03-18

Initial release.

Kubernetes v1.18.0
Running on Flatcar Container Linux
Fully self-hosted, including the kubelet
Single or multi-master
Calico networking
On-cluster etcd with TLS, RBAC-enabled, PSP-enabled, network policies
In-place upgrades, including experimental kubelet upgrades
Supported on:
- Packet
- AWS
- Bare metal
Initial Lokomotive Components:
