We're investigating the following options for AKS scaling:

1. ACI VIRTUAL NODES
Status
Currently `BLOCKED`.
Where
- The deployment is in the contoso-traders test subscription.
- `mithun/hpa2` branch (see PR microsoft/ContosoTraders#78).
Change Description
- Redeployed the AKS cluster via the bicep template from the `mithun/hpa2` branch, which uses the `Azure CNI` network plugin (instead of the default `kubenet` plugin).
- Had to manually modify the AKS vnet to create a new subnet `aci-subnet` with address space `10.255.0.0/16`.
- Tethered it to the existing AKS cluster using `az aks enable-addons` (full instructions here).
- Applied the `Deployment.yaml` manifest from the `mithun/hpa2` branch, which has the `nodeSelector` and `tolerations` changes that configure pods to run only in virtual nodes.
Issue Details
- The pods (configured to run in ACI virtual nodes) are stuck in `waiting` state.
- The logs only show that an active endpoint is not being detected for the services / ingress.
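For reference while debugging the stuck pods, the scheduling constraints mentioned under Change Description can be sketched as below. This is a minimal Python sketch, not the contents of `Deployment.yaml` on `mithun/hpa2`: the label and taint keys are assumptions based on the AKS virtual nodes documentation, and the helper names `virtual_node_scheduling` and `selector_matches` are hypothetical.

```python
import json


def virtual_node_scheduling() -> dict:
    """Return a pod-spec fragment that targets AKS virtual nodes.

    ASSUMPTION: keys and values follow the AKS virtual nodes docs;
    the manifest on the branch may differ.
    """
    return {
        "nodeSelector": {
            "kubernetes.io/role": "agent",
            "type": "virtual-kubelet",
        },
        "tolerations": [
            # Tolerate the taints that keep ordinary pods off the virtual node.
            {"key": "virtual-kubelet.io/provider", "operator": "Exists"},
            {"key": "azure.com/aci", "effect": "NoSchedule"},
        ],
    }


def selector_matches(node_labels: dict, node_selector: dict) -> bool:
    """True when every nodeSelector entry is present on the node's labels."""
    return all(node_labels.get(k) == v for k, v in node_selector.items())


if __name__ == "__main__":
    # Print the fragment so it can be compared with the deployed manifest.
    print(json.dumps(virtual_node_scheduling(), indent=2))
```

Passing the labels reported by `kubectl describe node` for the virtual node to `selector_matches` shows whether the selector could ever be satisfied. A mismatch there would leave pods unscheduled regardless of the network plugin.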
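Hypothesis
- Could have something to do with the fact that we switched over to the `Azure CNI` network plugin instead of the default `kubenet` plugin.
- Could have something to do with the `nodeSelector` and `tolerations` changes made in the `Deployment.yaml` file to configure pods to run only in virtual nodes.

2. CLUSTER AUTOSCALER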
Status
Currently `INVESTIGATING`.
Where
- [mithun/enable-autoscal](mithunshanbhag:mithun/cluster-autoscaler) branch (see PR "[DRAFT] enable cluster autoscaling" #81).
Change Description
- Autoscaler limits: `minCount: 1` and `maxCount: 10`.
Issue Details
- The load test has a high failure rate. This issue is tracked separately in "Load Test: Fix API issues identified from load testing" (contosotraders-cloudtesting#3).
- The pods are also not scaling out (this could be related to the issue above).
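If pod scale-out is driven by a Horizontal Pod Autoscaler (an assumption, since these notes do not say how pods are scaled), the Kubernetes documentation gives the replica count as `ceil(currentReplicas * currentMetricValue / desiredMetricValue)`, with no change while the ratio stays within a tolerance (0.1 by default). The sketch below uses hypothetical function and parameter names. It may help when checking whether the metrics observed during the load test would be expected to trigger a scale-out at all.

```python
import math


def desired_replicas(current_replicas: int, current_metric: float,
                     target_metric: float, *, min_replicas: int,
                     max_replicas: int, tolerance: float = 0.1) -> int:
    """Approximate the HPA replica calculation from the Kubernetes docs.

    min_replicas / max_replicas are HPA pod bounds (hypothetical here),
    not the node-pool minCount / maxCount of the cluster autoscaler.
    """
    if target_metric <= 0:
        raise ValueError("target_metric must be positive")
    ratio = current_metric / target_metric
    # Within tolerance of the target: the HPA leaves the count unchanged.
    if abs(ratio - 1.0) <= tolerance:
        return current_replicas
    desired = math.ceil(current_replicas * ratio)
    # Clamp to the configured bounds.
    return max(min_replicas, min(max_replicas, desired))


if __name__ == "__main__":
    # 2 pods at 90% CPU against a 50% target.
    print(desired_replicas(2, 90, 50, min_replicas=1, max_replicas=20))
```

If the measured metric never rises above the target (for example because failing requests return quickly), this calculation never asks for more pods, and the cluster autoscaler has no pending pods to react to.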
Hypothesis
Currently none, still investigating.
Misc Notes
- The ingress controller was stuck in `PENDING` state for a few minutes after provisioning, then automatically went to `OK` state.
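Because the ingress controller needed a few minutes to settle, a deployment or load-test script could wait for it before proceeding. Below is a minimal sketch of such a wait loop. It assumes a caller-supplied `get_state` function (hypothetical, for example a wrapper around whatever command reports the ingress status) that returns the current state as a string. The timeout and interval defaults are illustrative only.

```python
import time
from typing import Callable


def wait_for_state(get_state: Callable[[], str], wanted: str = "OK",
                   timeout_s: float = 600.0, interval_s: float = 10.0,
                   sleep: Callable[[float], None] = time.sleep,
                   clock: Callable[[], float] = time.monotonic) -> bool:
    """Poll get_state() until it returns `wanted` or the timeout expires.

    Returns True if the wanted state was seen, False on timeout.
    `sleep` and `clock` are injectable so the loop can be tested quickly.
    """
    deadline = clock() + timeout_s
    while True:
        if get_state() == wanted:
            return True
        if clock() >= deadline:
            return False
        sleep(interval_s)


if __name__ == "__main__":
    # Simulated status sequence standing in for a real status query.
    states = iter(["PENDING", "PENDING", "OK"])
    print(wait_for_state(lambda: next(states), interval_s=0.0))
```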