K8s Cluster Autoscaling — Findings and Research

Our findings on how Kubernetes Cluster Autoscaling works, including insights on Karpenter and Cluster Autoscaler.

ksctl Team

January 23, 2024

Tags: kubernetes, autoscaling, karpenter, cluster-autoscaler, research

Pod Auto-scaler

Information gathered from the Kubernetes Slack channels (#karpenter, #auto-scaler).

Karpenter scales in response to pending-pod pressure.
It has two primary controllers, one for scale-up and one for scale-down.
Provisioning trigger controller: Creates nodeclaims, that is, it initiates a scale-up request. It watches for pending pods and checks whether they can be scheduled on existing nodes. If they cannot, Karpenter resolves the scheduling problem by creating additional nodeclaims. Node lifecycle controllers watch for these nodeclaims and launch VMs accordingly.
- Source: provisioning/controller.go
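The provisioning flow described above can be sketched roughly as follows. This is a simplified illustration, not Karpenter's actual code: the `Pod`, `Node`, and `provision` names are hypothetical, only CPU requests are considered, and a plain first-fit pass stands in for Karpenter's real scheduling simulation.

```python
from dataclasses import dataclass, field


@dataclass
class Pod:
    name: str
    cpu_request: float  # CPU cores requested by the pod


@dataclass
class Node:
    name: str
    cpu_allocatable: float
    pods: list = field(default_factory=list)

    def free_cpu(self) -> float:
        # Capacity left after subtracting the requests of pods already placed here.
        return self.cpu_allocatable - sum(p.cpu_request for p in self.pods)


def provision(pending, nodes, claim_cpu):
    """Place pending pods on existing nodes where they fit (first-fit).

    Returns the hypothetical node claims created for pods that did not fit.
    Every claim is assumed to have `claim_cpu` allocatable cores.
    """
    claims = []
    for pod in pending:
        if pod.cpu_request > claim_cpu:
            raise ValueError(f"{pod.name} does not fit on any node claim")
        # Try existing nodes first, then claims created earlier in this pass.
        target = next(
            (n for n in nodes + claims if n.free_cpu() >= pod.cpu_request),
            None,
        )
        if target is None:
            # Nothing fits: request additional capacity via a new claim.
            target = Node(f"nodeclaim-{len(claims) + 1}", claim_cpu)
            claims.append(target)
        target.pods.append(pod)
    return claims
```

In this sketch, a pod that fits on an existing node never triggers a claim; only the pods left over after that pass cause new capacity to be requested.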
Disruption controller: Drives scale-down. It polls the cluster every 10 seconds and iterates through the disruption methods to determine whether any disruption action can be initiated.
- Source: disruption/controller.go
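A minimal sketch of that polling pattern is shown below. The names `remove_empty`, `reconcile_once`, and `run` are hypothetical, the cluster is modelled as a plain dict of node name to pod names, and the single "empty node" method is only a stand-in for the real disruption methods.

```python
import time
from typing import Callable, Dict, List, Optional

# Cluster state for this sketch: node name -> names of pods running on it.
Cluster = Dict[str, List[str]]
# A disruption method returns the name of a node to disrupt, or None.
DisruptionMethod = Callable[[Cluster], Optional[str]]


def remove_empty(cluster: Cluster) -> Optional[str]:
    """Hypothetical method: pick the first node that runs no pods."""
    return next((name for name, pods in cluster.items() if not pods), None)


def reconcile_once(cluster: Cluster, methods: List[DisruptionMethod]) -> Optional[str]:
    """Try each method in order; act on the first one that finds a candidate."""
    for method in methods:
        candidate = method(cluster)
        if candidate is not None:
            del cluster[candidate]  # stands in for draining and deleting the node
            return candidate
    return None


def run(cluster, methods, poll_seconds=10.0, max_polls=None, sleep=time.sleep):
    """Poll the cluster at a fixed interval (10 seconds, as in the notes above)."""
    polls = 0
    while max_polls is None or polls < max_polls:
        reconcile_once(cluster, methods)
        polls += 1
        sleep(poll_seconds)
```

The `sleep` and `max_polls` parameters exist only so the loop can be exercised without waiting; a real controller would run until it is shut down.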

For scale-up, Cluster Autoscaler (CAS) looks for pending pods.
For scale-down, it looks for under-utilized nodes, where utilization is calculated from resource requests:
Resource usage = sum(resource_requests) / node_allocatable
This figure is derived from what pods request, so it has nothing to do with "real" (measured) utilization.
CAS's job is to make all pods schedulable using as few nodes as possible; it only looks at scheduling.
You can use HPA/VPA to adjust pods based on actual resource usage, which will in turn trigger CAS to add or remove nodes.
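As a small worked example of the request-based formula above, the sketch below computes utilization per node and flags scale-down candidates. The function names and the 0.5 threshold are illustrative assumptions, and only a single resource (CPU cores) is considered.

```python
def requested_utilization(pod_requests, allocatable):
    """Utilization as sum(resource_requests) / node_allocatable."""
    if allocatable <= 0:
        raise ValueError("allocatable must be positive")
    return sum(pod_requests) / allocatable


def scale_down_candidates(nodes, threshold=0.5):
    """Return names of nodes whose requested utilization is below the threshold.

    `nodes` maps a node name to (list of pod CPU requests, allocatable CPU).
    The threshold value is an assumption made for this example.
    """
    return [
        name
        for name, (requests, allocatable) in nodes.items()
        if requested_utilization(requests, allocatable) < threshold
    ]


nodes = {
    "node-a": ([0.5, 0.5], 4.0),  # 1.0 of 4.0 cores requested
    "node-b": ([2.0, 1.5], 4.0),  # 3.5 of 4.0 cores requested
}
print(scale_down_candidates(nodes))  # ['node-a']
```

A node whose pods request little but actually burn a lot of CPU still counts as under-utilized here, which is exactly the gap between requested and measured usage noted above.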

Andreasen, J. V. (2024). Carbon Efficient Karpenter: Optimizing Kubernetes Cluster Autoscaling for Carbon Efficiency. GitHub Repository