Autoscaling

Forms of Kubernetes Autoscaling

Autoscaling is a method for automatically scaling Kubernetes (K8s) workloads up or down based on resource usage/expectations. Autoscaling in Kubernetes has three dimensions:

The different autoscalers operate at one of two Kubernetes levels:

Pod Level: The HPA and VPA methods occur at the pod level. Both HPA and VPA scale the available resources or instances of the pod, both up and down.

Cluster Level: The Cluster Autoscaler operates at the cluster level and scales the number of nodes within your cluster up or down.

Simultaneous Use of VPA and HPA

Last modified 25.03.2024: Marketing changes (4542007)