min = max | You're Not Autoscaling

[ 0.000] kubernetes autoscaling observatory — initializing [ 0.112] loading cluster state... [ 0.341] ✓ kube-apiserver UP [ 0.342] ✓ kube-scheduler UP [ 0.343] ✓ cluster-autoscaler UP [ 0.344] ✓ metrics-server UP [ 0.500] scanning HPAs in namespace: production [ 0.512] ✓ api-gateway SCALING (2→5 replicas, cpu: 62%/70%) [ 0.513] ✓ auth-service SCALING (2→2 replicas, cpu: 31%/70%) [ 0.514] ⚠ worker-service SCALING (2→12 replicas, cpu: 88%/70%) [ 0.515] ✗ your-service DISABLED (minReplicas == maxReplicas == 8) [ 0.600] --- [ 0.601] FAULT DETECTED: ScalingActive=False [ 0.602] REASON: the HPA was disabled because minReplicas equals maxReplicas [ 0.603] --- [ 0.700] diagnosis: this is not autoscaling. [ 0.701] diagnosis: this is a number. a fixed, unchanging number. [ 0.800] rendering summary...

MIN = MAX

↓ scroll for the evidence

replicas saved by HPA today

engineers who distrust HPA

cluster autoscaler events ignored

pods over-provisioned per day

replica-hours wasted since you opened this page 0 replica-hours

exhibit a

A Healthy HPA.
Doing Its Job.

Below is what a functioning HPA looks like. The replicas are different numbers. That's the whole point. Wild, right?

kubectl — prod-cluster

❯ kubectl get hpa -n production

NAME                     REFERENCE                TARGETS         MINPODS   MAXPODS   REPLICAS   AGE

api-gateway              Deployment/api-gateway   62%/70%         2         20        5          47d

worker-service           Deployment/worker-svc    88%/70%         2         50        12         47d

auth-service             Deployment/auth-svc      31%/70%         2         10        2          47d

static-replica-svc       Deployment/static-svc    Unknown         8         8         8          47d

❯ kubectl describe hpa static-replica-svc -n production | grep -A3 "Conditions"

Conditions:

  Type             Status  Reason

  ScalingActive    False   ScalingDisabled: the HPA was disabled because minReplicas equals maxReplicas

  AbleToScale      True    SucceededGetScale

# ^ kubernetes literally telling you. in plain english. right there.

"Setting min = max is not
a safety net.
it is a static deployment
in a costume."

— kubectl describe hpa, probably

autoscaling, explained

The Three
Musketeers

01

HPA — Horizontal Pod Autoscaler

Scales your pod count horizontally based on CPU, memory, or custom metrics. Watches your workload. Reacts to load. Does this continuously. Has been doing this reliably since Kubernetes 1.1. In 2024.
02

VPA — Vertical Pod Autoscaler

Right-sizes your resource requests and limits based on actual usage. Stops you from over-requesting 4 CPUs for a pod that uses 0.2. Saves real money. Works great in recommendation mode.
03

Cluster Autoscaler

Provisions and deprovisions nodes when pods can't be scheduled or when nodes are underutilized. Pairs with HPA. The outer loop to HPA's inner loop. Together: an elastic, cost-aware platform.

common fears vs. reality

Why People
Disable It

✗

"HPA scaled down and caused an outage"

Set appropriate minReplicas for baseline load. Tune scaleDown stabilization windows. Don't set min to 0 for critical services. This is configuration, not a product defect.
✗

"It scales too aggressively"

Adjust scaleUp/scaleDown policies. Set stabilizationWindowSeconds. Use behavior blocks. The defaults are conservative by design. You have full control.
✗

"I don't trust the metrics"

Then fix your metrics pipeline. An HPA is only as good as the signals it receives. Bad observability is a prerequisite problem, not an HPA problem.

Setting minReplicas = maxReplicas disables autoscaling entirely. Kubernetes will tell you this. It's in the HPA conditions output. You can see it right now. Just run describe.

exhibit b

What Autoscaling
Actually Looks Like

A replica count that moves. In response to load. Automatically. Note the difference between the green line (desired) and the flat red line (you, with min=max).

replica count over time — 24h window

HPA-managed replicas

static (min=max=8) "autoscaling"

A Healthy HPA.Doing Its Job.

The ThreeMusketeers

Why PeopleDisable It

What AutoscalingActually Looks Like

A Healthy HPA.
Doing Its Job.

The Three
Musketeers

Why People
Disable It

What Autoscaling
Actually Looks Like