Skip to content

Commit b3816d7

Browse files
committed
Add scaleUp stabilization window to mitigate metric oscillation
AverageValue divides total token throughput by replica count, which means the per-replica value halves after a scale-up event. With stabilizationWindowSeconds: 0 this could cause flapping near the threshold. Setting it to 30s requires the metric to stay above threshold for two consecutive polling intervals before a scale-up is committed, while the existing 120s scaleDown window prevents premature scale-down.
1 parent b69e04d commit b3816d7

1 file changed

Lines changed: 1 addition & 1 deletion

File tree

serving/kserve-keda-autoscaling/scaled-object.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -26,7 +26,7 @@ spec:
2626
horizontalPodAutoscalerConfig:
2727
behavior:
2828
scaleUp:
29-
stabilizationWindowSeconds: 0
29+
stabilizationWindowSeconds: 30 # short window to absorb metric noise before committing to scale-up
3030
scaleDown:
3131
stabilizationWindowSeconds: 120
3232
policies:

0 commit comments

Comments
 (0)