I am using Kubernetes HPA to scale up my cluster. I have set up target CPU utilization is 50% . It is scaling up properly. But, when load decreases and it scales down so fast. I want to set a cooling period. As an example, even the CPU util is below 50% , it should wait for 60 sec before terminating a node.
I have checked this article, but it is not saying that I can change the default value in HPA, https://kubernetes.io/docs/concepts/workloads/pods/pod/index.html#termination-of-pods
Kops version :- 1.9.1
This is configured at the HPA level: https://kubernetes.io/docs/tasks/run-application/horizontal-pod-autoscale/#support-for-cooldown-delay
--horizontal-pod-autoscaler-downscale-delay: The value for this option is a duration that specifies how long the autoscaler has to wait before another downscale operation can be performed after the current one has completed. The default value is 5 minutes (5m0s).