In the Kubernetes documentation for horizontal pod autoscalers it states as of version 1.12 a "new algorithmic update removes the need for the upscale delay"
I have searched for information on this change including going through the v1.12 change log. The change I see mentioned is the polling frequency from 30 seconds to 15 seconds.
There are also some discussions about adding HPA configurations for scale delay.
What was the change that removed the need for upscale delay?
There are several changes (quoted from the release notes):
Speed up HPA reaction to metric changes by removing scale up forbidden window. (#66615, @jbartosik)
Horizontal Pod Autoscaler default update interval has been increased from 30s to 15s, improving HPA reaction time for metric changes. (#68021, @krzysztof-jastrzebski)
Stop counting soft-deleted pods for scaling purposes in HPA controller to avoid soft-deleted pods incorrectly affecting scale up replica count calculation. (#67067, @moonek)
This is a related change (quoted from the release notes):
More documentation related to that change is here.