What is the Maximum number of request can a Kubernetes Service handle in parellel. I will be providing auto scaling for the pods to scale based on the input request, but is there any limit for the maximum number of request which a Service can handle at a point of time.
Services in Kubernetes are just routing configurations in the kernel. When you call a Service URL/IP your local kernel will forward the request to a random Pod.
Therefore the request limit is defined by the number of requests a single node of your cluster can handle.