I've simple following configuration of the cluster: 3 nodes on 3 machines. On top of each node i have 1 pod. Each pod has readinessProbe. All these pods exposed to one service object with type NodePort. On the top of this object i have the following ingress object:
apiVersion: extensions/v1beta1
kind: Ingress
metadata:
name: dess-ingress
spec:
backend:
serviceName: dess-index
servicePort: 2280
This configuration works fine, but i want to return http busy status when all readinessProbe fails and cluster cannot handle more requests. How can I do that? And a related question - google compute engine allows to specify backend service capacity based on cpu utilisation or requests per second for load-balancing purposes. Can I do the following with the kubernetes service object?