Azure Kubernetes Cluster Node Failure Scenario

8/17/2018

Let's say I have 3 nodes in my cluster and I want to run 300 jobs.

If I run 1 job per POD and 100 pods per NODE, what will happen if a node fails in Azure Kubernetes Service?

-- Talha Irfan
azure
azure-kubernetes
kubernetes

2 Answers

8/30/2018
-- Vikram Hosakote
Source: StackOverflow

8/30/2018

Those Jobs will go to pending, as Kubernetes supports 110 pods per node, so wouldn't have the resources to support the failed over jobs. You could look at using the Cluster Autoscaler (Beta) and it would provision more host to satisfy running those jobs that are in a pending state.

-- Strebel - MSFT
Source: StackOverflow