I have a cluster that was recently upgraded. Since then none of the pods running in the cluster are able to get a response from https://kubernetes.default/healthz.
To be clear, they can resolve the URL, but it constantly times out of comes back with connection refused.
I have verified that the api-server is running as I can get a response from http://localhost:8080/healthz, but I can't get anything from within a pod.
I've checked all the scripts and configs and compared them to the other clusters that were upgraded at the same time and there appears to be nothing different.
I'm sure it's something small I've overlooked but I don't know where else to look.
Additional information:
[edit]
providing /etc/resolv.conf
cat /etc/resolv.conf
nameserver 100.64.0.10
search jenkins.svc.cluster.local svc.cluster.local cluster.local us-west-2.compute.internal
options ndots:5
Check the image I uploaded, you can only resolve the dns kubernetes
from within a pod in the default
namespace. For other namespaces use kubernetes.default.svc
like in the following curl. You have the certificate and token already mounted in the pod as well.
curl --cacert /var/run/secrets/kubernetes.io/serviceaccount/ca.crt -H "Authorization: Bearer $(cat /var/run/secrets/kubernetes.io/serviceaccount/token)" -H "Accept: application/json" https://kubernetes.default.svc/api/