I have set up actions-runner-controller (https://github.com/actions-runner-controller/actions-runner-controller) in my EKS cluster.
For each runner, it creates a docker container and a runner container. To share Docker images between runners and avoid pulling them every time, I created a volume for /var/lib/docker
and mount it into all the docker containers.
The problem is that, from time to time, the docker container fails to initialize with the error below (with only 1 or 2 pods it does not fail):
level=warning msg="grpc: addrConn.createTransport failed to connect to {unix:///var/run/docker/containerd/containerd.sock <nil> 0 <nil>}. Err :connection error: desc = \"transport: error while dialing: dial unix:///var/run/docker/containerd/containerd.sock: timeout\". Reconnecting..." module=grpc
I have opened an issue in the repo, but I don't think the error is related to the controller; it is probably something on my side.
https://github.com/actions-runner-controller/actions-runner-controller/issues/847
Here are the YAML manifests:
apiVersion: actions.summerwind.dev/v1alpha1
kind: RunnerDeployment
metadata:
  name: github-runner-deploy
  namespace: actions-runner-system
spec:
  template:
    spec:
      organization: org-name
      volumes:
        - name: docker-data-ci
          persistentVolumeClaim:
            claimName: docker-data-ci
      dockerVolumeMounts:
        - mountPath: /var/lib/docker
          name: docker-data-ci
---
kind: PersistentVolumeClaim
apiVersion: v1
metadata:
  name: "docker-data-ci"
spec:
  storageClassName: gp2
  accessModes:
    - ReadWriteOnce
  resources:
    requests:
      storage: 100Gi