Internet-less environment cluster downtime due to missing kube-dns images
book
Article ID: 298569
calendar_today
Updated On:
Products
VMware Tanzu Kubernetes Grid Integrated Edition
Issue/Introduction
Symptoms:
System pods that are re-scheduled cannot start in an internet less environment because kube-dns images are missing.
Environment
Cause
In an internet-less environment if a worker fills up, when docker cleanup (garbage collection) is run it will remove all unused docker images. Currently the clean up does not differentiate between system images and non system images. This means the kube-dns image can be cleaned up. Since it's an airgapped environment, the image cannot be pulled from k8s.gcr.io if the kube-dns pod is scheduled to run on this k8s node (worker).
Resolution
To prevent this from happening the following container images can be pre-loaded onto your private registry. They will need to be updated for each release: