Guest cluster nodes do not get created when using webhooks (such as rancher) in the guest cluster.
The webhooks watch for secret events, and if all the workers are down, the webhooks pods are inaccessible.
This causes the control plane rollout/creation to get stuck.
The Machine, wcpMachine
objects are created, however the respective virtualMachine
objects and Guest Cluster VMs in vCenter Server are not created. Additionally,
bootstrap data is not available
controller/kubeadmconfig "msg"="Reconciler error" "error"="failed to create new bootstrap token: Internal error occurred: failed calling webhook rancher.cattle.io: ..."
vSphere with Tanzu
capi-kubeadmconfig-controller
is trying to create a token secret in the guest cluster before generating the datavalueSecret
for cloud-init
to join the node.
The rancher webhooks block the creation of the token secret as connections to the webhook pods fail.
As the worker nodes are down, no webhook pods are running.
controller/kubeadmconfig "msg"="Reconciler error" "error"="failed to create new bootstrap token: Internal error occurred: failed calling webhook rancher.cattle.io:
alidatingwebhookconfigurations
and mutatingwebhookconfigurations
that watch secret events# kubectl get validatingwebhookconfigurations rancher.cattle.io
-o yaml > rancher-cattle-io.yaml
validatingwebhookconfigurations
and mutatingwebhookconfigurations
# kubectl delete validatingwebhookconfigurations rancher.cattle.io
validatingwebhookconfigurations
and mutatingwebhookconfigurations
from the backups that were taken in step 1.# kubectl apply -f rancher-cattle-io.yaml