MachineDeploymentsUpgradePending status, with the node remaining in the DrainingNode stage for an extended period.Machine deletion in progress since more than 15m, stage: DrainingNode
Cluster API (CAPI) logs report: Drain not completed yet... Pods not terminating
Eviction triggers indicate: PodsToTriggerEvictionNow: vmware-system-tmc/cluster-auth-pinniped-kube-cert-agent-*
cluster-auth-pinniped-kube-cert-agent pod within the vmware-system-tmc namespace.VMware vSphere Kubernetes Service
Tanzu Mission Control (TMC) agent pod finalizers prevent pod termination during the automated node drain sequence. When the machine controller executes a node drain, the TMC pod finalizer blocks eviction, causing the pod eviction timeout to be reached. Consequently, the node drain stalls, which triggers a machine deletion timeout and permanently blocks the TKC rollout.
To resolve this issue, scale down the blocking deployment to allow the machine drain to complete.
kubectl scale deployment cluster-auth-pinniped-kube-cert-agent --replicas=0 -n vmware-system-tmckubectl get machine <machine-name> -n <namespace> -wVerify that the machine status successfully transitions to Deleted.
Monitor the TKC object to verify the rollout automatically completes once the node is removed:kubectl get tanzukubernetescluster <cluster-name> -w