1/3 CP node got renewal, the other 2 CP still didn't have the updated certs.
The KCP status showed EtcdClusterUnhealthy is false "'Following machine is reporting etcd member errors: cl-cp-1-faulty'", but the machine/node cl-cp-1-faulty doesn't exist on vCenter or machine list.
TKGm 2.1.1
Management cluster version is 1.24.10
TCA 3.2
The KCP status.replicas=4 but spec.replicas=3.
It is because that during rolling update when KCP tried to rotate the certificate on each control plane, the first control plane machine was successfully recreated with the newly created leaf certificate. But it failed when created the second machine due to etcd error. KCP seemed to scale down when this error happened, the machine was deleted, but KCP considered the scaling down a failure. This blocks its procedure on other rolling updates.
k get machines -A
ETCDCTL_API=3 etcdctl member list --cacert=/etc/kubernetes/pki/etcd/ca.crt --cert=/etc/kubernetes/pki/etcd/server.crt --key=/etc/kubernetes/pki/etcd/server.key
kubectl rollout restart deployment capi-kubeadm-control-plane-controller-manager -n capi-kubeadm-control-plane-system