New worker machines fail to provision after a scale operation on an existing cluster. The machines enter a provisioning loop and never reach the Running state. As a result, the cluster is under-resourced and pods cannot be scheduled.
Reviewing the Machine objects shows that the affected workers are stuck in the Provisioned state with no nodename field. Other machines are stuck deleting, and only a subset of worker nodes remain in the Running state.
Logs from kubelet/journal on the worker VMs show repeated registration failures:
Unable to register node with API server: Node "<nodename>.<domain>" is invalid: metadata.labels: Invalid value: "<nodename>.<domain>": must be no more than 63 characters
Kubernetes limits label values to 63 characters, and kubelet sets the node name as the value of the kubernetes.io/hostname label. The combination of the cluster name, the MachineDeployment suffix, and the FQDN appended from DHCP produces node names longer than 63 characters. kubelet therefore fails to register with the API server, and the Machines remain stuck in the Provisioned state.
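To check whether a given name would exceed the limit, a length check along the following lines may help. This is a minimal sketch, not a Broadcom-provided tool: the function name check_node_name and the variable LABEL_VALUE_MAX are hypothetical and introduced only for illustration.

```shell
# Hypothetical helper (not part of any Tanzu or Kubernetes tooling).
# Reports whether a candidate node name fits within the 63-character
# limit that Kubernetes applies to label values.
LABEL_VALUE_MAX=63

check_node_name() {
    name=$1
    len=${#name}    # number of characters in the name
    if [ "$len" -gt "$LABEL_VALUE_MAX" ]; then
        echo "too long: $len characters (limit $LABEL_VALUE_MAX)"
        return 1
    fi
    echo "ok: $len characters (limit $LABEL_VALUE_MAX)"
}
```

On an affected worker VM, this could be run as check_node_name "$(hostname -f)", assuming hostname -f returns the same FQDN that kubelet attempts to register; that assumption should be confirmed against the name shown in the kubelet error.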
If you experience this issue, collect the following information and contact Broadcom Support for assistance:
tanzu cluster list
kubectl get machines -A
kubectl get md -A
kubectl get nodes
Broadcom Support will review the environment configuration and provide guidance specific to your deployment.