When deploying a Tanzu Kubernetes Cluster (TKC) in a vSphere Kubernetes(VKS) environment, the virtual machines remain stuck in the "Creating" state.
The following error is observed upon describing the machine, or you can find this error in the kubelet logs:
# One of the control plane node
journalctl -xeu kubeletContainer runtime network not readynetworkReady="Network"
This issue may stem from multiple factors, including Supervisor Cluster readiness and network configuration constraints.
VMware vSphere Kubernetes Service
The cause in this scenario is a limitation in the Avi Load Balancer configuration:
The Avi Service Engine (SE) group was set with a maximum limit of 10 SEs, which restricts scaling and prevents successful TKC cluster provisioning
Additionally, the Supervisor Cluster or TKGS service may not be in a healthy state, contributing to deployment failure
Ensure the Supervisor Cluster is in a Ready state. If not, escalate to Broadcom Support.
Confirm that the TKGS service on vCenter is running.
Update Avi Configuration
Increase the SE group limit from 10 to 100, or adjust based on customer requirements.
(AVI UI --> Infrastructure --> Cloud Resources --> Service Engine Group --> Edit icon --> Placement --> Virtual Services per Service Engine)
Redeploy the TKC Cluster
Retry the cluster deployment after updating the Avi SE group configuration.