On the vSphere web UI, one or more of the following symptoms are present:
no connectivity to api master at localhost:1080/external-cert/http1/<Supervisor FIP>/6443/healthz?timeout context deadline exceeeded
Unable to resolve the vCenter Primary Network Identifier <vCenter FQDN> with the configured management DNS servers on control plane VM <Supervisor control plane VM DNS name>. Validate that the management DNS servers '<DNS Servers>' can resolve <vCenter FQDN>
While SSH to the vCenter Server Appliance (VCSA), the following symptoms are present:
cat /var/log/vmware/wcp/wcpsvc.log
{"type": "ManagementNetworkConfigured", "status": "FALSE", "reason": "ManagementDNSServerHostNotFound", "messages": [{"Severity": "ERROR", "Details": {"Id": "vcenter.wcp.node_state_check.mgmt_network.vcenter_pnid_host_not_found", "DefaultMessage": "Unable to resolve the vCenter Primary Network Identifier <vCenter FQDN> with the configured management DNS servers on control plane VM <Supervisor control plane VM DNS name>. Validate that the management DNS servers '<DNS Servers>' can resolve <vCenter FQDN>.", "Args": ["<vCenter FQDN>", "<Superivsor control plane VM DNS name>", "<DNS servers>"]}}], "severity": "ERROR"
While connected to the Supervisor control plane VM from the error message, the following symptoms are observed:
kubectl get pods -A | egrep -v "Run|Complete"
nslookup to the vCenter FQDN fails:
nslookup <vCenter FQDN>
vSphere Supervisor
One or more DNS servers configured with Workload Management on the Management Network cannot reach the vCenter FQDN.
Correct the DNS server connection issue to the vCenter FQDN.
nslookup <vCenter FQDN> <DNS server>
kubectl rollout restart deployment -n kube-system coredns
nslookup <vCenter FQDN>