When deploying a VMware Cloud Foundation (VCF) 9.0.2 Automation cluster, the deployment fails at Step 6. The Fleet Manager bootstrap logs will report a timeout:
error: failed to apply the PackageDeployment: context deadline exceeded
Checking the pod status on the new Automation node with command "kubectl get pods -n vmsp-platform" reveals that the cluster-api-installer pod has failed with state: ImagePullBackOff.
VMware Cloud Foundation (VCF) Automation 9.0.2.
This issue is caused by a network communication failure. Ports 30000-30006 are restricted between the Automation cluster and the Fleet Management machine, preventing the installer from pulling the required deployment packages.
This can be verified by running the below command from the Automation node:
This will likely be stuck at "Trying..." indicating that access is not open between Automation and Fleet Management on port 30000.
Running the exact same command on the Fleet Management machine directly should return "Connected to <FLEET-MANAGEMENT-FQDN-or-IP> port 30000" showing that port 30000 is listening on that machine.
To resolve the deployment failure, update your network configuration to permit traffic on the following ports:
VMware Ports and Protocols
https://ports.broadcom.com/home/VMware-Cloud-Foundation-Automation
(Select Product: "VMware Cloud Foundation Automation", NOT "VMware Aria Automation")