VMware Cloud Director 10.6.1
This issue can occur when the Load Balancer health check does not accurately detect that a VMware Cloud Director cell is in Maintenance Mode. Endpoints such as /api/versions may still return HTTP 200 responses, even though the Cloud Director service is no longer serving traffic. As a result, the Load Balancer continues routing requests to a cell that is not operational, triggering “Failed to Start” errors in the UI.
option httpchk GET /cloud
http-check expect status 200
The /cloud endpoint is a lightweight VCD-native check.
It returns 200 OK when the Cloud Director service is running and accepting UI/API requests.
It returns 503 Service Unavailable when the cell is in Maintenance Mode or the service is stopped.
This allows the Load Balancer to detect that the node is unavailable and remove it from the active pool automatically.
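To see what the health check sees, each cell can be probed directly, bypassing the Load Balancer. The following is a minimal sketch rather than a supported tool: the function names (should_stay_in_pool, probe_cell, report_cells) and any hostnames passed to them are hypothetical, and it assumes curl is installed and that -k is needed because the cell certificates are not trusted by the client.

```shell
#!/bin/sh
# Mirrors "http-check expect status 200": only an exact 200 keeps a cell in the pool.
should_stay_in_pool() {
    [ "$1" = "200" ]
}

# Print the HTTP status code returned by GET /cloud on a single cell.
# -k skips certificate verification, -s hides progress, -o discards the body.
probe_cell() {
    curl -k -s -o /dev/null --max-time 5 -w '%{http_code}' "https://$1/cloud"
}

# For each cell given as an argument, report how the health check would treat it.
report_cells() {
    for cell in "$@"; do
        code=$(probe_cell "$cell")
        if should_stay_in_pool "$code"; then
            echo "$cell: $code (stays in pool)"
        else
            echo "$cell: $code (removed from pool)"
        fi
    done
}
```

For example, report_cells vcd-cell1.example.com vcd-cell2.example.com (placeholder hostnames) prints one line per cell. curl reports 000 when no HTTP response is received at all, which the sketch also treats as a failed check.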
systemctl reload haproxy
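It can be safer to validate the edited configuration before reloading, so that a typo in the health check lines does not leave HAProxy running with the old configuration unnoticed. The sketch below assumes the configuration lives at /etc/haproxy/haproxy.cfg (adjust the path for your installation); the function name reload_haproxy_if_valid is hypothetical.

```shell
#!/bin/sh
# Validate the HAProxy configuration, then reload only if validation passes.
# The default path is an assumption; pass a different path as the first argument.
reload_haproxy_if_valid() {
    cfg="${1:-/etc/haproxy/haproxy.cfg}"
    # "haproxy -c -f FILE" parses the configuration and exits non-zero on errors.
    if haproxy -c -f "$cfg"; then
        systemctl reload haproxy
    else
        echo "Configuration check failed; HAProxy was not reloaded." >&2
        return 1
    fi
}
```

Calling reload_haproxy_if_valid with no argument checks the default path and reloads only when the check succeeds.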
curl -k -i https://<public-fqdn>/cloud
Expected behavior:
If traffic is being routed to a healthy node: HTTP/1.1 200 OK
If traffic is still being routed to the node in Maintenance Mode: HTTP/1.1 503 Service Unavailable
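Because the Load Balancer may distribute requests across several cells, a single request can land on a healthy node even if an unhealthy one is still in the pool. One way to gain more confidence is to repeat the request and count the non-200 responses. This is a sketch under assumptions: the function names are hypothetical, the default of 10 attempts is arbitrary, and session persistence on the Load Balancer may cause every request to reach the same node.

```shell
#!/bin/sh
# Read one HTTP status code per line on stdin and print how many are not 200.
count_non_200() {
    awk '$1 != "200" { n++ } END { print n + 0 }'
}

# Send repeated GET requests to /cloud through the public FQDN,
# printing one status code per line.
sample_public_endpoint() {
    fqdn="$1"
    attempts="${2:-10}"
    i=0
    while [ "$i" -lt "$attempts" ]; do
        curl -k -s -o /dev/null --max-time 5 -w '%{http_code}\n' "https://$fqdn/cloud"
        i=$((i + 1))
    done
}
```

For example, sample_public_endpoint <public-fqdn> 20 | count_non_200 prints the number of requests that did not return 200. A result of 0 is consistent with the node in Maintenance Mode having been removed from the pool.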
Confirm UI Availability
Open the VCD portal via the public FQDN.
Ensure the login UI loads without the “Failed to Start” error.
This issue is reproducible only when the Load Balancer continues to route traffic to a cell whose Cloud Director service has been intentionally paused (for example, by placing the cell in Maintenance Mode).