Possible symptoms:
vcfa-health-verifier pods in the prelude namespace which have status: Error
https://$VCFA_FQDN:3006/status for health check shows vcfa-services-prelude-health-reporter-http has "ok": false curl -k https://$(kubectl get nodes -ojson | jq -r '.items[].status.addresses[] | select(.type == "ExternalIP").address' | shuf -n 1):30006/status | jq -rError Code: LCMVMSP10036Applying product patch failed.-- PACKAGE DEPLOYMENT [prelude/vcfa-bundle] STATUS: Progressing -- package deployment is in progress -- BUNDLE [prelude/vcfa-bundle] STATUS: Pushed -- -- HR [prelude/vksm-stack] READY: False (DependencyNotReady) -- dependency 'prelude/vmsp-prelude-deployer' is not ready -- HR [prelude/vmsp-prelude-deployer] READY: Unknown (Progressing) -- Running 'install' action with timeout of 1h0m0s
VMware Cloud Foundation Automation 9.x
There are pods in failed status such as Unknown, Evicted or Error
Pods which are in failed status other than Error will not be deleted/restarted automatically by the system.
kubectl get pods -A | grep -v Running | grep -v Completedkubectl delete pods <pod1> <pod2> -n <namespace>vcfa-health-verifier pods do not start completing successfully, there may be pods which are stuck in Error state.