If you SSH to one of the VCF Automation 9 appliances and run:
kubectl get pods -n prelude -owide | grep -v Running
You may observe "prelude" Kubernetes pods in a "CrashLoopBackOff", "ImagePullBackOff", "ErrImagePull" or "Init:Error" state.
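As a rough sketch, the check above can be narrowed to only the four states listed. The helper name filter_failed_pods is hypothetical (it is not part of VCF Automation), and the filter assumes STATUS is the third column of the kubectl output, which holds for the default and wide output formats.

```shell
# Hypothetical helper: keep only pod rows whose STATUS column (assumed to be
# column 3) exactly matches one of the failure states named in this article.
filter_failed_pods() {
  awk '$3 ~ /^(CrashLoopBackOff|ImagePullBackOff|ErrImagePull|Init:Error)$/'
}

# Usage on an appliance (assumes kubectl is already configured for your session):
# kubectl get pods -n prelude -owide --no-headers | filter_failed_pods
```

An empty result means no pod is currently in one of these four states; pods in other non-Running states are not shown by this filter, so the original grep -v Running check remains the broader test.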
No Healthy Upstream
HTTP ERROR 404 JSP file [/error.jsp] not found
GenericJDBCException: could not execute query
/var/log/services-logs:
remaining connection slots are reserved for non-replication superuser connections
VCF Automation 9.0.x
Snapshot-based backup systems are not supported for use with VCF Automation appliances, and using them results in the errors above.
Snapshot attempts cause VCF Automation services to time out and be retried continuously, which produces a spike in database connections that hits the "max_connections" limit.
Even though the snapshots will likely fail, the attempt to take them is enough to cause this issue.
For more information, see: Snapshot Management options removed from VCF Automation (VCF-A) and VCF Identity Broker in Fleet Management
Disable any snapshot-based backup solutions or schedules that target VCF Automation 9 appliances.
To resolve the crashed pods, reboot the VCFA nodes using VCF Ops Fleet Management.
If the reboot task fails, contact Broadcom Support for help with manually rebooting the cluster nodes.