During a VMware Cloud Foundation (VCF) Operations (Aria Operations) deployment, the cluster may become stuck in a degraded or non-functional state when attempting to add a new data node. The issue occurs due to a network disruption between ESXi hosts during node expansion, which results in an incomplete failover between primary and replica nodes and breaks database replication.
You may observe:
pg_basebackup: error: connection to server failed:FATAL: no pg_hba.conf entry for replication connectionThis can occur after:
VMware Cloud Foundation (VCF) 9.x
VCF Operations / Aria Operations 9.x
The cluster enters an inconsistent state due to a partially completed failover between the primary and primary replica nodes, combined with broken PostgreSQL replication configuration.
$VMWARE_PYTHON_BIN /usr/lib/vmware-vcopssuite/utilities/sliceConfiguration/bin/vcopsClusterManager.py offline-cluster "SUPPORT"