Symptoms:
2021-05-21T04:45:32 DEBUG - startCentralPostgres:198 - Start the central postgres service
2021-05-21T04:45:32 INFO - runScript:152 - Running command: /sbin/service vpostgres-repl start
2021-05-21T04:45:33 INFO - runScript:159 - stdout:
2021-05-21T04:45:33 INFO - runScript:160 - stderr: Job for vpostgres-repl.service failed because the control process exited with error code.
See "systemctl status vpostgres-repl.service" and "journalctl -xe" for details.
2021-05-21T04:45:33 INFO - runScript:161 - exit code: 1
2021-05-21T04:45:33 ERROR - runScript:165 - Script command: "/sbin/service vpostgres-repl start" failed with exit code: 1
VMware Aria Operations 8.x
/etc/init.d/vpostgres-repl
in vROps 8.3 rely on the postgres user having read access to the .pgpass
file to perform the psql
("Test connection to $MASTE_IP") and pg_basebackup
("Base backup from $MASTER_IP") steps in the run_as_replica()
function. Starting with vROps 8.4, the postgres replication service connects using certificates instead of relying on .pgpass
for user/password connection settings.vpostgres-repl
during upgrades trap the same errors and cause the upgrade to fail..pgpass
ownership, HA failovers may complete partially, leaving both the primary and replica nodes with the file /storage/db/vcops/recovery.conf.bootstrap
. The existence of this file on the replica node triggers the vpostgres-repl
service to reset the current node as the receiving side of replication. It should never exist on the primary node.This is a known issue in Aria Operations, and there is currently no resolution available. If you believe that you have encountered this issue, please raise a case with Broadcom support.