Following remediation of VMware Identity Manager (vIDM) critical health via reboot, VMware Aria Automation (vRA) fails to start.
Users encounter the following symptoms or errors:
VMware Aria Automation 8.18.x
VMware Identity Manager 3.3.7
VMware Aria Suite Lifecycle 8.18.x
A degraded vIDM cluster prevented the identity-service-app from reaching a ready state:
Diagnose and Stabilize vIDM Cluster:
Access: https://vIDM_FQDN:8443
Review System Diagnostics and verify opensearch health reports as unknown
Identify the Primary vIDM node.
Fix quorum and daemon failures:
Check the status of key services:
/etc/init.d/vpostgres status
/etc/init.d/horizon-workspace status
/etc/init.d/opensearch status
Check the expiration of the vIDM "postgres" user account:
chage -l postgres
chage -M number_of_days_here postgres
e.g.:
chage -M 9999 postgres
Stop and start all services in the following order (Starting with the Primary viDM node, then the Secondary nodes):
/etc/init.d/opensearch stop
/etc/init.d/horizon-workspace stop
/etc/init.d/vpostgres stop
/etc/init.d/vpostgres start
/etc/init.d/horizon-workspace start
/etc/init.d/opensearch start
Monitor opensearch health:
Expect unassigned shards to resolve over time (may take up to 24 hours)
Once green, proceed to Aria Automation service restart
Restart Aria Automation services:
SSH to an Aria Automation appliance
Run:
Verify:
Confirm that identity-service-app pods now show 1/1 Running
UI and inventory sync should resume normal operation.
Opensearch requires quorum to function when clustered. If a lead node is inaccessible, dependent services will fail. Unassigned shards will normally self-recover post-restart, depending on shard volume and node availability. Always verify vIDM stability before restarting Aria Automation services.