Symptoms:
LCMVRAVACONFIG90029 Check health status failed."http://<Aria Automation FQDN>:8008/api/v1/services/cluster" is failing intermitently.time vracli service status --json" takes more than 10 seconds intermittently.VMware Aria Automation 8.x
The "vracli service status --json" command is taking too long to return the output (more than 10 seconds), which in turn reaches the timeout in the script "/opt/scripts/check_services_health.sh"
Edit script "/opt/scripts/check_services_health.sh" and increase the timeout value for the cluster and local services to a higher value (default is 10 seconds):
Example:
# Update overall status for services in the cluster. Cut at 10 seconds# to avoid error result from the infra health check.tmpCluster=$(mktemp "${SVC_MON_DIR}/tmp-cluster-XXXXXXX.json")timeout 15 vracli service status --json > "$tmpCluster" <<<<------- Change to 15 seconds or moreresult=$?
[...]
# Update status for local service instances. Cut at 10 seconds# to avoid error result from the infra health check.tmpLocal=$(mktemp "${SVC_MON_DIR}/tmp-local-XXXXXXX.json")timeout 15 vracli service status --json --local > "$tmpLocal" <<<<------- Change to 15 seconds or moreresult=$?