Symptoms:
LCMVRAVACONFIG90029 Check health status failed.
"http://<Aria Automation FQDN>:8008/api/v1/services/cluster" is failing intermittently.
"time vracli service status --json" takes more than 10 seconds intermittently.
Environment:
VMware Aria Automation 8.x
Cause:
The "vracli service status --json" command takes too long to return its output (more than 10 seconds), so it hits the timeout set in the script "/opt/scripts/check_services_health.sh".
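To confirm this behaviour before changing anything, the command can be timed over several runs. The following is a minimal sketch, assuming a POSIX shell on the appliance; the helper name count_slow_runs and the run count in the usage line are illustrative and not part of the product.

```shell
# Sketch: run a command several times and count the runs that take
# at least LIMIT seconds. count_slow_runs is a hypothetical helper name.
# date +%s has one-second resolution, so the result is approximate.
count_slow_runs() (
    limit="$1"   # threshold in whole seconds
    runs="$2"    # how many times to run the command
    shift 2      # the remaining arguments are the command to time
    slow=0
    i=0
    while [ "$i" -lt "$runs" ]; do
        start=$(date +%s)
        "$@" > /dev/null 2>&1
        end=$(date +%s)
        if [ $((end - start)) -ge "$limit" ]; then
            slow=$((slow + 1))
        fi
        i=$((i + 1))
    done
    echo "$slow"
)

# Usage on the appliance (20 runs is an arbitrary choice):
#   count_slow_runs 10 20 vracli service status --json
```

A non-zero result suggests that some runs reach the 10 second limit, which matches the intermittent failures described above.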
Resolution:
Edit the script "/opt/scripts/check_services_health.sh" and increase the timeout for the cluster and local service checks above the default of 10 seconds:
Example:
# Update overall status for services in the cluster. Cut at 10 seconds
# to avoid error result from the infra health check.
tmpCluster=$(mktemp "${SVC_MON_DIR}/tmp-cluster-XXXXXXX.json")
timeout 15 vracli service status --json > "$tmpCluster"   # <<<<------- Change to 15 seconds or more
result=$?
[...]
# Update status for local service instances. Cut at 10 seconds
# to avoid error result from the infra health check.
tmpLocal=$(mktemp "${SVC_MON_DIR}/tmp-local-XXXXXXX.json")
timeout 15 vracli service status --json --local > "$tmpLocal"   # <<<<------- Change to 15 seconds or more
result=$?
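
The same edit can be scripted instead of made by hand. The sketch below assumes the script contains the two calls in the form "timeout <N> vracli service status --json ..." as in the example above; the helper name bump_timeout and the ".bak" backup suffix are illustrative choices. Review the result before relying on it.

```shell
# Sketch: raise the timeout on the vracli status calls, keeping a backup.
# ASSUMPTION: the calls look like "timeout <N> vracli service status --json ...".
# bump_timeout is a hypothetical helper name.
bump_timeout() (
    script="$1"        # path of the script to edit
    new_timeout="$2"   # new timeout in whole seconds
    case "$new_timeout" in
        ''|*[!0-9]*)
            echo "timeout must be a whole number of seconds" >&2
            exit 1
            ;;
    esac
    # Keep a copy of the original next to the script.
    cp -p "$script" "${script}.bak" || exit 1
    # Rewrite only the number that follows "timeout" on the vracli lines.
    sed -E "s/timeout[[:space:]]+[0-9]+([[:space:]]+vracli service status --json)/timeout ${new_timeout}\1/" \
        "${script}.bak" > "$script"
)

# Usage on the appliance:
#   bump_timeout /opt/scripts/check_services_health.sh 15
#   grep -n 'vracli service status' /opt/scripts/check_services_health.sh
```

The grep call lists the lines that invoke vracli so the new value can be checked, and the ".bak" copy allows the change to be reverted.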