VMware Aria Automation Health API fails with 500 error.

Article ID: 397839

Updated On:

Products

VMware Aria Suite

Issue/Introduction

Symptoms:

  • The health status check in VMware Aria Suite Lifecycle fails with the error "LCMVRAVACONFIG90029 Check health status failed."
  • The Aria Automation health URL "http://<Aria Automation FQDN>:8008/api/v1/services/cluster" fails intermittently.
  • The command "time vracli service status --json" takes more than 10 seconds intermittently.
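
To check how often the third symptom occurs, the command can be timed over several runs. The following is a minimal sketch, not part of the product: the helper name count_slow_runs is hypothetical, and the timing uses whole seconds from "date +%s", so a run close to the threshold may be counted either way.

```shell
#!/usr/bin/env bash
# Hypothetical helper: count how many of N runs of a command take at least
# <threshold> seconds. Timing is in whole seconds, so results are approximate.
# Usage: count_slow_runs <threshold_seconds> <runs> <command> [args...]
count_slow_runs() {
    local threshold=$1 runs=$2
    shift 2
    local slow=0 i start end
    for ((i = 0; i < runs; i++)); do
        start=$(date +%s)
        "$@" > /dev/null 2>&1      # output is discarded, only the duration matters
        end=$(date +%s)
        if (( end - start >= threshold )); then
            slow=$((slow + 1))
        fi
    done
    echo "$slow"
}

# Example on an Aria Automation appliance (20 runs, 10-second threshold):
# count_slow_runs 10 20 vracli service status --json
```

A non-zero result suggests that the command is sometimes slow enough to hit the 10-second limit described under Cause.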

Environment

VMware Aria Automation 8.x

Cause

The "vracli service status --json" command intermittently takes more than 10 seconds to return its output. This exceeds the 10-second timeout set in the script "/opt/scripts/check_services_health.sh", which cuts the command off before it completes.
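
The effect of the time limit can be reproduced with any slow command. The sketch below assumes GNU coreutils "timeout", which exits with status 124 when it has to stop the command. The helper name run_with_limit is hypothetical and is not part of the health script.

```shell
#!/usr/bin/env bash
# Hypothetical helper: run a command under a time limit and report whether
# the limit was reached. GNU timeout exits with 124 when it stops the command.
# Usage: run_with_limit <limit_seconds> <command> [args...]
run_with_limit() {
    local limit=$1
    shift
    timeout "$limit" "$@" > /dev/null 2>&1
    local result=$?
    if (( result == 124 )); then
        echo "timed out after ${limit}s"
    else
        echo "finished with exit status ${result}"
    fi
}

# Example on an Aria Automation appliance, using the default 10-second limit:
# run_with_limit 10 vracli service status --json
```

In the health script the same status ends up in the "result" variable, which is why a slow "vracli" call is treated as a failed check.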

Resolution

Edit the script "/opt/scripts/check_services_health.sh" and increase the timeout value for both the cluster and the local service checks (the default is 10 seconds):

Example:

 

# Update overall status for services in the cluster. Cut at 10 seconds
# to avoid error result from the infra health check.
tmpCluster=$(mktemp "${SVC_MON_DIR}/tmp-cluster-XXXXXXX.json")
timeout 15 vracli service status --json > "$tmpCluster"          # Changed from the default 10; use 15 seconds or more
result=$?

[...]

# Update status for local service instances. Cut at 10 seconds
# to avoid error result from the infra health check.
tmpLocal=$(mktemp "${SVC_MON_DIR}/tmp-local-XXXXXXX.json")
timeout 15 vracli service status --json --local > "$tmpLocal"    # Changed from the default 10; use 15 seconds or more
result=$?
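
The same change can be scripted instead of edited by hand. This is a sketch under stated assumptions: it assumes that the two lines begin with "timeout <number> vracli service status", as in the example above, and the function name raise_health_timeout is hypothetical. It keeps a backup copy with a ".bak" suffix next to the script.

```shell
#!/usr/bin/env bash
# Hypothetical helper: set a new timeout on every line of the form
# "timeout <number> vracli service status ...". The original file is
# saved as <script>.bak before it is changed.
# Usage: raise_health_timeout <script_path> <new_timeout_seconds>
raise_health_timeout() {
    local script=$1 new_timeout=$2
    sed -i.bak -E \
        "s/^([[:space:]]*timeout[[:space:]]+)[0-9]+([[:space:]]+vracli service status)/\1${new_timeout}\2/" \
        "$script"
}

# Example on an Aria Automation appliance:
# raise_health_timeout /opt/scripts/check_services_health.sh 15
# grep -n 'vracli service status' /opt/scripts/check_services_health.sh
```

Reviewing the "grep" output afterwards confirms that both the cluster and the local line carry the new value.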