Aria Automation node marked unhealthy due to memory usage health check failure [/opt/health/Makefile:73: memory-usage] Error 1

Article ID: 394277

Products

VCF Operations/Automation (formerly VMware Aria Suite)

Issue/Introduction

The Load Balancer marks Aria Automation nodes as unhealthy because the /opt/health/run.sh health check script returns an HTTP 500 status. The Load Balancer then stops routing traffic to the affected nodes, which disrupts service.

Environment

Aria Automation 8.x

Cause

The health check script /opt/health/run.sh includes a memory-usage check. When memory consumption exceeds 90%, this check fails with the following error:

Running check memory-usage  
make: *** [/opt/health/Makefile:73: memory-usage] Error 1

As a result, the script returns a non-200 status code (HTTP 500), causing the Load Balancer to treat the node as unhealthy and stop routing traffic to it.
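To confirm which check is failing, the health script can be run manually on the affected node. The following is a minimal sketch, not a supported tool: run_health_check is a hypothetical helper name, and it assumes that /opt/health/run.sh exits with a non-zero status when a check fails, which this article does not confirm.

```shell
#!/bin/sh
# Path taken from this article; set HEALTH_SCRIPT to point elsewhere if needed.
HEALTH_SCRIPT="${HEALTH_SCRIPT:-/opt/health/run.sh}"

# Hypothetical helper: run the health script, print the lines that name a
# check or report a make error, and return the script's exit status.
# Assumption: a failed check makes the script exit non-zero.
run_health_check() {
    script=$1
    if [ ! -x "$script" ]; then
        echo "Health script not found or not executable: $script" >&2
        return 127
    fi
    status=0
    output=$("$script" 2>&1) || status=$?
    # Keep only the check names and make errors; ignore "no match" from grep
    printf '%s\n' "$output" | grep -E 'Running check|Error' || true
    echo "Exit status: $status"
    return "$status"
}

run_health_check "$HEALTH_SCRIPT" || echo "Health check did not pass (status $?)"
```

If the output contains the memory-usage line shown above, the node is failing because of memory consumption rather than another check.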

 

At the OS level, memory consumption is also above 90%. To identify the processes using the most memory, run:

ps aux --sort=-%mem
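The ps output ranks individual processes but does not show overall usage. The sketch below estimates the overall percentage from /proc/meminfo. It assumes that used memory equals MemTotal minus MemAvailable; the health check may calculate usage differently, so treat the result as an approximation. mem_used_pct is a hypothetical helper name.

```shell
#!/bin/sh
# Hypothetical helper: print used memory as a whole-number percentage.
# Assumption: used = MemTotal - MemAvailable (the health check's own
# formula is not documented in this article and may differ).
# $1: optional path to a meminfo-formatted file (default: /proc/meminfo)
mem_used_pct() {
    awk '
        /^MemTotal:/     { total = $2 }
        /^MemAvailable:/ { avail = $2 }
        END {
            if (total <= 0) exit 1
            printf "%d\n", int((total - avail) * 100 / total)
        }
    ' "${1:-/proc/meminfo}"
}

echo "Memory used: $(mem_used_pct)%"

# Header line plus the five largest memory consumers
ps aux --sort=-%mem | head -n 6
```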

Resolution

Monitor memory usage on Aria Automation nodes and ensure it stays below the 90% threshold.
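One way to monitor this is a small script run on a schedule (for example from cron) that warns before the threshold is reached. This is a sketch under stated assumptions: classify_mem_usage is a hypothetical helper, the 90% limit comes from this article, the 80% early-warning level is an arbitrary choice and not a product setting, and usage is again approximated as MemTotal minus MemAvailable.

```shell
#!/bin/sh
# Hypothetical helper: classify a memory usage percentage.
# $1: usage in percent, $2: warning level (assumed default 80),
# $3: critical level (90, the health check limit named in this article)
classify_mem_usage() {
    pct=$1
    warn=${2:-80}
    crit=${3:-90}
    if [ "$pct" -gt "$crit" ]; then
        echo "CRITICAL: memory at ${pct}% (above ${crit}%)"
        return 2
    elif [ "$pct" -ge "$warn" ]; then
        echo "WARNING: memory at ${pct}% (approaching ${crit}%)"
        return 1
    fi
    echo "OK: memory at ${pct}%"
    return 0
}

# Current usage; assumption: used = MemTotal - MemAvailable
current=$(awk '/^MemTotal:/ {t=$2} /^MemAvailable:/ {a=$2}
               END {if (t > 0) print int((t - a) * 100 / t)}' /proc/meminfo)

if [ -n "$current" ]; then
    classify_mem_usage "$current"
fi
```

The return codes (0, 1, 2) let a scheduler or monitoring agent act on the result without parsing the message text.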

If persistently high memory usage is expected, consider increasing the memory allocated to the nodes in line with the sizing guidelines.

For details, refer to the sizing guidelines: Sizing-Guidelines