Aria Automation nodes are being marked as unhealthy by the Load Balancer due to the /opt/health/run.sh script returning an HTTP 500 status. This leads to service disruption as the Load Balancer stops routing traffic to the affected nodes.
Aria Automation 8.x
The health check script /opt/health/run.sh includes a memory-usage check. When memory consumption exceeds 90%, this check fails with the following error:
Running check memory-usage make: *** [/opt/health/Makefile:73: memory-usage] Error 1
As a result, the script returns a non-200 status code (HTTP 500), causing the Load Balancer to treat the node as unhealthy and stop routing traffic to it.
Checking the OS level Memory consumption is also above %90
ps aux --sort=-%mem
Monitor memory usage on Aria Automation nodes and ensure it stays below the 90% threshold.
If persistent high memory usage is expected, consider increasing the memory allocated to nodes as per sizing guidelines.
Please refer to the sizing guidelines Sizing-Guidelines