vsanmgmtd service on the affected ESXi host.vsanmgmtd reveals counts significantly higher than 100 (e.g., 180+).vpxa.log contains TimeoutException errors for local port 8089 (the vsanmgmtd loopback port): HTTP Connection has timed out while waiting for further requests; <TCP '[IP_ADDRESS] : 8089'>, N7Vmacore16TimeoutExceptionEVMware vSAN 8.x
This issue is caused by a massive influx of API requests—such as queryBatchPerformanceStatistics and queryAvailableMetric—hitting the ESXi host in a short period (dozens of requests per second). These requests are typically generated by external monitoring software, most commonly VMware Aria Operations, using an aggressive collection interval.
Because each request consumes a file descriptor, the high volume causes the vsanmgmtd daemon to hit its hard limit for open files. Once this limit is reached, the daemon can no longer accept local socket connections from vpxa (the vCenter agent), leading to service stalls and health check failures.
To resolve this issue, identify the source of the aggressive API calls and adjust the monitoring collection frequency.
Identify the Source Machine:
vpxa.log: grep -Ei 'queryBatchPerformanceStatistics|queryAvailableMetric' vpxa.log.vpxd-profiler.log on vCenter to identify the monitoring server. Adjust Aria Operations Collection Interval:
Monitor File Handlers:
/bin/lsof | grep -v "MMAP" | grep vsanmgmtd | wc -l