To identify the issue reported by the vSAN health test results when it is caused by a memory leak.
Symptoms:
Unable to place a vSAN ESXi host into maintenance mode
The disk group in vSAN shows ‘disks in use’ as 0 under vCenter > Cluster > Configure > vSAN > Disk management
The Health plugin (vCenter > Cluster > Monitor > vSAN > Health) shows different issues each time the health test is run. A few examples:
- Network - Hosts with connectivity issues
- Network - Network latency check
- Host configured with different Environment variables
You will see the following error traces in the logs:
# less vsanmgmt.log |grep -i MEMORY
2019-05-21T03:48:53Z VSANMGMTSVC: ERROR vsanperfsvc[Thread-2] [VsanHealthDaemon::doVsanHealthCheck] Get exception in invoking QueryCheckLimits : [Errno 12] Cannot allocate memory
2019-05-21T03:53:53Z VSANMGMTSVC: ERROR vsanperfsvc[Thread-2] [VsanHealthDaemon::doVsanHealthCheck] Get exception in invoking QueryCheckLimits : [Errno 12] Cannot allocate memory
2019-05-21T03:58:53Z VSANMGMTSVC: ERROR vsanperfsvc[Thread-2] [VsanHealthDaemon::doVsanHealthCheck] Get exception in invoking QueryCheckLimits : [Errno 12] Cannot allocate memory
2019-05-21T04:03:53Z VSANMGMTSVC: ERROR vsanperfsvc[Thread-2] [VsanHealthDaemon::doVsanHealthCheck] Get exception in invoking QueryCheckLimits : [Errno 12] Cannot allocate memory
2019-05-21T04:08:53Z VSANMGMTSVC: ERROR vsanperfsvc[Thread-2] [VsanHealthDaemon::doVsanHealthCheck] Get exception in invoking QueryCheckLimits : [Errno 12] Cannot allocate memory
2019-05-21T04:09:11Z VSANMGMTSVC: INFO vsanperfsvc[MainThread] [statsdaemon::_logDaemonMemoryStats] Daemon memory stats: eMin=97.280MB, eMinPeak=103.396MB, rMinPeak=103.396MB
# grep -r "memory stats" var/run/log/vsanmgmt* | grep PRESSURE
No result
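To gauge how often the health daemon is hitting the allocation failure, the matching lines can be counted rather than paged through. The following is a minimal sketch, not a supported tool: the function name count_enomem is hypothetical, and the log path in the usage line is an assumption based on the paths shown above.

```shell
# Hypothetical helper: count "Cannot allocate memory" lines in a vsanmgmt log.
# $1 is the log file to scan; the caller supplies the actual path.
count_enomem() {
    # -c prints the number of matching lines, -F matches the text literally
    grep -cF 'Cannot allocate memory' "$1"
}
```

Example usage, assuming the log is at /var/run/log/vsanmgmt.log: count_enomem /var/run/log/vsanmgmt.log. A count that keeps growing between runs suggests the error is recurring at each health check interval, as in the traces above.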
Restarting the vSAN management service (vsanmgmtd) fixes the issue only temporarily (for 10 to 180 minutes), after which the issue reappears on the vSAN Health Test page.
On all the hosts in the cluster run:
/etc/init.d/vsanmgmtd restart
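If the cluster has many hosts, the restart can be looped from an administration workstation. This is only a sketch under several assumptions: SSH is enabled on every ESXi host, root login is permitted, the host names in HOSTS are placeholders, and the function name restart_vsanmgmtd_all is hypothetical. Setting DRY_RUN=1 prints the commands without running them.

```shell
# Placeholder host list (assumption); replace with the ESXi hosts in the cluster.
HOSTS="${HOSTS:-esx01.example.com esx02.example.com}"

# Hypothetical helper: restart vsanmgmtd on each host in $HOSTS over SSH.
restart_vsanmgmtd_all() {
    for host in $HOSTS; do
        if [ "${DRY_RUN:-0}" = "1" ]; then
            # Dry run: show what would be executed, change nothing.
            echo "ssh root@$host /etc/init.d/vsanmgmtd restart"
        else
            ssh "root@$host" /etc/init.d/vsanmgmtd restart
        fi
    done
}
```

Running DRY_RUN=1 restart_vsanmgmtd_all first is a cheap way to confirm the host list before touching any service.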
If restarting "vsanmgmtd" does not resolve the issue, restart the services below and then verify whether the issue is resolved:
/etc/init.d/vsanvpd restart
/etc/init.d/vpxa restart
/etc/init.d/hostd restart
From vCenter Appliance SSH:
service-control --stop --all
service-control --start --all