Q: What does the “Cluster health – vSAN daemon liveness check (Former: vSAN CLOMD liveness check)” check do?
It checks whether the CLOMD, EPD, CMMDSD, OSFSD, and CMMDSTIMEMACHINED services are alive. For CLOMD, it first checks that the service is running on all ESXi hosts, and then contacts the service to retrieve run-time statistics, which verifies that CLOMD can respond to inquiries. For EPD, CMMDSD, OSFSD, and CMMDSTIMEMACHINED, it checks whether the service is running properly on all ESXi hosts.
Note: This check does not guarantee that all of the functionality discussed above (for example, object creation and rebalancing) actually works, but it gives a first-level assessment of the health of the CLOMD, EPD, CMMDSD, OSFSD, and CMMDSTIMEMACHINED services.
Q: What does it mean when it is in an error state?
This test performs only a very basic check that the vSAN daemons are still running, so the daemons may still have issues even when it passes. If this check reports an error, one or more of the CLOMD, EPD, CMMDSD, OSFSD, and CMMDSTIMEMACHINED services are not working as expected and need to be checked on the relevant ESXi host.
A good way to further probe into CLOMD health is to perform a virtual machine creation test (Proactive tests), as this involves object creation that will exercise and test CLOMD thoroughly.
For more information about this issue, refer to the following article:
CLOM Daemon Liveness Check
Q: How does one troubleshoot and fix the error state?
For standard clusters, all of the CLOMD, EPD, CMMDSD, OSFSD, and CMMDSTIMEMACHINED services should be running on all nodes in the cluster.
For stretched clusters and metadata clusters, the table below shows whether each service is expected to be running on the respective node:
| Service | Data node of stretched cluster | Witness node of stretched cluster | Data node of metadata cluster | Metadata node of metadata cluster |
|---|---|---|---|---|
| CLOMD | Yes | No | Yes | Yes |
| EPD | Yes | No | Yes | No |
| CMMDSD | Yes | Yes | Yes | Yes |
| OSFSD | Yes | No | Yes | No |
| cmmdsTimemachined | Yes | No | Yes | No |
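As an illustration, the expectations in the table can be written down as a small lookup function. This is only a sketch: the role names (`standard`, `stretched-data`, `stretched-witness`, `metadata-data`, `metadata-meta`) are hypothetical labels invented for this example, and the daemon names follow the init script names used in the commands later in this article.

```shell
# Sketch: expected vSAN daemons per node role, transcribed from the table.
# The role names are hypothetical labels, not identifiers used by vSAN.
expected_daemons() {
    case "$1" in
        standard|stretched-data|metadata-data)
            # Every service is expected on these nodes.
            echo "clomd epd cmmdsd osfsd cmmdsTimeMachine" ;;
        stretched-witness)
            # Only CMMDSD is expected on a witness node.
            echo "cmmdsd" ;;
        metadata-meta)
            # Only CLOMD and CMMDSD are expected on a metadata node.
            echo "clomd cmmdsd" ;;
        *)
            echo "unknown role: $1" >&2
            return 1 ;;
    esac
}
```

For example, `expected_daemons stretched-witness` prints `cmmdsd`, which matches the witness column of the table.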
The status of a daemon that is not checked on an ESXi host is shown as “--”.
If any of the CLOMD, EPD, CMMDSD, OSFSD, and CMMDSTIMEMACHINED services is not running on a particular ESXi host, the status of that service on that host is shown as Abnormal.
For this test to succeed, the health service needs to be installed on the ESXi host and the CLOMD, EPD, CMMDSD, OSFSD, and CMMDSTIMEMACHINED services need to be running. To get the status of the CLOMD, EPD, CMMDSD, OSFSD, and CMMDSTIMEMACHINED services on the ESXi host, run this command:
/etc/init.d/cmmdsd status && /etc/init.d/epd status && /etc/init.d/clomd status && /etc/init.d/cmmdsTimeMachine status && /etc/init.d/osfsd status
If a daemon is not running, try running the restart command on the ESXi host:
/etc/init.d/cmmdsd restart && /etc/init.d/epd restart && /etc/init.d/clomd restart && /etc/init.d/cmmdsTimeMachine restart && /etc/init.d/osfsd restart
If the vSAN daemon liveness check still fails after these steps, or if it continues to fail on a regular basis, open a support request with VMware Support. For more information, see How to file a Support Request in Customer Connect (2006985).
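Because the status command above chains the checks with `&&`, the chain stops at the first command that returns a non-zero exit code. The sketch below queries each daemon independently so that every result is reported. It assumes that each init script's `status` action exits with zero when the daemon is running and non-zero otherwise, which has not been verified against every ESXi release. The function name `check_daemons` and its optional directory argument are hypothetical, introduced only for this example.

```shell
# Sketch: report the status of each vSAN daemon independently.
# Assumption: "<init script> status" exits 0 when the daemon is running.
# The optional first argument (init script directory) is hypothetical and
# exists only to make the function easy to try outside an ESXi host.
check_daemons() {
    init_dir="${1:-/etc/init.d}"
    failed=0
    for d in cmmdsd epd clomd cmmdsTimeMachine osfsd; do
        script="$init_dir/$d"
        if [ ! -x "$script" ]; then
            echo "$d: init script not found"
            failed=$((failed + 1))
        elif "$script" status >/dev/null 2>&1; then
            echo "$d: running"
        else
            echo "$d: NOT running"
            failed=$((failed + 1))
        fi
    done
    # The return value is the number of daemons that did not report running.
    return "$failed"
}
```

Running `check_daemons` with no arguments on an ESXi host prints one line per daemon. On a witness or metadata node, some daemons are expected not to run, so compare the output against the table above before restarting anything.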