vSAN (All Versions)
This is due to a rare issue where DOM Component Manager runs out of usable memory over time.
This issue has been fixed in 8.0U3b.
Workaround:
1) To identify the host with the issue, open an SSH session to each host in the cluster and run the following command:
# vsish -e get /vmkModules/vsanutil/slabs/dom-CompServer-objSlab/stats | grep "Current allocations" | cut -f2 -d":"
If "Current allocations" exceeds 8000, the host has hit the issue.
2) Get the impacted host's UUID by running the following command on that host:
# cmmds-tool whoami
3) Exclude the impacted host from CLOM placement by running the following command on all hosts in the cluster:
# /usr/lib/vmware/vsan/bin/clom-tool set-global-exclusion-list --exclusion-list=<host UUID>
4) Place the impacted host into maintenance mode using the Ensure Accessibility option, then reboot the host.
5) Run the command from Step 1 again on the rebooted host to confirm that "Current allocations" is now below 8000.
6) Restart the CLOMD service on all hosts that were not rebooted by running the following command. This reverts the CLOM exclusion back to default:
# /etc/init.d/clomd restart
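
Note: The threshold check in Steps 1 and 5 can be scripted so that the value does not have to be compared by eye. The following is a minimal sketch, not an official VMware tool. The helper name check_allocations and the OK/AFFECTED/UNKNOWN labels are hypothetical. The sketch assumes that the Step 1 pipeline returns a plain decimal number, possibly padded with spaces, and that sed is available in the ESXi shell. The only values taken from this article are the vsish command and the 8000 threshold.

```shell
#!/bin/sh
# Threshold from Step 1 of this article.
THRESHOLD=8000

# check_allocations (hypothetical helper): takes the "Current allocations"
# value as $1 and prints OK, AFFECTED, or UNKNOWN.
check_allocations() {
    # Trim the leading/trailing whitespace that cut may leave around the value.
    value=$(printf '%s' "$1" | sed 's/^[[:space:]]*//;s/[[:space:]]*$//')

    # Assumption: the value is a plain decimal integer. Anything else is
    # reported as UNKNOWN instead of being guessed at.
    case "$value" in
        ''|*[!0-9]*)
            echo "UNKNOWN"
            return 2
            ;;
    esac

    # "Exceeds 8000" means strictly greater than the threshold.
    if [ "$value" -gt "$THRESHOLD" ]; then
        echo "AFFECTED"
        return 1
    fi

    echo "OK"
    return 0
}

# Run the Step 1 command only where vsish exists (that is, on an ESXi host).
if command -v vsish >/dev/null 2>&1; then
    current=$(vsish -e get /vmkModules/vsanutil/slabs/dom-CompServer-objSlab/stats | grep "Current allocations" | cut -f2 -d":")
    check_allocations "$current"
fi
```

Run the script on each host in the cluster. A host that prints AFFECTED is the one to use in Steps 2 through 4. If a host prints UNKNOWN, run the Step 1 command manually and inspect its output.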