This article provides information on resolving the vCLS health issues, so that DRS functions correctly in the cluster.
vSphere 7.0 Update 1, vSphere DRS for a cluster depends on the health of vSphere Cluster Services (vCLS). vCLS on a cluster configures a quorum on vCLS system VMs on the cluster. These VMs are necessary to maintain the health of the cluster services. If vCLS health gets impacted due to unavailability of these VMs in a cluster, then vSphere DRS will not be functional in the cluster until the time vCLS VMs are brought back up.
Below are the listed operations that could fail if performed when DRS is not functional. Also, another point to note that below operations on a new DRS enabled cluster will not be available until the first vCLS VM is deployed and powered-on in that cluster.
For more information, see vSphere Cluster Services (vCLS) in vSphere 7.0 Update 1 .
VMware vCenter Server 7.0.x
VMware vCenter Server 8.0.x
vCLS VMs will automatically be powered on or recreated by vCLS service. These VMs are deployed prior to any workload VMs that are deployed in a green field/fresh deployment. In an upgrade scenario, these VMs are deployed before vSphere DRS is configured to run on the clusters. When all the vCLS VMs are powered-off or deleted, the vSphere Cluster status for that cluster will turn to 'Degraded (Yellow)'. vSphere DRS needs one of the vCLS VMs to be running in a vSphere cluster to be healthy. If DRS runs prior to these VMs are brought back up, then the cluster service will be 'Unhealthy (Red)', until the time vCLS VMs are brought back up.
Scenarios with a resolution where vCLS VMs deployment could fail:
lack of suitable datastore"Scenarios with a resolution where vCLS VMs power on may fail
ESXi is stuck in maintenance mode at 32% without any Progress due to Stale/Orphaned vCLS VM