The purpose of this article is to resolve HA initialization error.
Symptoms:
- vCenter reports "unknown error initializing HA" for the host, but the Fault Default Manager log shows that the host successfully becomes an HA primary or secondary node.
- After exiting maintenance mode vCenter reports "unknown error initializing HA"
- Enabling HA times out in vCenter, but is successful on host(s).Re-configuring vsphere HA task fails on "Timed out Exception" in vCenter, but Fault Default Manager log shows that the host successfully becomes an HA primary or secondary node.
- Newly powered on virtual machine shows as unprotected by HA in vCenter,but the Fault Default Manager log shows that the virtual machine is protected.
- vCenter and ESXi hosts are not in the same subnet, there is a gateway/ firewall between vCenter and ESXi hosts.
- Disabling and re-enabling HA on the cluster will resolve the issue temporarily, but shows error "HA Initialization Error - Waiting to complete cluster election"
- Reconfiguring HA at the host level fails with error "Operation timed-out - state Uninitialized - Initialization Error - vShere HA agent cannot be correctly installed or configured"
- In vCenter vpxd.log, you will see similar entries as below
YYYY-MM-DDTTTT:MM:SS.[06212 info 'commonvpxLro' opID=########-########-##-#] [VpxLRO] -- BEGIN task-internal-27741 -- -- DasConfig.ConfigureHost --
YYYY-MM-DDTTT:MM:SS [06212 info 'vpxdvpxdMoHost' opID=########-########-##-#] [HostMo::UpdateDasState] VC state for host host-xxx (HA disabled -> uninitialized), FDM state (UNKNOWN_FDM_HSTATE -> UNKNOWN_FDM_HSTATE), src of state (null -> null)
YYYY-MM-DDTTT:MM:SS[06212 error 'DAS' opID=########-########-##-#] [VpxdDasConfigLRO::Config] Timed out waiting for election to complete or for host to join existing master
YYYY-MM-DDTTT:MM:SS[06212 error 'DAS' opID=########-########-##-#] [VpxdDasConfigLRO::Config] EnableDAS failed on host [vim.HostSystem:host-<hostname>]: class Vim:Fault:Timedout::Exception(vim.fault.Timedout)
- in ESXi host, fdm.log we see similar entries as below
fdm.log: [FFC8AB70 info 'Election' opID=SWI-6058ed8] Slave timed out
fdm.log: [FFB7AB70 verbose 'Cluster' opID=SWI-56f32f43] Marking slave host-xxx as unreachable
Note:The preceding log excerpts are only examples.Date,time and environmental variables may vary depending on your environment