1. After a recent ESXi host update, the vSphere HA configuration remains stuck for a long time.
2. HA configuration fails with "Configuration error".
3. The vCenter vpxd log, located at /var/log/vmware/vpxd/vpxd.log, shows entries similar to the following:
info vpxd[40638] [Originator@6876 sub=MoHost opID=lro-623-62130020] VC state for host host-848440 (uninitialized -> retry), FDM state (UNKNOWN_FDM_HSTATE -> UNKNOWN_FDM_HSTATE), src of state (null -> null)
info vpxd[40231] [Originator@6876 sub=MoHost opID=lro-616-7e03148e] VC state for host host-826061 (uninitialized -> retry), FDM state (UNKNOWN_FDM_HSTATE -> UNKNOWN_FDM_HSTATE), src of state (null -> null)
info vpxd[39820] [Originator@6876 sub=MoHost opID=lro-617-7c3ab743] VC state for host host-122893 (uninitialized -> retry), FDM state (UNKNOWN_FDM_HSTATE -> UNKNOWN_FDM_HSTATE), src of state (null -> null)
info vpxd[40628] [Originator@6876 sub=MoHost opID=lro-626-1d1d2032] VC state for host host-1036579 (uninitialized -> retry), FDM state (UNKNOWN_FDM_HSTATE -> UNKNOWN_FDM_HSTATE), src of state (null -> null)
info vpxd[40579] [Originator@6876 sub=MoHost opID=lro-621-15ffbc01] VC state for host host-397994 (uninitialized -> retry), FDM state (UNKNOWN_FDM_HSTATE -> UNKNOWN_FDM_HSTATE), src of state (null -> null)
info vpxd[39915] [Originator@6876 sub=MoHost opID=lro-634-98eaf24] VC state for host host-145375 (uninitialized -> retry), FDM state (UNKNOWN_FDM_HSTATE -> UNKNOWN_FDM_HSTATE), src of state (null -> null)
info vpxd[40759] [Originator@6876 sub=MoHost opID=FdmMonitor-domain-c981493-12f30c61] VC state for host host-1010562 (uninitialized -> uninitialized), FDM state (Master -> UNKNOWN_FDM_HSTATE), src of state (null -> null)
info vpxd[40638] [Originator@6876 sub=MoHost opID=lro-790-ce81c45] VC state for host host-1081598 (retry -> uninitialized), FDM state (UNKNOWN_FDM_HSTATE -> UNKNOWN_FDM_HSTATE), src of state (null -> null)
info vpxd[40638] [Originator@6876 sub=MoHost opID=lro-790-ce81c45] VC state for host host-1081592 (retry -> uninitialized), FDM state (UNKNOWN_FDM_HSTATE -> UNKNOWN_FDM_HSTATE), src of state (null -> null)
info vpxd[40638] [Originator@6876 sub=MoHost opID=lro-790-ce81c45] VC state for host host-1081595 (retry -> uninitialized), FDM state (UNKNOWN_FDM_HSTATE -> UNKNOWN_FDM_HSTATE), src of state (null -> null)
info vpxd[40440] [Originator@6876 sub=MoHost opID=lro-838-5391b34f] VC state for host host-834347 (retry -> uninitialized), FDM state (UNKNOWN_FDM_HSTATE -> UNKNOWN_FDM_HSTATE), src of state (null -> null)
info vpxd[40440] [Originator@6876 sub=MoHost opID=lro-838-5391b34f] VC state for host host-1068172 (retry -> uninitialized), FDM state (UNKNOWN_FDM_HSTATE -> UNKNOWN_FDM_HSTATE), src of state (null -> null)
info vpxd[40440] [Originator@6876 sub=MoHost opID=lro-838-5391b34f] VC state for host host-834350 (retry -> uninitialized), FDM state (UNKNOWN_FDM_HSTATE -> UNKNOWN_FDM_HSTATE), src of state (null -> null)
info vpxd[40440] [Originator@6876 sub=MoHost opID=lro-838-5391b34f] VC state for host host-834344 (retry -> uninitialized), FDM state (UNKNOWN_FDM_HSTATE -> UNKNOWN_FDM_HSTATE), src of state (null -> null)
info vpxd[39814] [Originator@6876 sub=MoHost opID=lro-843-49134a39] VC state for host host-380126 (retry -> uninitialized), FDM state (UNKNOWN_FDM_HSTATE -> UNKNOWN_FDM_HSTATE), src of state (null -> null)
info vpxd[39814] [Originator@6876 sub=MoHost opID=lro-843-49134a39] VC state for host host-380123 (retry -> uninitialized), FDM state (UNKNOWN_FDM_HSTATE -> UNKNOWN_FDM_HSTATE), src of state (null -> null)
info vpxd[40440] [Originator@6876 sub=MoHost opID=lro-838-5391b34f] VC state for host host-1078304 (uninitialized -> retry), FDM state (UNKNOWN_FDM_HSTATE -> UNKNOWN_FDM_HSTATE), src of state (null -> null)
info vpxd[40440] [Originator@6876 sub=MoHost opID=lro-838-5391b34f] VC state for host host-805681 (uninitialized -> retry), FDM state (UNKNOWN_FDM_HSTATE -> UNKNOWN_FDM_HSTATE), src of state (null -> null)
info vpxd[40440] [Originator@6876 sub=MoHost opID=lro-838-5391b34f] VC state for host host-834347 (uninitialized -> retry), FDM state (UNKNOWN_FDM_HSTATE -> UNKNOWN_FDM_HSTATE), src of state (null -> null)
info vpxd[40440] [Originator@6876 sub=MoHost opID=lro-838-5391b34f] VC state for host host-1068172 (uninitialized -> retry), FDM state (UNKNOWN_FDM_HSTATE -> UNKNOWN_FDM_HSTATE), src of state (null -> null)
info vpxd[40440] [Originator@6876 sub=MoHost opID=lro-838-5391b34f] VC state for host host-834350 (uninitialized -> retry), FDM state (UNKNOWN_FDM_HSTATE -> UNKNOWN_FDM_HSTATE), src of state (null -> null)
info vpxd[40440] [Originator@6876 sub=MoHost opID=lro-838-5391b34f] VC state for host host-834344 (uninitialized -> retry), FDM state (UNKNOWN_FDM_HSTATE -> UNKNOWN_FDM_HSTATE), src of state (null -> null)
info vpxd[39505] [Originator@6876 sub=MoHost opID=lro-849-6b585da3] VC state for host host-145375 (retry -> uninitialized), FDM state (UNKNOWN_FDM_HSTATE -> UNKNOWN_FDM_HSTATE), src of state (null -> null)
4. The vpxd-profiler.log, located at /var/log/vmware/vpxd, shows entries similar to the following:
warning vpxd[08461] [Originator@6876 sub=DAS opID=lro-1145282-333db184] Failed to get task ########-####-####-####-########d926:com.vmware.esx.settings.clusters.software.ha_internal status due to error: com.vmware.vapi.std.errors.unauthenticated.Retrying it again with new session.
warning vpxd[06837] [Originator@6876 sub=DAS opID=lro-1168331-684c3cd0] Failed to get task ########-####-####-####-########34b1:com.vmware.esx.settings.clusters.software.ha_internal status due to error: com.vmware.vapi.std.errors.unauthenticated.Retrying it again with new session.
warning vpxd[08678] [Originator@6876 sub=DAS opID=lro-1191620-10ab0b74] Failed to get task ########-####-####-####-########1e01:com.vmware.esx.settings.clusters.software.ha_internal status due to error: com.vmware.vapi.std.errors.unauthenticated.Retrying it again with new session.
warning vpxd[08607] [Originator@6876 sub=DAS opID=lro-1216414-24723b00] Failed to get task ########-####-####-####-########71e9:com.vmware.esx.settings.clusters.software.ha_internal status due to error: com.vmware.vapi.std.errors.unauthenticated.Retrying it again with new session.
warning vpxd[08678] [Originator@6876 sub=DAS opID=lro-1191620-10ab0b74] Failed to get task ########-####-####-####-########1e01:com.vmware.esx.settings.clusters.software.ha_internal status due to error: com.vmware.vapi.std.errors.unauthenticated.Retrying it again with new session.
warning vpxd[08607] [Originator@6876 sub=DAS opID=lro-1216414-24723b00] Failed to get task ########-####-####-####-########71e9:com.vmware.esx.settings.clusters.software.ha_internal status due to error: com.vmware.vapi.std.errors.unauthenticated.Retrying it again with new session
You will also see the waitForUpdates task in vpxd-profiler.log:
ThreadState/ThreadId/48605/State/RPC::vmodl.query.PropertyCollector:propertyCollector::<hostname>::vmodl.query.PropertyCollector.waitForUpdates
5. On the ESXi host, hostd.log, located at /var/run/log, shows entries similar to the following:
info hostd[2102049] [Originator@6876 sub=SysCommandPosix opID=lxd8ztio-95309-auto-21ji-h5:700120##-##-##-##-##02 user=vpxuser] ForkExec(/sbin/chkconfig) 2259639
info hostd[2102049] [Originator@6876 sub=Vimsvc.TaskManager opID=lxd8ztio-95309-auto-21ji-h5:700120##-##-##-##-##02 user=vpxuser] Task Completed : haTask-ha-host-vim.host.ServiceSystem.updatePolicy-476801 Status success
info hostd[2102035] [Originator@6876 sub=Vimsvc.TaskManager opID=lxd8ztio-95309-auto-21ji-h5:700120##-##-##-##-##07 user=vpxuser] Task Created : haTask-ha-host-vim.host.ServiceSystem.stop-476802
error hostd[2102035] [Originator@6876 sub=Libs opID=lxd8ztio-95309-auto-21ji-h5:700120##-##-##-##-##07 user=vpxuser] ServiceAgent: Command(/etc/init.d/vmware-fdm ++group=hostd-tmp status) exited with status 256
info hostd[2102035] [Originator@6876 sub=Vimsvc.TaskManager opID=lxd8ztio-95309-auto-21ji-h5:700120##-##-##-##-##07 user=vpxuser] Task Completed : haTask-ha-host-vim.host.ServiceSystem.stop-476802 Status success
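The log signatures listed above can be checked for with a short script. The following is a minimal sketch, not a supported tool: the regular expressions are derived only from the sample entries in this article, and the function name `summarize` and the keys of the returned dictionary are hypothetical names chosen for illustration. It counts hosts whose VC state flips between uninitialized and retry, the unauthenticated errors on the ha_internal task, and the non-zero exits of the vmware-fdm status command.

```python
import re
from collections import Counter

# Patterns derived from the sample log entries in this article (assumption:
# real logs keep the same wording; a timestamp prefix does not affect search()).
VC_STATE = re.compile(r"VC state for host (host-\d+) \((\w+) -> (\w+)\)")
HA_TASK_UNAUTH = re.compile(
    r"Failed to get task \S+ha_internal status due to error: "
    r"com\.vmware\.vapi\.std\.errors\.unauthenticated"
)
FDM_STATUS_FAIL = re.compile(
    r"Command\(/etc/init\.d/vmware-fdm \S+ status\) exited with status (\d+)"
)


def summarize(lines):
    """Count the three symptom signatures in an iterable of log lines."""
    flapping = Counter()   # host id -> number of uninitialized/retry transitions
    unauthenticated = 0    # ha_internal task status failures
    fdm_failures = 0       # vmware-fdm status commands with a non-zero exit

    for line in lines:
        state = VC_STATE.search(line)
        if state:
            host, old, new = state.groups()
            # Only count a real flip between the two states, in either direction.
            if {old, new} == {"uninitialized", "retry"}:
                flapping[host] += 1
            continue
        if HA_TASK_UNAUTH.search(line):
            unauthenticated += 1
            continue
        fdm = FDM_STATUS_FAIL.search(line)
        if fdm and fdm.group(1) != "0":
            fdm_failures += 1

    return {
        "flapping_hosts": dict(flapping),
        "unauthenticated_ha_task_errors": unauthenticated,
        "fdm_status_failures": fdm_failures,
    }
```

To use it, pass an open file object, for example `summarize(open("/var/log/vmware/vpxd/vpxd.log", errors="replace"))` on the vCenter Server, and run it separately against a copy of hostd.log from the ESXi host. Matches only suggest that the symptoms in this article are present; they do not confirm the cause.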
VMware vCenter Server 7.0.x
VMware vSphere ESXi 7.0.x
After an ESXi update, a race condition might cause the image compliance check to take longer than usual, which can cause the vSphere HA reconfiguration to time out after the update.
This issue is resolved in VMware vCenter 7.0 U3q. To download it, go to the VCF Products download page.
To work around this issue, manually place the host in maintenance mode, exit maintenance mode, and then configure HA.
NOTE: Apply the workaround before updating the vCenter Server to the fixed version.
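The workaround is described as a manual procedure. If it has to be repeated across many hosts, the same sequence could be scripted against the vSphere API. The sketch below is an assumption about how that might look, not a procedure from this article: the function name `cycle_maintenance_and_reconfigure_ha` and the `wait_for_task` parameter are hypothetical, and `host` is assumed to be a HostSystem object (for example from pyVmomi) that exposes `EnterMaintenanceMode_Task`, `ExitMaintenanceMode_Task`, and `ReconfigureHostForDAS_Task`.

```python
def cycle_maintenance_and_reconfigure_ha(host, wait_for_task, timeout_s=0):
    """Enter and exit maintenance mode on one host, then reconfigure HA on it.

    host          -- assumed to be a vSphere HostSystem managed object
    wait_for_task -- callable that blocks until the given task finishes
                     (hypothetical parameter; supplied by the caller)
    timeout_s     -- timeout in seconds passed to the maintenance mode tasks;
                     0 means no timeout
    """
    # Step 1: enter maintenance mode. Running VMs must be migrated or powered
    # off for this to complete, exactly as with the manual procedure.
    wait_for_task(host.EnterMaintenanceMode_Task(timeout=timeout_s))

    # Step 2: exit maintenance mode.
    wait_for_task(host.ExitMaintenanceMode_Task(timeout=timeout_s))

    # Step 3: reconfigure vSphere HA on the host.
    wait_for_task(host.ReconfigureHostForDAS_Task())
```

With pyVmomi, `WaitForTask` from `pyVim.task` could be passed as `wait_for_task`. Test any such script on a single non-critical host first, because entering maintenance mode affects the virtual machines running on that host.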
Refer to the VMware vCenter 7.0 U3q release notes:
PR 3279536: vSphere High Availability (HA) reconfiguration might time out due to a delay in image compliance check tasks.
After an ESXi update, a race condition might cause the image compliance check to take longer than usual and lead to issues such as time out of the vSphere HA reconfiguration after the update.
This issue is resolved in this release.