Operation Timed out
or HA Agent Unreachable
./var/log/fdm.log
file, we see entries similar to:YYYY-MM-DDTHH:MM:SS.341Z Db(167) Fdm[2676185]: [Originator@6876 sub=Cluster opID=WorkQueue-308315c0] IP xx.xx.xx.xx marked bad for reason Unreachable IP
YYYY-MM-DDTHH:MM:SS.341Z In(166) Fdm[2676185]: [Originator@6876 sub=Message opID=WorkQueue-308315c0] Destroying connection
YYYY-MM-DDTHH:MM:SS.341Z Wa(164) Fdm[2676185]: [Originator@6876 sub=VpxProfiler opID=WorkQueue-308315c0] WorkQueue [TotalTime] took 40014 ms
YYYY-MM-DDTHH:MM:SST17:52:53.437Z Er(163) Fdm[2676185]: [Originator@6876 sub=Default] SSL Async Handshake Timeout : Read timeout after approximately 25000ms. Closing stream SSL(<io_obj p:0x000000d48c8eb8e0, h:29, <TCP 'xx.xx.xx.xx : 51525'>, <TCP 'xx.xx.xx.xx : 8182'>>)
YYYY-MM-DDTHH:MM:SST17:52:53.438Z Wa(164) Fdm[2676192]: [Originator@6876 sub=IO.Connection opID=WorkQueue-308315c0] Failed to SSL handshake; SSL(<io_obj p:0x000000d48c8eb8e0, h:-1, <TCP 'xx.xx.xx.xx : 51525'>, <TCP 'xx.xx.xx.xx : 8182'>>), e: 125(Operation canceled), duration: 24102msec
YYYY-MM-DDTHH:MM:SS.879Z Db(167) Fdm[2676323]: [Originator@6876 sub=HTTP.HTTPService] HTTP Response: Auto-completing at 118/118 bytes; <<io_obj p:xxxxxxxxxxxxxxxxxxx, h:26, <TCP '127.0.0.1 : 9089'>, <TCP '127.0.0.1 : 42794'>>, xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx>
YYYY-MM-DDTHH:MM:SS.879Z Db(167) Fdm[2676323]: [Originator@6876 sub=SOAP] Responded to service state request; <<io_obj p:xxxxxxxxxxxxxxxxxxx, h:26, <TCP '127.0.0.1 : 9089'>, <TCP '127.0.0.1 : 42794'>>, /fdm/service>
YYYY-MM-DDTHH:MM:SS.882Z Er(163) Fdm[2676185]: [Originator@6876 sub=Vmomi opID=m0crhe7k-38536-auto-tqh-h5:70007578-a-DasRetryMgrPeriodic-74700c55-342fe3c9-cb] Caught exception while sending activation result; <<xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx, <TCP '127.0.0.1 : 9089'>, <TCP '127.0.0.1 : 41157'>>, fdmServi
ce, csi.FdmService.GetDebugManager, <csi.version.version1, official, 1.0>, <<io_obj p:xxxxxxxxxxxxxxxxxxx, h:26, <TCP '127.0.0.1 : 9089'>, <TCP '127.0.0.1 : 42794'>>, xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx>>, N5Vmomi5Fault11SystemError9ExceptionE(Fault cause: vmodl.fault.SystemError
[40340B70 error 'Message' opID=SWI-28480d93] [MsgConnectionImpl::FinishSSLConnect] Error N7Vmacore3Ssl18SSLVerifyExceptionE(SSL Exception: Verification parameters:
--> PeerThumbprint: xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx
--> ExpectedThumbprint: xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx
--> ExpectedPeerName: host-xxxxx
--> The remote host certificate has these problems:
-->
--> * Host name does not match the subject name(s) in certificate.
-->
--> * unable to get local issuer certificate) on handshake
[40340B70 warning 'Election' opID=SWI-28480d93] [MasterVerificationInfo::ConnectComplete] Failed to connect to master host-xxxxx
[40340B70 verbose 'Election' opID=SWI-28480d93] [ClusterElection::AddInvalidMaster] Added invalid master host-xxxxx
[40340B70 warning 'Election' opID=SWI-28480d93] [ClusterElection::UpdateInvalidMasterCountMap] Host host-xxxxx has been declared invalid 9 times
[40340B70 info 'Message' opID=SWI-28480d93] Destroying connection
[FFF45B70 verbose 'Cluster' opID=SWI-6058ed8] [ClusterManagerImpl::IsBadIP] xx.xx.xx.xx is bad ip
[FFF45B70 verbose 'Cluster' opID=SWI-6058ed8] [ClusterManagerImpl::IsBadIP] xx.xx.xx.xx is bad ip
This issue can occur due to any of the below:
Follow the steps according to the scenarios below:
Scenario 1: If the MTU is mismatching on the Hosts, then follow the steps below:
Note: First make a note of the MTU settings from the network device configurations.
Scenario 2: If the subnet mask is different than the actual subnet mask then update the same following the steps below:
Scenario 3: If the FQDN is incorrect than the actual DNS records, then update the same following the steps below:
Scenario 4:
a) For "The host name does not match the Subject Name(s) in certificate"
error, but the ESXi host is installed with self-signed certificates, then regenerate new self-signed certificates by following the steps below:
rui.crt
& rui.key
located at /etc/vmware/ssl
, by running the below commands:mv /etc/vmware/ssl/rui.crt rui.cert.old
mv /etc/vmware/ssl/rui.key rui.key.old
/sbin/generate-certificates
services.sh restart
b) For "The host name does not match the Subject Name(s) in certificate"
error, but the ESXi host is installed with custom certificates, then add custom certificate on the ESXi host by following the KB: Adding Custom Certificate on ESXi hosts through CLI
Scenario 5: If the root password of the ESXi host are changed.