There could be many reasons for the ESXi Host to NSX controller connectivity to be down. Some examples are: nsx-proxy service is down, certificate issues, network issues, and upgrades modifying some config files.
VMware NSX-T 4.x and later
To help identify the root cause of the issue, there is a run book available on the ESXi host and NSX Edge.
[root@esxi:~] nsxcliesxi.fqdn>nsxedge>esxi.fqdn> set service nsx-ods running-level internalRunning level: internalThe ODS Runbooks come in two flavors depending on the intended users:
"get runbook"start invocation runbook nsx_proxy"esxi.fqdn> start invocation runbook nsx_proxyRunbook Invocation Report
Invocation ID : ########-####-####-####-############Timestamp : DATE TIMESystem Info Host Name : esxi.fqdn OS Name : VMkernel OS Version : 8.0.3 Arch : x86_64Runbook Info Runbook ID : nsx_proxy Version : 2.1 Publisher : Broadcom Inc.Report Type : VALIDConclusion : No connectivity issues has been detected for Controller channel and Appliance channel.Recommendation : No action needs to be taken.Artifact Bundle : <none>Steps
Step Number : 1 Step Action : Fetch nsx-proxy service status. Step Result : Service nsx-proxy is up
Step Number : 2 Step Action : Fetch nsx-opsagent service status. Step Result : Service nsx-opsagent is up
Step Number : 3 Step Action : Check whether Maintenance mode is enabled on Transport Node. Step Result : Maintenance mode checks performed, no issues detected on this transport node.
Step Number : 4 Step Action : Check whether Appliance nodes settings are empty or duplicate. Step Result : No issues has been detected for Appliance channel, total Appliance channel nodes present in configuration file are: 3.
Step Number : 5 Step Action : Check whether Controller nodes settings are empty or duplicate. Step Result : No issues has been detected for Controller channel, total Controller channel nodes present in configuration file are: 3.
Step Number : 6 Step Action : Perform ping and port status check for all Appliance channel nodes. Step Result : Ping and port test are successful for all Appliance channel nodes.
Step Number : 7 Step Action : Perform ping and port status check for all Controller channel nodes. Step Result : Ping and port test are successful for all Controller channel nodes.
Step Number : 8 Step Action : Perform certification validation for all Controller channel nodes. Step Result : Controller channel nodes certification validation passed.
Step Number : 9 Step Action : Perform certification validation for all Appliance channel nodes. Step Result : Appliance channel nodes certification validation passed.
Step Number : 10 Step Action : Perform CRL validation for all Controller channel nodes. Step Result : No issues detected for Controller channel certificate revocation list validation.
Step Number : 11 Step Action : Perform CRL validation for all appliance channel nodes. Step Result : No issues detected for Appliance channel certificate revocation list validation.
esxi.fqdn> set service nsx-ods running-level externalRunning level: external