In VMware NSX, Edges may show an "MPA Connectivity Down" status in the Manager UI. When investigating the Edge CLI or logs, you may observe that the Edge is attempting to connect to a manager using a DNS error message as a hostname.
edge01> get controllers Controller IP Port SSL Status Is Physical Master Session State Controller FQDN Failure Reason :: 1235 enabled disconnected true down ;; communications error to <DNS Server IP>#53: timed out OTHER_ERROR
edge01> get managers;; communications error to <DNS Server IP>#53: timed out Unable to resolve fqdn *Error to ssl://;; communications error to <DNS Server IP>#53: timed out:1235 ... Error 1-Host not found.
Note: The preceding log excerpts are only examples. Date, time, and environmental variables may vary depending on your environment.
VMware NSX
This issue occurs when the NSX Manager has "Publish FQDN" enabled but cannot reach its DNS servers. The resulting timeout error message is incorrectly ingested by the Edge as a valid Management Plane FQDN and saved into the Edge's local configuration which is not recoverable as the edge loses connectivity to the managers.
This is a known issue impacting VMware NSX.
To manually restore connectivity, you must remove the corrupted entries from the Edge appliance.
Log into the NSX Edge CLI as admin and then switch to root (or log in directly as root if enabled).
Navigate to the configuration directory: cd /config/vmware/edge/
Back up the current configuration: cp appliance-info.xml appliance-info.xml.bak
appliance-info.xml with a text editor (e.g., vi) and remove the values between <fqdn></fqdn> and <fqdnv6></fqdnv6> then save and close the fileor run the below sedsed -i '/^<fqdn/d' /config/vmware/edge/appliance-info.xml
Restart the NSX Proxy service to apply the changes and repopulate the values correctly.
/etc/init.d/nsx-proxy restart.
Ensure that your NSX Manager has a stable and reachable DNS configuration before re-enabling "Publish FQDN" to prevent a recurrence of this behavior.