Failure domain down alarm

Article ID: 368289

Products

VMware NSX

Issue/Introduction

Title: Failure domain down
Event ID: edge_health.failure_domain_down

Alarm Description

  • Purpose: Informs the user that the Edge node has lost connectivity to the manager and controller.
  • Impact: The Edge node has no connectivity to the manager and controller.

Environment

VMware NSX-T Data Center
VMware NSX

Resolution

Steps to resolve
For 3.2.0 and higher

Recommended Action:

  1. On the Edge node identified by {transport_node_id}, check connectivity to the management and control planes by invoking the NSX CLI commands `get managers` and `get controllers`.
  2. Invoke the NSX CLI command `get interface eth0` to check the status of the management interface.
  3. Invoke the NSX CLI command `get services` to check the status of core services such as dataplane, local-controller, nestdb, and router.
  4. Inspect the /var/log/syslog file to find messages related to manager/controller connectivity.
  5. Reboot the Edge node.
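
The syslog inspection in step 4 can be narrowed with a small filter. The following is a minimal Python sketch, not an NSX-provided tool: the function names `filter_connectivity_lines` and `scan_syslog` are hypothetical, and the keywords `manager` and `controller` are an assumption about what the relevant messages contain. Adjust them to match the messages your NSX version actually logs.

```python
import re
from typing import Iterable, List

# Assumed search terms (not taken from NSX documentation); edit as needed.
KEYWORDS = ("manager", "controller")
PATTERN = re.compile("|".join(re.escape(k) for k in KEYWORDS), re.IGNORECASE)


def filter_connectivity_lines(lines: Iterable[str]) -> List[str]:
    """Return the lines that mention any of the assumed keywords (case-insensitive)."""
    return [line.rstrip("\n") for line in lines if PATTERN.search(line)]


def scan_syslog(path: str = "/var/log/syslog") -> List[str]:
    """Read a syslog file (for example, a copy taken from the Edge node) and filter it."""
    # errors="replace" avoids a crash on bytes that are not valid text.
    with open(path, "r", errors="replace") as fh:
        return filter_connectivity_lines(fh)
```

To use it, copy the syslog file from the Edge node to a workstation and call `scan_syslog("path/to/copied/syslog")`, then print the returned lines. A plain keyword match will also return unrelated lines that happen to contain these words, so treat the result as a starting point for review.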

Maintenance window required for remediation? Yes

Additional Information

NSX-MGR-CONNECT

Similar issues:

"Failure Domain Down" alarm is reported even though edge node is healthy

 

If you need to file a Broadcom support request, be sure to include the output of the commands noted in the Resolution section of this article.