NSX Edge in Failed state

Article ID: 426244

Updated On:

Products

VMware NSX

Issue/Introduction

  • The NSX Edge configuration state shows as failed with the error message below, although the NSX UI still shows Manager and Controller connectivity.

"failure_message": " Host configuration: Caught MessagingException during host config stage. [TN=TransportNode/########################]. Reason: MessagingException",
"failure_code": 8804
 
  • This issue follows a recent storage outage that caused the Edge nodes to enter a read-only state; the nodes transitioned to the failed state after a reboot.

  • The node state, retrieved with the API call GET /api/v1/transport-nodes/<TN-UUID>/state, reports failure code 8804.
     

    API Output:

    {
        "transport_node_id": "###########################",
        "maintenance_mode_state": "DISABLED",
        "node_deployment_state": {
            "state": "NODE_READY",
            "details": [],
            "failure_message": "",
            "failure_code": 0
        },
        "state": "failed",
        "details": [
            {
                "sub_system_id": "##############################",
                "sub_system_type": "Host",
                "state": "failed",
                "failure_message": " Host configuration: Caught MessagingException during host config stage. [TN=TransportNode/###########################]. Reason: MessagingException",
                "failure_code": 8804
            }
        ],
        "failure_message": "Host configuration failed. Number of retries : 10. Next retry attempt will be between [timestamp]",
        "failure_code": 8804
    }
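To check several Edge nodes for this condition, the state response above can be read programmatically. The following is a minimal sketch, not an official tool: the helper names `fetch_state` and `summarize_state` are hypothetical, and the use of HTTP basic authentication and a trusted Manager certificate are assumptions about the environment. Only the endpoint path and the field names are taken from the output shown above.

```python
import base64
import json
import ssl
import urllib.request


def fetch_state(manager: str, tn_uuid: str, user: str, password: str) -> dict:
    """Call GET /api/v1/transport-nodes/<TN-UUID>/state and return the parsed JSON.

    Assumes basic authentication is enabled and the Manager certificate is trusted.
    """
    url = f"https://{manager}/api/v1/transport-nodes/{tn_uuid}/state"
    token = base64.b64encode(f"{user}:{password}".encode()).decode()
    request = urllib.request.Request(url, headers={"Authorization": f"Basic {token}"})
    context = ssl.create_default_context()
    with urllib.request.urlopen(request, context=context, timeout=30) as response:
        return json.load(response)


def summarize_state(payload: dict) -> dict:
    """Reduce a transport node state response to the fields relevant to this issue."""
    failed = [
        {
            "type": detail.get("sub_system_type"),
            "code": detail.get("failure_code"),
            "message": (detail.get("failure_message") or "").strip(),
        }
        for detail in payload.get("details", [])
        if detail.get("state") == "failed"
    ]
    return {
        "state": payload.get("state"),
        "failure_code": payload.get("failure_code"),
        "deployment_state": payload.get("node_deployment_state", {}).get("state"),
        "failed_subsystems": failed,
    }
```

A node affected by this issue would be expected to show `state` as `failed` with `failure_code` 8804 while `deployment_state` remains `NODE_READY`, matching the output above.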

Environment

VMware NSX

Cause

The Edge nodes entered read-only mode following a storage outage. A subsequent reboot left the configuration held by the management plane out of sync with the Edge node's local configuration.

Resolution

Synchronize the Edge configuration by following these steps:

  1. Log in to the NSX Manager user interface with administrative privileges.
  2. Navigate to the System tab on the top navigation bar. Select Fabric from the left-side menu and click on Nodes.
  3. Click on the Edge Transport Nodes tab to view the list of available edge appliances. Locate the specific node reporting the failed configuration state.
  4. Select the checkbox next to the affected edge transport node. Click on the Actions drop-down menu located above the grid.
  5. Choose the Sync Edge Configuration option from the list. If prompted, confirm the action by clicking Sync.
  6. Monitor the Configuration State column. The status should transition from "In Progress" to "Success" once the synchronization is complete.
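
The monitoring in the last step can also be scripted against the state API instead of watching the UI. The sketch below is illustrative only: `wait_for_state` is a hypothetical helper, and the API value `success` is assumed to be the counterpart of the `failed` value shown in the state output earlier in this article. The caller supplies any function that returns the parsed state response.

```python
import time
from typing import Callable


def wait_for_state(
    get_state: Callable[[], dict],
    target: str = "success",  # assumed API value for the "Success" status in the UI
    interval: float = 15.0,
    timeout: float = 600.0,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Poll the node state until it equals `target` or `timeout` seconds have been waited.

    Returns the last state seen, so the caller can tell success from a timeout.
    """
    waited = 0.0
    while True:
        state = get_state().get("state", "unknown")
        if state == target or waited >= timeout:
            return state
        sleep(interval)
        waited += interval
```

If the function returns anything other than the target value, the synchronization did not complete within the timeout and the node state should be inspected again.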