NSX standby relocation is not working as expected.
search cancel

NSX standby relocation is not working as expected.

book

Article ID: 411683

calendar_today

Updated On:

Products

VMware NSX

Issue/Introduction

  • Standby relocation is feature enabled in the environment.
  • Placing the active/standby edges into maintenance mode or both nodes becoming unavailable, a network outage can occur.

Environment

VMware NSX.

Cause

This behavior is expected.

A Tier-1 Gateway's high availability (HA) configuration is designed to protect against a single point of failure. When both the Edge nodes are unavailable, it causes the Tier 1 to go down.

Resolution

If the Edges were placed in NSX maintenance mode:

To immediately restore network services, at least one of the Edge nodes must be taken out of maintenance mode.

  • In the NSX Manager UI, navigate to  System, Fabric, Edge Cluster and identify the Edge nodes that were placed in maintenance mode.
  • Select one of the nodes used by Teir1 gateway.
  • Exit maintenance mode for that selected node.

If the Edge nodes are down and there are other Edge nodes available in the Edge cluster:

  • Go to NSX manager UI, navigate to networking, Tier 1 gateways ,Edit and disable auto allocate Edges
  • Select the available Edge nodes as active and standby Edge node.

Additional Information

Standby Edge relocation does not work when any Edge node is placed in NSX maintenance mode since it is an administrative task and not a failure occurred on NSX Edge node.

To avoid multiple Edge node failures, Failure domain can be configured. For more information refer: Configuring failure domain