Data plane outage during control plane outage

Article ID: 324869

Updated On:

Products

VMware NSX for vSphere

Issue/Introduction

In an environment where the DLR Control VM is deployed, either in HA mode or standalone, and dynamic routing is configured, you see these symptoms:
  • The data plane experiences an outage, and the ESXi hosts no longer have any dynamic routes.
  • When the DLR Control VM sends an explicit ‘leave’ message to the Controller, the dynamic routes are removed from the ESXi hosts. With no routes left on the hosts, the data plane experiences an outage in the North-South direction.
  • In the NSX Controller logs, you see entries similar to:

       

       Note: The preceding log excerpts are only examples. Date, time, and environmental variables may vary depending on your environment.



Environment

VMware NSX for vSphere 6.3.x
VMware NSX for vSphere 6.2.x

Cause

This issue occurs when the DLR Control VM is disconnected from the NSX Controller cluster (for example, by a shutdown or a switchover). The NSX Controller cluster then initiates a flush of the dynamic routes related to that DLR Control VM on all the ESXi hosts.
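
The behavior described above can be illustrated with a small conceptual model. This is only a sketch, not NSX code: the class `ToyController`, its methods, and the `retain_on_disconnect` flag are hypothetical names introduced for illustration. With the flag set to `False`, the model flushes a DLR's dynamic routes when its Control VM disconnects, as in the Cause above. With the flag set to `True`, it keeps them, which corresponds to the fixed behavior described under Resolution.

```python
from typing import Dict, List


class ToyController:
    """Hypothetical, simplified stand-in for a controller cluster (not NSX code)."""

    def __init__(self, retain_on_disconnect: bool) -> None:
        # False models the affected behavior, True models the fixed behavior.
        self.retain_on_disconnect = retain_on_disconnect
        # Dynamic routes pushed to the hosts, keyed by a hypothetical DLR identifier.
        self.host_routes: Dict[str, List[str]] = {}

    def learn(self, dlr_id: str, prefix: str) -> None:
        """Record a dynamically learned route for the given DLR."""
        self.host_routes.setdefault(dlr_id, []).append(prefix)

    def on_control_vm_disconnect(self, dlr_id: str) -> None:
        """Handle a Control VM disconnect (for example, shutdown or switchover)."""
        if not self.retain_on_disconnect:
            # Affected behavior: flush every dynamic route for this DLR.
            self.host_routes[dlr_id] = []

    def routes(self, dlr_id: str) -> List[str]:
        """Return a copy of the routes the hosts currently hold for this DLR."""
        return list(self.host_routes.get(dlr_id, []))
```

In this model, an empty result from `routes()` after a disconnect stands for the "no route" situation on the hosts that causes the North-South outage.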

Resolution

This issue is resolved in:
  • VMware NSX for vSphere 6.3.5 and later versions.
  • VMware NSX for vSphere 6.4.0 and later versions.
Note: After the fix, the NSX Controller cluster keeps the routes even if the DLR Control VM is disconnected from the Controller or a switchover to the standby DLR Control VM occurs.
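
As a quick way to check a deployment against the fixed versions listed above, the comparison can be sketched as follows. This is a minimal example that assumes the version is available as a dotted numeric string such as "6.3.4"; the function names `parse_version` and `has_route_retention_fix` are hypothetical and are not part of any NSX tool or API.

```python
from typing import Tuple


def parse_version(version: str) -> Tuple[int, ...]:
    """Turn a dotted numeric version string such as '6.3.5' into a tuple of ints."""
    return tuple(int(part) for part in version.strip().split("."))


def has_route_retention_fix(version: str) -> bool:
    """Return True for 6.3.5 and later, which also covers 6.4.0 and later.

    Tuple comparison is element by element, so (6, 4, 0) compares as
    greater than (6, 3, 5), and any 6.2.x version compares as lower.
    """
    return parse_version(version) >= (6, 3, 5)
```

For example, `has_route_retention_fix("6.2.8")` and `has_route_retention_fix("6.3.4")` evaluate to `False`, while `has_route_retention_fix("6.3.5")` and `has_route_retention_fix("6.4.0")` evaluate to `True`.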

Additional Information

Impact/Risks:
A control plane outage can affect the data plane by removing dynamically learned routes from the DLR instance.