            CSR
          |     |
         BGP   BGP
          |     |
      node1 --isr-- node2
      Active       Active
VMware NSX-T Datacenter
VMware NSX
Suppose each edge has two VTEPs: X1 and X2 on one edge, Y1 and Y2 on the other. Two tunnels are created for HA: (X1, Y1) and (X2, Y2). These two tunnels are excluded when evaluating the "All Tunnels Down" scenario, i.e. the node is not marked down if these are the only tunnels on the edge and both of them are down.
However, if the logical topology includes a DR and an overlay segment (for example, a T0-LR has a transit logical switch between the T0-SR and the T0-DR), the tunnels (X1, Y2) and (X2, Y1) may also be created in addition to the two HA tunnels. Tunnels driven by L2 span are selected by a hash, so they may reuse tunnel (X1, Y1) or (X2, Y2), but they may also use a different tunnel, (X1, Y2) or (X2, Y1).
When edge-2 exits maintenance mode, the new TEP tunnels (X1, Y2) or (X2, Y1) may be formed, and these tunnels are not added to the excluded list. Because they are down at the moment the edge comes up, the "All Tunnels Down" condition is met and the node is marked down immediately. Routing is then marked down, and BGP is marked down along with it.
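The exclusion logic described above can be illustrated with a small model. This is only a sketch of the behavior as described in this article, not NSX code: the function names `ha_tunnels` and `all_tunnels_down`, and the representation of a tunnel as a (local TEP, remote TEP) pair, are assumptions made for illustration.

```python
def ha_tunnels(local_teps, remote_teps):
    """Pair TEPs index by index, e.g. (X1, Y1) and (X2, Y2).

    Hypothetical helper: models the HA tunnels that are excluded
    from the "All Tunnels Down" evaluation.
    """
    return set(zip(local_teps, remote_teps))


def all_tunnels_down(tunnel_state, excluded):
    """Return True when the modeled "All Tunnels Down" condition fires.

    tunnel_state maps (local_tep, remote_tep) -> True if the tunnel is up.
    Excluded tunnels are ignored. If no other tunnel exists, the
    condition does not fire.
    """
    considered = [up for t, up in tunnel_state.items() if t not in excluded]
    if not considered:
        return False
    return not any(considered)


excluded = ha_tunnels(["X1", "X2"], ["Y1", "Y2"])

# Case 1: only the two HA tunnels exist and both are down.
only_ha = {tunnel: False for tunnel in excluded}

# Case 2: an L2-span tunnel hashed onto (X1, Y2), still down
# while the edge is coming up. It is not in the excluded list.
with_cross = dict(only_ha)
with_cross[("X1", "Y2")] = False

# Case 3: single VTEP per edge. Only (X1, Y1) can exist, and it is excluded.
single_excluded = ha_tunnels(["X1"], ["Y1"])
single_vtep = {("X1", "Y1"): False}

print(all_tunnels_down(only_ha, excluded))             # False
print(all_tunnels_down(with_cross, excluded))          # True
print(all_tunnels_down(single_vtep, single_excluded))  # False
```

In this model, the second case is the failure described here: one non-excluded tunnel that is down at startup is enough to mark the node down. The third case shows why a single VTEP per edge avoids the condition, since no cross tunnel can exist.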
Workaround (either of the following):
1. Use only a single VTEP on the edge.
2. Add some VMs to the downlink segments.
This issue is resolved in VMware NSX 4.2.0 available at Broadcom downloads.
If you are having difficulty finding and downloading software, please review the Download Broadcom products and software KB.
If you use a workaround, VMs can be added to a downlink segment at any time without a maintenance window.
If you reduce multiple VTEPs to a single VTEP, traffic may be impacted for a few seconds up to about a minute, depending on the scale.
Otherwise, a complete fix requires an upgrade to 4.2.0 or later.