Alarms are seen in the NSX Manager reporting the BGP has flapped, but the issue is self resolved.
The flaps happen at irregular intervals with no apparent cause.
The flaps are isolated to a single Edge node and BGP neighbor, and other nodes with the same BGP neighbor have no issues.
The issue occurs when the Edge node is on different ESXi hosts, ruling out an underlying ESXi issue.
In the /var/log/syslog file of the impacted Edge node you see logs like the following:
YYYY-MM-DDTHH:MM:SS.sssZ edge NSX ##### - [nsx@#### comp="nsx-edge" s2comp="nsx-monitoring" entId="########-####-####-####-############" tid="10525" level="ERROR" eventState="On" eventFeatureName="routing" eventSev="error" eventType="bgp_down"] In Router ########-####-####-####-############, BGP neighbor ########-####-####-####-############ (###.###.###.###) is down. Reason: Edge is not ready.
VMware NSX
Reboot the NSX Edge node that reports the flapping in a maintenance window.
If the flapping continues, please open a support case with Broadcom about this issue and provide the following:
Please see Creating and managing Broadcom support cases for assistance in creating a support case.