Logs:
Edge frr.log
:30.725986 ZEBRA: zebra_ptm_handle_bfd_msg: Recv Port [uplink-885] bfd status [Down] vrf [default] peer [xx:xx:xx:xx] local [xx:xx:xx:xx]
1/1/1 01:01:01 ZEBRA: MESSAGE: ZEBRA_INTERFACE_BFD_DEST_UPDATE xx:xx:xx:xx/32 on uplink-885 Down event
1/1/1 01:01:01 BGP: [xx:xx:xx:xx]: BFD Down
....
1/1/1 01:01:01 BGP: [xx:xx:xx:xx]: BFD Up
1/1/1 01:01:01 BGP: BFD status for peer xx:xx:xx:xx changed from Down -> Up
1/1/1 01:01:01 BGP: xx:xx:xx:xx [FSM] Timer (start timer expire).

Edge syslog
BGP Down:
ventFeatureName="routing" eventSev="error" eventType="bgp_down"] In Router d304f6f0-1e1c-4ad8-b372-116c91da3b55, BGP neighbor ########-f5a2-4a29-b6bc-############ (xx:xx:xx:xx) is down, reason: Network or config error.
...
BGP UP.
1/1/1 01:01:01 nsxedge-01.local NSX 4928 - [nsx@6876 comp="nsx-edge" s2comp="nsx-monitoring" entId="########-f5a2-4a29-b6bc-############" tid="4965" level="ERROR" eventState="Off" eventFeatureName="routing" eventSev="error" eventType="bgp_down"] Context report: {"entity_id":"########-f5a2-4a29-b6bc-############","sr_id":"########-3785-45ae-be9a-############","lr_id":"########-1e1c-4ad8-b372-############","bgp_neighbor_ip":"xx:xx:xx:xx","failure_reason":"BGP Established"}
1/1/1 01:01:01 nsxedge-01.local bgpd 11191 - - %ADJCHANGE: neighbor xx:xx:xx:xx(Unknown) in vrf default Up

vmware.log for vmotion:
1/1/1 01:01:01| vmx| I125: MigrateVMXdrToSpec: type: 1 srcIp=<xx:xx:xx:xx> dstIp=<xx:xx:xx:xx> mid=1647b15e666ece27 uuid=########-0031-4d10-8037-############ priority=yes checksumMemory=no maxDowntime=0 encrypted=0 resumeDuringPageIn=no latencyAware=yes diskOpFile= srcLogIp=<<unknown>> dstLogIp=<<unknown>> ftPrimaryIp=<<unknown>> ftSecondaryIp=<<unknown>>
1/1/1 01:01:01| vmx| I125: MigrateSetInfo: state=8 srcIp=<xx:xx:xx:xx> dstIp=<xx:xx:xx:xx> mid=1605446811184451111 uuid=########-0031-4d10-8037-############ priority=high
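To check whether a saved copy of the Edge frr.log shows the same Down/Up pattern, the BFD lines can be scanned with a short script. The following is only a minimal sketch: the function names are hypothetical, and the regular expression assumes the "BGP: [peer]: BFD Down" / "BFD Up" line format seen in the frr.log excerpt. Other FRR versions may log differently.

```python
import re

# Assumed format, taken from the frr.log excerpt: "BGP: [<peer>]: BFD Down|Up"
BFD_EVENT = re.compile(r"BGP: \[(?P<peer>[^\]]+)\]: BFD (?P<state>Down|Up)")


def bfd_events(lines):
    """Return (peer, state) tuples in the order they appear in the log."""
    events = []
    for line in lines:
        match = BFD_EVENT.search(line)
        if match:
            events.append((match.group("peer"), match.group("state")))
    return events


def flapped_peers(events):
    """Return peers that logged a BFD Down followed later by a BFD Up."""
    seen_down = set()
    flapped = set()
    for peer, state in events:
        if state == "Down":
            seen_down.add(peer)
        elif peer in seen_down:
            flapped.add(peer)
    return flapped
```

Passing the lines of the log file to bfd_events() and the result to flapped_peers() lists the neighbors whose BFD session went down and recovered, which can then be compared against the vMotion time in vmware.log.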
VMware NSX-T Data Center
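This is expected behavior. When Edge VMs are vMotioned and the BFD timers are aggressive, the brief interruption during the vMotion can exceed the BFD detection time. BFD then reports the neighbor as down, and a BGP flap can occur.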
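The relationship between BFD timers and a flap can be illustrated with simple arithmetic. This is a simplified sketch that assumes the detection time is the BFD interval multiplied by the declare-dead multiple. The function names and the timer and pause values used with them are illustrative only and are not taken from this article.

```python
def bfd_detection_time_ms(interval_ms, multiplier):
    """Simplified BFD detection time: interval multiplied by the multiple."""
    return interval_ms * multiplier


def may_flap(interval_ms, multiplier, pause_ms):
    """True if a traffic pause lasts at least as long as the detection time."""
    return pause_ms >= bfd_detection_time_ms(interval_ms, multiplier)
```

With a hypothetical 2000 ms pause, may_flap(500, 3, 2000) is True because the detection time is 1500 ms, while may_flap(1000, 3, 2000) is False because the detection time is 3000 ms.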
Workaround:
Recommendation 1:
Note: Edge VMs should not be vMotioned during normal operations. Do so only when necessary (for example, during host upgrades). As a best practice, vMotion Edge VMs only during a maintenance window.
Recommendation 2: