/var/log/vmware/vcf/domainmanager/domainmanager.log indicates that the 'get bgp neighbor' command failed to return any BGP neighbors for an Edge node.false[YYYY-MM-DDTHH:MM:SS] DEBUG [74125e06972b5c5d,ff90] [c.v.e.s.common.util.SSHCommandUtil,dm-exec-1] Executed command "vrf 1; get bgp neighbor | json; exit" against x.y.x.xx:22. Exit status: 4, output: %[YYYY-MM-DDTHH:MM:SS] ERROR [74125e06972b5c5d,ff90] [c.v.e.s.o.model.error.ErrorFactory,dm-exec-1] [Reference Token] FAILED_TO_CONNECT_TO_NSXT_EDGE Failed to connect to NSX-T edge.com.vmware.evo.sddc.orchestrator.exceptions.OrchTaskException: Failed to connect to NSX-T edge. at com.vmware.vcf.common.fsm.plugins.nsxt.action.VerifyBgpPeeringAction.execute(VerifyBgpPeeringAction.java:95) at com.vmware.vcf.common.fsm.plugins.nsxt.action.VerifyBgpPeeringAction.execute(VerifyBgpPeeringAction.java:32)SDDC manager gives a message in the GUI that looks like the following:
Check BGP Neighbor Connection state for Edge Neighbors[Tier0BgpNeighborInfo(nsxtId=x.x.xx.xx_12345, neighborIp=x.x.xx.sy, remoteAsn=12345, maximumHopLimit=1, sourceIps=[x.x.x.yy, x.x.x.yx]), Tier0BgpNeighborInfo(nsxtId=x.x.x.gh_12345, neighborIp=x.x.xx.yy, remoteAsn=12344, maximumHopLimit=1, sourceIps=[x.x.x.yy, x.x.x.yx])]' failed to complete after 3 retries. InternalServerError (com.vmware.vapi.std.errors.internal_server_error) => { messages = [], data = struct => {error_message=Internal server error has occurred., details=Request processing failed; nested exception is com.vmware.nsx.management.policy.policyframework.exceptions.ObjectNotFoundException: Enforcement point connection info not found, error_code=99, module_name=common-services}, errorType = INTERNAL_SERVER_ERROR }
To most users, this appears to be an error message coming back from NSX-T to SDDC manager, but it may be an issue within the SDDC manager.
Example of the domainmanager.log output:
Caused by: java.lang.RuntimeException: Retriable operation 'Check BGP Neighbor Connection state for Edge Neighbors[Tier0BgpNeighborInfo(nsxtId=null, neighborIp=xx.xx.xx.xx, remoteAsn=15001, sourceIps=[xx.xx.xx.xx]), Tier0BgpNeighborInfo(nsxtId=null, neighborIp=xx.xx.xx.xx, remoteAsn=15001, sourceIps=[xx.xx.xx.xx])]' failed to complete after 3 retries. at com.vmware.evo.sddc.common.util.RetriableCallable.call(RetriableCallable.java:196) at com.vmware.vcf.common.fsm.plugins.nsxt.policy.action.VerifyBgpPeeringNsxApiAction.execute(VerifyBgpPeeringNsxApiAction.java:132) ... 17 common frames omittedTimeStamp DEBUG [vcf_dm,f9f2f70fc9174e70,4fb2] [c.v.e.s.o.c.ProcessingTaskSubscriber,dm-exec-20] Collected the following errors for task with name VerifyBgpPeeringNsxApiAction and ID Example ID: [ExecutionError [errorCode=null,VMware Cloud Foundation 4.x
VMware Cloud Foundation 5.x
This is a known issue with NSX-T 3.x and NSX-T 4.x and VMware Cloud Foundation.
Workaround:
To workaround this issue, either:
To enable and disable maintenance-mode, navigate through the NSX-T UI
If the workaround doesn't work, log a ticket with GSS through the Broadcom portal and reference this KB.