After upgrading NSX-T Federation environment from version 3.1.x to 3.2.x, Remote Tunnel Endpoint (RTEP) status shows Degraded or Down in the NSX-T Manager UI. This issue is cosmetic and does NOT affect cross-site connectivity in NSX Federation deployments.
Error messages similar to the following may appear in the NSX-T Manager UI:
In the NSX-T Manager UI, when viewing the affected edge node's tunnels, they will appear with a red "Active" status indicator despite being functional, as shown in the Tunnels tab. This discrepancy between the active status and the red indicator reflects the degraded state of the RTEP tunnels.
The issue can be verified by checking the NSX Manager log located at /var/log/proton/nsxapi.log, where the following message may be observed:
YYYY-MM-DDTHH:MM:SS.201Z INFO l3-tasks1 EdgeClusterMeshUpdateTask 20324 ROUTING [nsx@6876 comp="nsx-manager" level="INFO" subcomp="manager"] Continue with full underlay mesh span RemoteMeshSpanForEc [isFullSpan=true, remoteEClusterIds=[########-####-####-####-####], getRevision()=49545, getIdentifier()=RemoteMeshSpanForEc/########-####-####-####-####]] for edge cluster ########-####-####-####-####]
This is a known issue that occurs when upgrading from NSX 3.1.x to 3.2.x in a Federated environment. The root cause is that one Site T1 gateway does not have the information about the remote site edge cluster ID after the upgrade.
This occurs because the configuration revision counter needs to be updated to trigger proper synchronization between the NSX Federation Global Manager and the Local Managers.
To resolve this issue, follow these steps to update all affected Tier-1 gateways:
Description: RTEP (Remote Tunnel Endpoint) BGP session from source IP ###.###.###.### to remote location example-location neighbor IP ###.###.###.### is down. View Runtime Details
GET https://<NSX_GLOBAL_MANAGER>/global-manager/api/v1/global-infra/tier-1s/
GET https://<NSX_GLOBAL_MANAGER>/global-manager/api/v1/global-infra/tier-1s/<TIER1_ID>/locale-services
PATCH https://<NSX_GLOBAL_MANAGER>/global-manager/api/v1/global-infra/tier-1s/<TIER1_ID>/locale-services/<LOCALE_SERVICE_ID>
curl -k -H "Content-Type:application/json" -u admin -X PATCH https://<NSX_GLOBAL_MANAGER>/global-manager/api/v1/global-infra/tier-1s/EXAMPLE-T1-01-ID/locale-services/example-locale-service-01 -d '{"display_name" : "example-locale-service-01"}'
{"display_name" : "example-locale-service-01"}
If the issue persists after following these steps, contact Broadcom Support for further assistance.
Please provide the following information when opening a support request with Broadcom for this issue:
/var/log/proton/nsxapi.logRefer to the following resources for creating a Broadcom case and uploading files to that case