<DATE>T03:23:11.823Z INFO nsx-rpc:unix:///var/run/vmware/nsx-opsagent/alarms-provider-service.sock:user-executor-0 EventSource 87147 MONITORING [nsx@6876 comp="nsx-manager" level="INFO" subcomp="async-replicator"] EventSource: Sync triggered. featureName: federation, eventType: gm_to_lm_synchronization_warning, entityId: UUID, status: true, context: {"site_id":"SITE ID","remote_site_id":"UUID","site_name":"SITE_NAME","remote_site_name":"SITE_NAME","flow_identifier":"FlowIdentifier{role='Policy', nameSpace='LM_2_GM_ONBOARD_CONFIG'}","sync_issue_reason":"Remote site disconnected"}
<DATE>T11:59:08.285Z INFO ClusteringRpcServer-Leadership-Thread1 LeadershipRpcHandler 84807 - [nsx@6876 comp="global-manager" level="INFO" s2comp="leadership-rpc-handler" subcomp="cbm"] Renewing the leadership lease of group <GROUP_ID>, new lease LeadershipLease{serviceName=ArAlarmService, leaderId=<MANAGER_UUID_1>, leaseVersion=16####8, revocationCount=0, serviceWeight=1, serviceWeightCategory=SMALL, leaseId=<LEASE_ID>, relinquishInProgress=false}
<DATE>T12:00:41.312Z INFO ClusteringRpcServer-Leadership-Thread1 LeadershipRpcHandler 84807 - [nsx@6876 comp="global-manager" level="INFO" s2comp="leadership-rpc-handler" subcomp="cbm"] Renewing the leadership lease of group <GROUP_ID>, new lease LeadershipLease{serviceName=ArAlarmService, leaderId=<MANAGER_UUID_2>, leaseVersion=16####6, revocationCount=0, serviceWeight=1, serviceWeightCategory=SMALL, leaseId=<LEASE_ID>, relinquishInProgress=false}
VMware NSX 4.x
VMware NSX-T 3.x
After a synchronization alarm is validly triggered, if the leader of the ArAlarmService changes from one NSX Manager to another before the alarm resolves, the alarm cannot be cleared.
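The leader change described above shows up in the log as consecutive "Renewing the leadership lease" entries for ArAlarmService with different leaderId values. The following is a minimal sketch, not a supported tool, for spotting such a change in a log excerpt. The function name `find_leader_changes` and the regular expression are illustrative assumptions based only on the log format shown above; adjust them if your log lines differ.

```python
import re

# Assumed pattern, derived from the sample log lines in this article:
# "... Renewing the leadership lease of group <id>, new lease
#  LeadershipLease{serviceName=<name>, leaderId=<uuid>, ...}"
LEASE_RE = re.compile(
    r"Renewing the leadership lease of group .*?"
    r"serviceName=(?P<service>\w+), leaderId=(?P<leader>[^,]+),"
)


def find_leader_changes(lines, service="ArAlarmService"):
    """Return (old_leader, new_leader) pairs for each leader change seen."""
    changes = []
    previous = None
    for line in lines:
        # finditer also copes with two log entries fused onto one line.
        for match in LEASE_RE.finditer(line):
            if match.group("service") != service:
                continue
            leader = match.group("leader").strip()
            if previous is not None and leader != previous:
                changes.append((previous, leader))
            previous = leader
    return changes
```

To use it, pass the lines of a log file you have already collected, for example `find_leader_changes(open(path))`, where `path` points to your own copy of the log. An empty result means no ArAlarmService leader change was found in that excerpt.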
This issue is resolved in VMware NSX 4.2.0, available at Broadcom Downloads.
To resolve the alarm, restart the Async Replication service on the Local NSX Manager:
ArAlarmService 1 SMALL <node UUID> 1641194
# systemctl stop async-replicator-service
# systemctl start async-replicator-service
Note: You might also be able to clear this alarm by rebooting one of the Local Manager (LM) nodes.
Note: If you see this issue on a version later than 4.2.0, let the alarm remain active for 5 or more minutes, then apply the workaround above, collect both the Local Manager and Global Manager logs, and upload them to the support case.