VMware NSX-T Data Center 3.0
During restart of an appliance "search" service didn't have enough CPU resources to finish initialization within expected time threshold which caused "*_metadata" indices to fail to store latest indexing position. It caused further out of sync issue after services completed initialization.
This issue has been fixed in NSX 4.2. The following workarounds are available:
Execute the following recovery instructions on each NSX node.
1. Stop all services which use "search" service
service phonehome-coordinator stop
service idps-reporting-service stop
service proton stop
service nsx-policy-manager stop
2. Remove stale *_metadata indices (it's runtime data and safe to delete)
curl -XDELETE 'http://localhost:9200/manager_metadata/'
curl -XDELETE 'http://localhost:9200/policy_metadata/'
curl -XDELETE 'http://localhost:9200/security_data_service_metadata/'
curl -XDELETE 'http://localhost:9200/monitoring_metadata/'
3. Restart "search" service to make sure it is fully initialized
service search restart
grep " started" /var/log/search/elasticsearch.log
4. Start phonehome-coordinator service
service phonehome-coordinator start
grep "Complete Indexing is finished" /var/log/phonehome-coordinator/phonehome-coordinator.log
5. Start idps service
service idps-reporting-service start
grep "Complete Indexing is finished" /var/log/idps-reporting/idps.log
6. Start proton service
service proton start
grep "Complete Indexing done" /var/log/search/search-manager.log
7. Start policy service
service nsx-policy-manager start
grep "Complete Indexing done" /var/log/search/search-policy.log
8. Verify UI is fully functional