NSX-T Data Center 3.2.2.* has been upgraded to NSX 4.0.x or 4.1.x or 4.2.x
The upgrade completed successfully, but within the UI nothing will index.
/var/log/search/opensearch.log2024-12-21T18:13:34.004Z ERROR SearchHealthCheck SearchServiceWatchDog 77041 - Insx@6876 comp="nsx-manager" errorCode="MP60524" level="ERROR" subcomp="manager"] [Search: Wat A chDog] Could not connect to OpenSearch java.net.ConnectException: Connection refusedat org.opensearch.client.RestClient.extractAndWrapCause (RestClient.java:954) ~[?:?]at org.opensearch.client.RestClient.performRequest (RestClient.java:333) ~[?:?]at org.opensearch.client.RestClient.performRequest (RestClient.java:321) ~[?:?]at org.opensearch.client.RestHighLevelClient.internalPerformRequest (RestHighLevelClient.java:1918) ~[?:?]at org.opensearch.client.RestHighLevelClient.performRequest (RestHighLevelClient.java:1901) ~[?:?]at org.opensearch.client.RestHighLevelClient.ping(RestHighLevelClient.java:688) ~[?:?]at com.vmware.nsx.management.search.manager.SearchServiceWatchDog.isOpenSearchHealthy (SearchServiceWatchDog.java:185) ~[?:?]at com.vmware.nsx.management.search. manager. SearchServiceWatchDog.run(SearchServiceWatchDog.java:85) ~[?:?]at java.util.concurrent.Executors$RunnableAdapter.call(Unknown Source) ~[?:?]at java.util.concurrent.FutureTask.runAndReset(Unknown Source) ~[?:?1at java.util.concurrent. ScheduledThreadPoolExecutor$ beduledFutureTask.run(Unknown Source) ~[?:?]at java.util.concurrent.ThreadPoolExecutor.runWorker (Unknown Source) ~[?:?]at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source) ~[?:?]at com.vmware.nsx.util.concurrent.Executors$MeteredRunnable.run(Executors.java:353) Insx-util.jar:?]at java.lang.Thread.run(Unknown Source) [?:?1Caused by: java.net.ConnectException: Connection refusedat sun.nio.ch.SocketChannelImpl.checkConnect (Native Method) ~[?:?]at sun.nio.ch.SocketChannelImpl.finishConnect (Unknown Source) ~[?:?]at org.apache.http.impl.nio.reactor.DefaultConnectingIOReactor.processEvent (DefaultConnectingIOReactor.java:171) ~[?:?]atPoolingNHttpClientConnectionManager.execute(PoolingNHttpClientConnectionManager.java:221) ~[?:?]at org.apache.http.impl.nio.reactor.DefaultConnectingIOReactor.processEvents(DefaultConnectingIOReactor.java:145) ~[?:?] at org.apache.http.impl.nio.reactor. Abstract Multiworker IOReactor.execute(Abstract Multiworker IÕReactor.java:351) ~[?:?] org.apache.http.impl.nio.conn. at org.apache.http.impl.nio.client.CloseableHttpAsyncClientBase$1.run(CloseableHttpAsyncClientBase.java:64) ~[?:?1 1 more
VMware NSX-T Data Center
VMware NSX
During the upgrade process Elasticsearch JVM isn't able to stop in a timely manner and is eventually terminated after the upgrade process tries to start Opensearch. It prevents an update to user permissions for the /nonconfig/search folder from elasticsearch to nsx-search user which causes Opensearch to fail upon start.
Workaround:
This workaround needs to be applied on all NSX Managers in the impacted cluster:
/nonconfig/search:cd /nonconfig/searchls -l# ls -ldrwxr-x--- 3 elasticsearch elasticsearch 4096 Apr 25 2023 nodesdrwxr-xr-x 2 nsx-search nsx-search 4096 Jan 4 13:13 tmp/nonconfig/search to owner and group "nsx-search" on all NSX Managers in the cluster:chown -R nsx-search:nsx-search nodesstart search resync recovery (if your on version 4.2.0 or newer)
start search resync all