When installing the NSX on an older Host where NSX install was tried before, you will see on the NSX UI --> System --> Fabric ---> Hosts section, NSX install fails with an error "Preparation failed" or "Validation Errors" on that host:
From logs: /var/log/nsxapi.log:
2024-09-26T18:50:31.376Z WARN HostMovementOperationHandler-1-1 HostTnPrepServiceImpl 4792 FABRIC [nsx@6876 comp="nsx-manager" level="WARNING" subcomp="manager"] Error occurred during HostTn creation for discovered node ######-######-####
com.vmware.nsx.management.common.exceptions.OperationNotPermittedException: null
at com.vmware.nsx.management.service_fabric.hostnode.util.HostNodeUtil.verifyIPIsUnique(HostNodeUtil.java:172) ~[?:?]
at com.vmware.nsx.management.service_fabric.hostnode.util.HostNodeUtil.verifyIPIsUnique(HostNodeUtil.java:154) ~[?:?]
2024-09-26T18:50:31.376Z INFO HostMovementOperationHandler-1-1 TransportNodeProfileUtilImpl 4792 FABRIC [nsx@6876 comp="nsx-manager" level="INFO" subcomp="manager"] Record validation error in TNC for discovered node ######-#####-#####.:host-#### message : Node ######-#####-##### with same ip #.#.#.# already exists.
VMware NSX
- This issue is caused due to a stale entry for the host with same IP addresses persisted in NSX causing this Host to fail with validation errors
- The host was taken out of the cluster previously and NSX was not gracefully removed from the host causing a stale reference with same IP for this host.
Steps to Resolve:
1) To move this host out of the prepared cluster to datacenter level
2) In the NSX UI, check if you can find the impacted Host on the following pages:
System > Fabric > Hosts > Other hosts
3) ESXi host will be present here, select it and click Delete NSX and select Force Delete
4) Once the force delete is complete, verify in ESXi host to see if all the vibs are deleted
5) Lastly move the host back to the cluster and this should install NSX successfully