Install Failure/ Validation error: While preparing an ESXi host as a Transport Node, the following errors are seen:
Node <node-name> with same ip <###.###.###.###> already exists
Or
Error: General error has occurred. Discovered node with id:<Discovered Node ID:host-###> is already prepared having fabric node id:<Transport Node UUID>.
Or
Node <node_name> with same ip <ip_address> already exists. Check the state of existing transport node. If it is a failure in deletion then use force option to delete previous transport node and retry the operation.
Note: node_name is the last part of the resource path as found when you click the hamburger menu on the host in System, Fabric, Host and select Copy path to Clipboard. For example here it is #######-########-9c7e-436b-8660-############host-## from this path is /infra/sites/default/enforcement-points/default/host-transport-nodes/########-########-9c7e-436b-8660-############host-##.
Note: Transport Node UUID is the value retrieved when you click the hamburger menu on the host in System, Fabric, Host and select Copy ID to Clipboard. For above, it would be ########-9c7e-436b-8660-############
Upgrade Failure: during host upgrade the following error is seen:
Failed to get Host status for upgrade unit <Upgrade Unit ID> due to error Transport node <Transport Node UUID> not found
An ESXi host was removed from vCenter without first removing it from NSX.
The Host is not visible under the Host and Cluster section of the UI: System, Fabric, Hosts, Clusters or Standalone.
An upgrade pre-check may fail and can cause the upgrade process to pause.
Running either of the below Policy or Manager GET API's, using the <Transport Node UUID> or <node_name> from above errors, reveal the following results:
'/api/v1/transport-nodes/<Transport Node UUID>/state''/policy/api/v1/infra/sites/default/enforcement-points/default/host-transport-nodes/<node_name>/state'"node_deployment_state" : {
"state" : "failed","details" : [ {"sub_system_id" : "########-######-######-######-############","state" : "failed","failure_message" : "Failed to uninstall the software on host. Host OS version not found.\n","failure_code" : 26020} ]},"deployment_progress_state" : {"progress" : 40,"current_step_title" : "Removing NSX bits"
Note: A transport node is a host prepared with NSX VIB's.
VMware NSX 9.0.0
VMware NSX 4.x
VMware NSX-T Data Center 3.x
When an ESXi host is removed directly from vCenter without first removing NSX, it can result in entries for that host to remain in the NSX database.
The correct procedure to remove NSX from a single host:
Note: This is also referred to as a "Stale Host" and will prevent adding back a refreshed or new host using the same name.
This issue has been resolved in NSX 4.2.3 and 9.0.1, available at Broadcom downloads.
If you are having difficulty finding and downloading software, please review the Download Broadcom products and software KB.
Environments may have reached this state by following different steps, therefore, there are a number of possible workaround options that need to be tried.
Install Failure Prerequisite
Before proceeding with the following options, move the host that has failed to install/upgrade out of the vSphere cluster and make it a standalone host in vSphere.
Then proceed to go through the below Options in order from 1 to 4.
Upgrade Failure Prerequisite
No action required, proceed through the below Options in order 1 to 4.
To move the host out of the vSphere cluster:
If the Host is not present in the NSX UI, under System, Fabric, Hosts, in either Clusters/Other Nodes/Standalone, please proceed to Option 2. If you have already completed Option 2 and and retrying Option 1 after reindexing and the issue is still not resolved, please proceed to Option 3.
Option 2
In some cases, the host may not appear in the NSX UI due to search indexing failures.
On all three NSX manager nodes, log in as the admin user and run the following three commands. Ensure the resync/re-indexing is completed before running the second command, refer KB NSX Manager UI displays notification "Failed to fetch System details. Please contact the administrator. Error: null (Error code: 513002)" for more information.
start search resync policystart search resync manager
start search resync telemetry
After you run the above commands, please allow some time for the reindexing to complete, this depends on the size of the environment, please allow at least 10 minutes.
Note: During the period of reindexing, you may notice the NSX UI will display an error in relation to the indexing and indicates to try again later; this is expected, due to the indexing occurring.
If the host is still listed, please refresh indexing using the following commands:
start search resync all
Please allow some time for re-indexing to complete. Depending on the size of the environment, this may take some time. During the period of reindexing, NSX UI will show indexing notifications.
Once the reindexing is complete, after at least 10 minutes, go back to Option 1 and follow the steps there again.
Option 3 using Manager API call
If you have completed Option 1 and Option 2 and the host still does not appear on the NSX UI, to allow removal, the following API steps can be used to remove the transport node.
Run the following API call:
GET https://<NSX Mgr IP>/api/v1/transport-nodes/<Transport Node UUID>/state. Note: Replace <Transport Node UUID> with the Transport Node UUID, as reported in the error message (see Issue/Introduction section).
If the state value in the API response is not "Object Not found" then proceed to Option 3 Step 3.
Note: The state value should be "Object not found" when the host is successfully removed.
DELETE https://<NSX Mgr IP>/api/v1/transport-nodes/<Transport Node UUID>?force=true&unprepare_host=false
<Transport Node UUID> with the Transport Node UUID, as reported in the error message (see Issue/Introduction section). Object Not found" is returned.Once GET API returns "Object Not found", move the host back into the original cluster to prepare it for NSX. If a Transport Node Profile is applied, host preparation should start automatically. Otherwise, proceed and prepare the transport node as before.
Option 4 using Policy API call
If completing Option 3 did not clear the transport node, you can use the following:
GET https://<NSX Mgr IP>/policy/api/v1/infra/sites/default/enforcement-points/default/host-transport-nodes/<node_name>?force=true&unprepare_host=false<node_name> with the node ID you received in the error (see Issue/Introduction section). If the state value in the API response is not "Object Not found" then proceed to Option 4 Step 3.
Note: The state value should be object not found when the host is successfully removed.
DELETE https://<NSX Mgr IP>/policy/api/v1/infra/sites/default/enforcement-points/default/host-transport-nodes/<node_name>?force=true&unprepare_host=false Note: Replace <node_name> with the node ID you received in the error (see Issue/Introduction section).
Wait five minutes, then run the GET transport node state command again, as seen in Option 4 Step 1 periodically until "Object Not found" is returned.
Once GET API returns "Object Not found", move the host back into the original cluster to prepare it for NSX. If a Transport Node Profile is applied, host preparation should start automatically. Otherwise, proceed and prepare the transport node as before.
Security Only Clusters
If the cluster is Security Only, prepared using vLCM or has Service Insertion installed, it is not possible to detach the transport node profile.
In such circumstances, to uninstall NSX, either remove NSX from the whole cluster or move the single host out of the NSX prepared cluster to the datacenter level in vCenter.
Once the host is no longer part of an NSX prepared cluster NSX can be removed using the NSX GUI.
Note: If using curl to run the API, if the API has '?' in the URL, then enclose the full URL in quotes.
Note: In all above API calls, replace <NSX Mgr IP> with the IP address or FQDN of an NSX manager node.
If none of the options have resolved the issue, please collect the information outlined in the Additional Information section below and open a technical support case with Broadcom Support for further investigation. and refer to this KB article.
For more information, see Creating and managing Broadcom support cases.
If you are contacting Broadcom support about this issue, in order to aid a timely response and resolution, please provide the following:
Handling Log Bundles for offline review with Broadcom support
Related Installation documentation:
Related issue: