NSX installation failed on the newly commissioned hosts, stuck at 48%
search cancel

NSX installation failed on the newly commissioned hosts, stuck at 48%

book

Article ID: 431552

calendar_today

Updated On:

Products

VMware NSX

Issue/Introduction

  • The management cluster attempted to expand with new hosts.
  • The host commission workflow has been completed.
  • NSX installation stuck at 48%, waiting for connection to Managers.
  • Get managers and get controllers' output is blank/empty.

Connection to ports 1234 & 1235 is successful.

nsxcli.log

2026-02-20T12:07:56.003Z 2135841 cli.server.cli_command_service INFO {0} CMD: join management-plane nsx-mgr-01 thumbprint ######################## token <token-obfuscated> node-uuid ########-68ea-####-bbff-########
2026-02-20T12:07:56.536Z 2135841 cli.commands.host_shared.register INFO version 9.0.0 buildnum 24755229
2026-02-20T12:07:56.537Z 2135841 cli.commands.host_shared.register INFO Tokenfile is not given
2026-02-20T12:07:56.539Z 2135841 cli.utils.cert_utils INFO Generating certificate using make_cert.py
2026-02-20T12:07:56.539Z 2135841 vmware.runcommand INFO runcommand called with: args = ['/opt/vmware/nsx-common/python/nsx_utils/make_cert.py', 'openssl', 'req', '-new', '-newkey', 'rsa:2048', '-days', '3650', '-nodes', '-x509', '-generatedby', 'TN', '-keyout', '/tmp/tmppyak5nta', '-out', '/tmp/tmpa422oodh', '-config'
, '/tmp/tmptrskmlei', '-extensions', 'req_ext'], outfile = None, returnoutput = True, timeout = 0.0.
2026-02-20T12:08:42.676Z 2135841 cli.utils.apiclient ERROR POST /api/v1/fabric/nodes/########-68ea-####-bbff-########?action=register_node raised exception: <class 'socket.gaierror'>
Traceback (most recent call last):
  File "/opt/vmware/nsx-cli/bin/python/cli/utils/apiclient.py", line 90, in request
    conn.connect()
  File "lib64/python3.11/python3.11/http/client.py", line 1468, in connect
  File "lib64/python3.11/python3.11/http/client.py", line 962, in connect
  File "lib64/python3.11/python3.11/socket.py", line 839, in create_connection
  File "lib64/python3.11/python3.11/socket.py", line 974, in getaddrinfo
socket.gaierror: [Errno -3] Temporary failure in name resolution

  • nsxcli log reports that the join management workflow fails with 'temporary failure in name resolution.'
  • nslookup from the host to the NSX manager fails.

    [root@esx:~] nslookup ##.##.##.##
    ;; connection timed out; no servers could be reached

Environment

VMware NSX.

Cause

DNS resolution failure from the host to the NSX manager.

Resolution

 

  • Make sure the host has the respective allowed firewall rule for the manager subnet.

Sample:-

Before
esxcli network firewall ruleset allowedip list  | grep -i dns
 dns                            

 esxcli network firewall ruleset allowedip add -r dns -i ##.##.##.##/##

After

esxcli network firewall ruleset allowedip list  | grep -i dns
dns                           ##.##.##.##/##