After the NSX Manager component has completed its upgrade to target version, users may notice that:
Subsequent NSX Precheck operations fail or
Remaining NSX Component upgrades (Host / Edge or Finalize) cannot be started and remain in PAUSED state.
To narrow down and identify the specific issue, perform the following actions:
Trigger the following GET API call on all nodes in the NSX Manager Cluster:
https://{{nsx-mp-ip}}/api/v1/upgrade/status-summary
Note: On a cluster with 3 Manager nodes, this call needs to be invoked on the address of each of the 3 nodes.
Verify that the "overall_upgrade_status" is in "IN_PROGRESS" state on exactly one of the Management Plane nodes
Verify that the "overall_upgrade_status" is in "PAUSED" state on the other Management Plane nodes.
VMware NSX 9.1
This issue could be seen when the upgrade-coordinator (UC) service startup takes longer than expected (about 2 - 3 minutes) during the NSX Manager component upgrade phase. It can manifest as a precheck failure or NSX component upgrade being stuck
SSH (as root) to the NSX Manager node showing "overall_upgrade_status" as "IN_PROGRESS"
Restart the upgrade-coordinator service on the node with the following command:
service upgrade-coordinator restart (from admin cli)
or
systemctl restart upgrade-coordinator (from root cli)
Wait 10 minutes.
SSH to all 3 NSX Manager nodes and trigger the following GET API call on all nodes:
GET https://{{nsx-mp-ip}}/api/v1/upgrade/status-summary
Ensure that they all show the same state of "overall_upgrade_status" (PAUSED).