Unexpected error while upgrading upgrade unit: Install of offline bundle failed on host <host-uuid> with error : VI SDK invoke exception:java.rmi.RemoteException: VI SDK invoke exception:org.dom4j.DocumentException. Please refer Recovering from an NSX-T In-place Upgrade Failure article for troubleshooting steps./var/log/syslog contains alerts such as:NSX 73274 - [nsx@6876 comp="nsx-manager" level="INFO" subcomp="ccp"] Connection closed received NettyConnection(NettyChannel(local=<ManagerNodeIP>:1235, remote=<TransportNodeIP>:45998), active=false)/var/run/log/vmkernel.log contains entries similar to:cpu33:223985481)Team.vswitch: TeamVSLACPLAGEventCB:9087: [nsx@6876 comp="nsx-esx" subcomp="vswitch"]Received event LAG DESTROY, LAG /0, link UNKNOWN, uplink /0x0, link UNKNOWNcpu34:223985789)kcp: KCPSHARegisterEvent:545: [nsx@6876 comp="nsx-esx" subcomp="kcp"]KCP_SHA register VMK_PORTSET_EVENT_LACP_LAG event successcpu6:223985481)Net: 2184: connected LACP_MgmtPort to null config, portID 0x4000032cpu6:223985481)Team.vswitch: TeamVSLACPLAGEventCB:9119: [nsx@6876 comp="nsx-esx" subcomp="vswitch"]Received event LAG CREATE, LAG /0, link UNKNOWN, uplink /0x0, link UNKNOWN...cpu6:223985481)Team.vswitch: TeamVSPolicySet:8123: [nsx@6876 comp="nsx-esx" subcomp="vswitch"]Invalid Uplink : <LAG-NAME>, ignore itcpu6:223985481)Team.vswitch: TeamVSPolicySet:8123: [nsx@6876 comp="nsx-esx" subcomp="vswitch"]Invalid Uplink : <LAG-NAME>, ignore itDue to a race condition, when using a lag, the lag may break during the NSX upgrade.
This can occur under the following conditions:
This issue is resolved in VMware NSX 4.1.2.4
This issue is resolved in VMware NSX 4.2.0
Workaround:
To prevent this issue from occurring, set the ESXi host to upgrade in "In-place" mode. In this mode, the ESXi host will not enter maintenance mode and will not trigger the script which leads to the race condition.
If the host is already upgraded when this is issue occurs, reboot the ESXi host to regain management connectivity.
Please note that In-place mode is not supported mode of upgrade for VMware Lifecycle Manager (vLCM) enabled clusters.