Symptoms:
VMware NSX-T Data Center 2.x
VMware NSX-T Data Center
This issue is caused by a memory leak that occurs during vMotion and is associated with the DFW FQDN data structure.
As a result, memory remains allocated to the vsip module even after the ESXi host has entered maintenance mode, which prevents the older version of the NSX-T software from being removed.
This issue is resolved in NSX-T Data Center 3.0.
Upgrades from NSX-T Data Center 3.0 and later are not affected by this issue.
Workaround:
ESXi hosts can be configured to reboot automatically as part of the upgrade process to avoid failures.
OR
For hosts that have already failed