ESXi hosts running NSX Transformers may experience a Purple Screen of Death (PSOD) due to a corruption in the dlmalloc memory allocator.
Symptoms:
Those with the issue should see log entries similar to the following:
The PSOD is caused by a corruption in the dlmalloc memory allocator, which is triggered by a race condition in the NSX Transformers FP Infra component. The issue occurs when the Ens_UpdateUplinkLinkState function drops the switch lock to notify the uplink layer, and the subsequent Ens_HandleUplinkReset function tries to access an already freed uplink FCPort pointer.
1. Upgrade NSX Transformers to version 4.1.2.4 or later, which includes a fix for this issue:
a. Download the latest version of NSX Transformers from the official VMware website.
b. Follow the upgrade instructions provided in the NSX Transformers documentation.
c. Verify that all ESXi hosts are running the updated version of NSX Transformers.
2. If upgrading is not immediately feasible, apply the following workaround:
a. Disable the net-stats feature in NSX Transformers.
b. Contact VMware Support for further assistance in implementing the workaround.
The Purple Screen of Death (PSOD) occurs due to a memory corruption issue in the NSX Transformers software. The problem is caused by a timing issue between two internal functions, which may lead to one function trying to access memory that has already been freed by another function. This situation can cause the ESXi host to crash and display the PSOD error.
The fix for this issue involves making changes to how the affected functions handle memory and validate pointers to prevent the corruption from occurring. Upgrading to NSX Transformers 4.1.2.4 or later ensures that the necessary fixes are in place to prevent this specific PSOD from happening.
If you are experiencing this issue and cannot immediately upgrade, a temporary workaround is available by disabling the net-stats feature in NSX Transformers. For further assistance with implementing the workaround, please contact VMware Support.