/var/log/nsx-syslog
on the ESXi host:2024-03-13T07:13:55.427Z nsx-opsagent[2102926]: NSX 2102926 - [nsx@6876 comp="nsx-esx" subcomp="opsagent" s2comp="nsxa" tid="2103675" level="WARNING"] VDRVMK_GetVMAC(<host switch UUID>) failed with code -1
{
"details": [
{
"failure_code": 8804,
"failure_message": " Host configuration: VDRVMK_GetVMAC(<host switch UUID>) failed with code -1; LogicalSwitch full-sync: LogicalSwitch full-sync realization query skipped.",
"state": "partial_success",
"sub_system_id": "<UUID>",
"sub_system_type": "Host"
}
],
After an ESXi host reboot, the nsxa app restarts and reconnects with the VDR. Part of the appInit process checks if the VDR MAC addresses have changed. In a rare occurrence, the VDRVMK_GetVMAC
can return with errno = ENOENT
which is treated as a failure and causes the host to enter the Partial Success state.
This issue is resolved in VMware NSX 3.2.4
This issue is resolved in VMware NSX 4.2.0
Workaround:
/etc/init.d/nsx-opsagent stop
/etc/init.d/nsx-opsagent start
/etc/init.d/nsx-opsagent status
On NSX 3.1 and previous releases:
/etc/init.d/nsxa stop
/etc/init.d/nsxa start
For verification if the nsx-opsagent is running, please use the below command
/etc/init.d/nsxa status
You can alternatively reboot the affected host.
The probability of re-occurrence is extremely low.
Impact/Risks:
If the ESXi host is taken out of Maintenance Mode in the Partial Success state, and VMs are then vMotioned to this host, those VMs will lose connectivity with machines on other networks.