When pinging between two VMs that are on different ESXi hosts but in the same segment, some intermittent disconnects occur.
NSX 4.1.2.3
This issue can occur when the NSX Managers have long uptimes and can cause the control plane to have some syncing issues.
To determine if the TEP/MAC table is experiencing issues with syncing with accurate data the below process can be attempted:
esxtop and then type the letter "n" to gain access to the networking view. pktcap-uw --uplink vmnic# --capture UplinkSndKernel -o - |tcpdump-uw -enr- | grep <desination IP>
vmnic# with the correct vmnic in use by the source VM and the <destination IP> is replaced with the destination VMs IP address (no need to include the <> in the command).pktcap-uw --uplink vmnic# --capture UplinkRcvKernel -o - |tcpdump-uw -enr- | grep <source IP>
vmnic# with the correct vmnic in use by the source VM and the <source IP> is replaced with the source VMs IP address (no need to include the <> in the command).ICMP echo requests going out and on the destination VM packet capture there should be ICMP echo requests coming in as the packet capture commands being ran are only outgoing from the source VM and only incoming to the destination VM.Once its confirmed that there is a TEP/MAC table issue the following process can be performed to resolve the issue:
opsagent, proxy, and cfgagent on the effected hosts by running the following commands
/etc/init.d/nsx-opsagent restart/etc/init.d/nsx-proxy restart/etc/init.d/nsx-cfgagent restartTo open a support case with Broadcom Support please refer to Creating and managing Broadcom support cases.
Please be sure to provide the following data: