In a vCenter ESXi host cluster, a virtual machine (VM) running on one ESXi host is unable to reach a network interface of a physical machine outside the ESXi host. However, when that VM is migrated to another ESXi host in the cluster, it is able to reach the interface.
Note: sometimes people reference an ESXi host that is in a cluster as a "node" of that cluster.
These symptoms may also appear on a non-clustered host.
You may see alerts on vCenter for the problematic host:
You find that the VM in question cannot ping out from the problematic host.
VMware vSphere ESXi
If packets supplied to the stack by the guest VM at capture point VnicTx are delivered completely and in a timely way (microseconds) to the physical infrastructure as seen at capture point UplinkSndKernel, and if packets received by the stack from the physical infrastructure as seen at capture point UplinkRcvKernel are delivered completely and in a timely way (microseconds) to the guest VM at capture point VnicRx, then the root cause of the symptom is not in the ESXi networking stack.
The next step is to investigate the guest VM's operating system and/or internal networking stack, and/or the physical infrastructure external to the vmnic (uplink) on the ESXi host.
If packets are seen at one end of the ESXi networking stack, but not the other, perform packet captures at the intermediate capture points as described in KB 341568 Packet capture on ESXi using the pktcap-uw tool, to determine where the packet blockage is occurring.
net-stats -l
pktcap-uw -- switchport <PortNum> -- capture VnicTx, VnicRx -o - | tepdump-uw -r - -enn
esxcfg-vswitch -l
esxtop, Identify in the "Pnic" column, which uplink (vmnic) is actually carrying the traffic in question.esxtop --> nThe
esxtop --> n output will either show "vmnic#" (where # --> the vmnic number) or "all(#)" (where # is the number of uplinks configured in an LACP LAG or Etherchannel configuration)all(#)" then you must capture on EACH of the uplinks simultaneously, because the physical switches to which the uplinks are connected determine which data path is used. pktcap-uw -- uplink <vmnic> -- capture UplinkSndKernel, UplinkRcvKernel -o - | tcpdump-uw -r - -enn | grep <VM MacAddress> | grep - i arp
Review Packet Capture results to confirm if traffic is flowing or not through the uplink to the physical switch.
If you see traffic flowing from the VM and being delivered to the physical network but no responses from the physical network, then the issue is occurring outside of the virtual network and the physical network configuration should be investigated to identify the problem.