Getting the message "Dropped due to ARP failure" when testing routing in Traceflow
search cancel

Getting the message "Dropped due to ARP failure" when testing routing in Traceflow

book

Article ID: 413576

calendar_today

Updated On:

Products

VMware NSX

Issue/Introduction

  • You are testing North-South connectivity and Traceflow shows "Dropped due to ARP failure," despite BGP peering being healthy and routes present in the forwarding table.

  • Traceflow analysis reveals packets are "Dropped due to ARP failure" before reaching the transit-bp interface on the ESXi. This indicates a drop within the ESXi's prior reaching the edge node/s.

    Below is a sample screenshot from Traceflow:



  • ESXi logs show vmk10 and/or vmk11 (tunnel endpoints) are configured for DHCP but are assigned IP addresses from range 169.254.x.x (APIPA) due to a lack of DHCPOFFERs.

    Note: 169.254.x.x can be configured to be used as TEP IP's, however in this case that IP range is not being used purposefully and is not expected.

    Log-location: /var/run/log/dhclient.log (grep with 'No DHCPOFFERS received') on ESXi transport node.

    Below are the sample logs for reference :

    <Timestamp> In(30) dhclient-uw[20####1]: No DHCPOFFERS received.
    <Timestamp> In(30) dhclient-uw[20####1]: No DHCPOFFERS received.

  • Log-location: /var/run/log/nsx-syslog.log (grep with 'nsx-exporter') on ESXi transport node.

    NSX exporter logs show "Overlay tunnel summary -- up: 0, down: 0, unknown: 0" and "EVPN tunnel summary -- up: 0, down: 0, unknown: 0" on the ESXi, confirming no tunnels are established.

Environment

VMware NSX
VMware NSX-T Datacenter

Cause

The network unreachability stems from a failure in establishing tunnels between the ESXi hosts and the Edge nodes.
This is directly caused by the TEP vmk interfaces (vmk10 and vmk11) failing to obtain valid IP addresses via DHCP, instead auto-assigning APIPA addresses, thus preventing proper tunnel initialization.

Resolution

If the APIPA addresses are unexpected and are not configured purposefully, please investigate and resolve the DHCP service availability/reachability for vmk10 and vmk11 on the ESXi hosts.

This includes:

  • Verifying DHCP server reachability from the hosts.
  • Ensuring the IP pool has sufficient free IP addresses for these vmkernel interfaces.

    Once vmk10 and vmk11 receive valid IP addresses, the NSX tunnels should establish, restoring connectivity between ESXi TEPs and edge TEPs.

Note: If the issue persists after resolving TEP connectivity issues, feel free to open a case with Broadcom Support Team for further troubleshooting