After a power outage and subsequent restoration, ESXi hosts experience network connectivity failures. Port channels between ESXi hosts and physical switches fail to establish proper Link Aggregation Control Protocol (LACP) negotiation, resulting in degraded or complete loss of network connectivity.
The issue manifests when physical network switches complete their boot sequence before ESXi hosts during power restoration. Network tunnels show as down, workload connectivity is disrupted, and vMotion operations fail with timeout errors.
Symptoms observed:
To verify port channel status and confirm this issue, use Testing VMkernel network connectivity with the vmkping command. If vmkping tests show connectivity through one physical path but not the other, or no connectivity when redundant paths should exist, this maintenance procedure is required.
Physical network switches initialize faster than ESXi hosts during power restoration. The switch initializes port channel configuration but does not receive LACP negotiation from the host side because ESXi network services are still initializing. This timing mismatch leaves the port channel in an inconsistent state where the switch does not properly recognize the aggregated links.
The port channel remains in this state because:
Administratively disable and re-enable affected port channels on the physical switch to force LACP renegotiation:
Note: This procedure causes a brief network disruption to the affected ESXi host. During outage recovery scenarios, this is the fastest resolution method and avoids host reboots.
To prevent recurrence:
If the error persists after following these steps, contact Broadcom Support for further assistance.
When opening a support request with Broadcom for this issue, provide: