HCX - Communication failure over NE HA extended network post infra outage
search cancel

HCX - Communication failure over NE HA extended network post infra outage

book

Article ID: 378050

calendar_today

Updated On:

Products

VMware HCX

Issue/Introduction

  • After a Network Extension (NE) High Availability (HA) failover due to an infrastructure outage, VMs are not reachable between the sites on one of the extended networks.
  • The HCX tunnels between the sites remain up and operational.
  • Post-NE HA failover, one of the NE Appliance’s vNICs, associated with the affected extended network, is not passing traffic.
  • Other extended networks on the same NE Appliance are working fine.
  • The following log messages are observed in /var/log/messages log of the affected NE Appliance after it becomes the active node (the vNIC number may vary):
 cgw 1316 - - [Err-Tasker] : cmd (/usr/sbin/ip link set dev vNic_1 up) done, error: Timeout
 cgw 1316 - - [Err-configer] : cmd(execCmd<ip link set dev vNic_1 up>) failed: /usr/sbin/ip link set dev vNic_1 up: Timeout


 

Environment

HCX 4.9.x and earlier

Resolution

This issue is resolved in VMware HCX version 4.10, available at Broadcom downloads.

Workaround:  

Re-extend the affected network.

  1. Remove affected network extension
  2. Extend affected network