Symptoms:
HCX Network Extension (NE) appliance services (ZMQ/CGW) crash and restart.
Log pattern - Found in NE-APPLIANCE </var/log/messages> : panic: RestartZMQ: Failed to bind(tcp://127.0.0.1:5500), err: system call interrupted.
Traffic fails for specific VLANs while others on the same NE pair recover.
Absence of "learned MAC change" entries for the impacted VLAN in the Network Extension appliance logs </var/log/messages>.
<timestamp> NE-R1 cgw 10378 - - [Info-arper] : New shadow IP (IP rule) = {tapbr4 00:50:56:##:##:##ff:ff:ff:ff:ff:ff 1 00:50:56:##:##:##00:00:00:00:00:00 #.#.#.# #.#.#.#}
<timestamp> NE-R1 cgw 10378 - - [Info-arper] : New shadow IP (IP rule) = {tapbr4 00:50:56:##:##:## ff:ff:ff:ff:ff:ff 1 00:50:56:##:##:##00:00:00:00:00:00 #.#.#.# #.#.#.#}
<timestamp> NE-R1 cgw 10378 - - [Info-arper] : New shadow IP (IP rule) = {tapbr3 00:50:56:##:##:## ff:ff:ff:ff:ff:ff 1 00:50:56:##:##:## 00:00:00:00:00:00 #.#.#.# #.#.#.#}
VMware HCX 4.11.#
NSX-T / NSX 4.x
Mobility Optimized Networking (MON) Enabled
00:00:00:00:00:00.This issue is resolved in HCX 4.11.4 , available at Broadcom downloads.
Refer >> VMware HCX 4.11.4 Release Notes
If you are having difficulty finding and downloading software, please review the Download Broadcom products and software KB.
Verify Service Health: Access the HCX Manager and check for Network-Extension appliance stability. Review /var/log/messages for ZMQ/CGW panic or restart events.
Verify MAC Learning: Log into the impacted NE-R appliance via CCLI and check if the Gateway MAC has been learned for the failing segment:
Navigate to HCX Cloud Manager Admin Console.
Enter CCLI -> list -> go <Appliance_Number> -> ssh.
Run: grep -r "learned MAC change" /var/log/messages
Check for the impacted Bridge ID (e.g., br5) to see if the NewMac is populated.
Example:
<timestamp> NE-R1 cgw 10378 - - [Info-arper] : New shadow IP (IP rule) = {tapbr3 00:50:56:##:##:## ff:ff:ff:ff:ff:ff 1 00:50:56:##:##:## 00:00:00:00:00:00 #.#.#.# #.#.#.#}
WORKAROUND :
00:00:00:00:00:00), initiate an HA Failover of the Network Extension appliance pair. This forces a fresh initialization of the bridge ports and ARP resolution.<timestamps> UTC NE-R1 cgw 10378 - - [Info-configer] : l2ArpResolver: learned MAC change: <BrId: br3, Intf: tapbr3, Ip: 10.105.62.1, NewMac: <Learned GW MAC>, OldMac: 00:00:00:00:00:00, When: <timestamp> +0000 UTC>
If the issue persists or RCA is required, please engage Broadcom Support immediately. Creating and managing Broadcom cases