After upgrading NSX Edge nodes, all IPSec VPN tunnels configured on a Tier-1 (T1) Gateway fail to establish and remain in a down state. This results in a production outage for any services relying on these VPN connections.
The VPN tunnels were functioning normally prior to the upgrade. No configuration changes were made to the VPN settings. The issue may occur on different T1 Edges across subsequent upgrades.
Symptoms include:
VMware NSX
During the Edge node upgrade process, VPN services may attempt to re-establish connections before the Edge node has fully completed its post-upgrade initialization. This timing condition prevents the tunnels from coming up properly even though no configuration changes occurred.
To restore VPN tunnel connectivity, perform a controlled failover by placing each Edge node into maintenance mode:
If the tunnels remain down after completing these steps, review the Edge node logs for any additional errors and contact Broadcom Support for further assistance.
When opening a support request, provide: