Following a vCenter certificate renewal or a global state refresh, you observe the following symptoms in your vSAN stretched cluster:
vCenter 8.x
ESXi 8.x
vSAN 8.x
vSAN Stretched Cluster
This issue occurs when a duplicate internal host UUID is shared between two separate vSAN witness appliances. This typically happens if a witness appliance was cloned or deployed from a virtual machine template instead of being deployed as a fresh OVF.
During a vCenter certificate renewal, a host reconnection and cluster state rediscovery are forced. If two distinct appliances share the same UUID, vCenter may incorrectly merge both IP addresses into the active witness role metadata. This creates a logic loop in the unicast agents on the ESXi data hosts as they attempt to route to the same UUID via conflicting IP addresses (e.g., ####.####.####.2 and ####.####.####.3). This conflict breaks communication between the data nodes and the witness.
To resolve this issue and restore cluster stability, you must override the corrupted configuration using the command line.
Step 1: Remove Fault Domains
In the vSphere Client, temporarily remove the ESXi data hosts from their designated stretched cluster fault domains to isolate them from the conflicting configuration.
cd /localhost/####-DC-####-####/computers/####-cl-####-####This command cannot be undone. Verify every parameter before running. WARNING: Removing the witness component breaks stretched cluster site quorum and will impact data availability if the primary and secondary sites are not fully synchronized. Verify all parameters before execution. If you are uncertain about the potential impact, consult a technical lead.
vsan.stretchedcluster.remove_witness .
Task result: success.Reconfigure the stretched cluster and fault domains through the vSphere Client UI, explicitly pointing to the correct, original witness IP address (e.g., ####.####.####.2).