When performing a failback or configuring replication from a Disaster Recovery (DR) site to a Production (PR) site, the synchronization process becomes stuck at 50%. The vCenter Server or Site Recovery Manager (SRM) UI displays the following error:
> "A replication error occurred at the vSphere Replication Server for replication '[VM_NAME]'. Details: No connection to VR Server for virtual machine [VM_NAME] on host [ESXi_HOSTNAME] in cluster [CLUSTER_NAME] in [DATACENTER]: Not responding."
Vsphere Replication 9.x
This issue is caused by ESXi host instability at the target (Production) site. When the host running the vSphere Add-on or Replication Server becomes unresponsive or unstable, the TCP communication required for synchronization (typically on ports 31031 or 8123) is interrupted.
To restore replication functionality, the Add-on Replication Server must be moved to a stable environment.
Contributing factors may include: