Operation Failed. Cannot reconfigure replication group 'VM_NAME' (managed object ID: 'GID-'). Details: 'Failed storing configuration state and error'.
Restarting the hms and hbrsrv services on the vSphere Replication appliance alone does not resolve the issue.

This issue is caused by metadata desynchronization between the vSphere Replication Management Service (HMS) and the Host-Based Replication (HBR) agents on the target ESXi host.
When a destination datastore becomes completely full, the HBR processes encounter an I/O hang while attempting to update Persistent State Files (.psf) and delta disks. Even after space is reclaimed, the hbr-agent and hbrsrv processes on the target ESXi host may retain stale memory states or exclusive file locks on the replication metadata. These "zombie" locks prevent the HMS from committing new configuration changes or acknowledging incoming delta blocks.
To resolve this state, the host-level replication services must be restarted on the affected target ESXi host(s) to clear stale metadata locks.
Reclaim Storage
Ensure the destination datastore has sufficient free space. A minimum of 20% free space is recommended to accommodate replication overhead, redo logs, and persistent state files.
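The 20% guideline above can be checked with simple arithmetic. The following is a minimal sketch in Python, not part of the original procedure: the function names and the example path are hypothetical, and it assumes the datastore is reachable as a mounted filesystem path on the machine where the script runs.

```python
import shutil

# The 20% free-space guideline from the step above, as a fraction.
RECOMMENDED_FREE_FRACTION = 0.20


def free_fraction(total_bytes: int, free_bytes: int) -> float:
    """Return free space as a fraction of total capacity."""
    if total_bytes <= 0:
        raise ValueError("total_bytes must be positive")
    return free_bytes / total_bytes


def has_enough_free_space(total_bytes: int, free_bytes: int,
                          minimum: float = RECOMMENDED_FREE_FRACTION) -> bool:
    """True when the free fraction meets or exceeds the minimum."""
    return free_fraction(total_bytes, free_bytes) >= minimum


def check_path(path: str) -> bool:
    """Apply the check to a mounted filesystem path (hypothetical helper)."""
    usage = shutil.disk_usage(path)
    return has_enough_free_space(usage.total, usage.free)
```

For example, `check_path("/vmfs/volumes/datastore1")` (a placeholder path) would report whether that volume currently meets the guideline. The datastore summary in the vSphere Client shows the same capacity and free-space figures.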
Restart Host-Side Replication Services
Log in to the target ESXi host (where the replica files reside) via SSH as root and execute the following commands.
Note: These commands only affect replication traffic and do not impact the running state of virtual machines.
Restart Appliance Services (Optional)
If the issue persists after restarting host services, restart the management services on the vSphere Replication Appliance (VRA) at both the source and destination sites:
Validation
Impact/Risks
Restarting the hbr-agent and hbrsrv services will momentarily pause active replication syncs for all VMs residing on that specific ESXi host. Normal replication will resume automatically once the services are back online. This does not cause any downtime for production virtual machines.