Migrating a VM using Replication Assisted vMotion (RAV) fails repeatedly with the following error: "Not able to get group instance post snapshot."
HCX 4.11
The failure occurs during the switchover phase because the online sync runs too long (more than 5 hours) without completing. In the current implementation, the migration fails by design if the sync does not produce a group instance after 5 retry attempts.
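The retry behavior described above can be pictured with a minimal sketch. This is not HCX code: the function name wait_for_group_instance, the try_sync callback, and the MigrationFailed exception are hypothetical, and only the limit of 5 attempts and the error text come from this article.

```python
from typing import Callable, Optional

MAX_ATTEMPTS = 5  # retry limit stated in this article

ERROR_TEXT = "Not able to get group instance post snapshot."


class MigrationFailed(Exception):
    """Hypothetical exception standing in for the failed migration."""


def wait_for_group_instance(
    try_sync: Callable[[], Optional[str]],
    max_attempts: int = MAX_ATTEMPTS,
) -> str:
    """Call try_sync until it returns a group instance or attempts run out.

    try_sync is a hypothetical callback that returns an instance ID when a
    sync cycle completes, or None when the cycle did not finish.
    """
    for _ in range(max_attempts):
        instance = try_sync()
        if instance is not None:
            return instance
    # Every attempt ended without a group instance: fail the migration.
    raise MigrationFailed(ERROR_TEXT)
```

With a callback that never returns an instance (as happens when the sync cannot keep up with writes), the sketch raises after the fifth attempt, which mirrors the repeated failure seen in the migration.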
app.log entries also show repeated sync conflicts:
This issue is linked to high data churn on the VM, where data is written faster than it can be replicated within the RPO cycle.
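As a rough illustration of why high churn prevents the sync from completing, the following sketch estimates how long it takes to drain a replication backlog while new writes keep arriving. The function names and all figures are assumptions made for illustration, not values reported by HCX; real throughput and churn vary over time.

```python
from typing import Optional


def estimate_sync_seconds(
    backlog_mb: float, churn_mb_s: float, throughput_mb_s: float
) -> Optional[float]:
    """Estimate the time needed to replicate a backlog under steady churn.

    Returns None when data is written at least as fast as it is replicated,
    because the backlog then never shrinks.
    """
    net_rate = throughput_mb_s - churn_mb_s  # MB/s by which the backlog shrinks
    if net_rate <= 0:
        return None
    return backlog_mb / net_rate


def sync_fits_window(
    backlog_mb: float,
    churn_mb_s: float,
    throughput_mb_s: float,
    window_s: float,
) -> bool:
    """Return True if the estimated sync time fits within window_s seconds."""
    seconds = estimate_sync_seconds(backlog_mb, churn_mb_s, throughput_mb_s)
    return seconds is not None and seconds <= window_s
```

For example, with a 36000 MB backlog, 5 MB/s of churn and 15 MB/s of replication throughput, the backlog shrinks at 10 MB/s and the sync takes about 3600 seconds. If churn rises to 20 MB/s against the same throughput, the function returns None: the sync can never catch up, which is the situation described above.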
If zero downtime is not a strict requirement, use Bulk Migration for high-churn VMs. Bulk Migration is more tolerant of heavy write activity and does not rely on continuous replication within RPO limits.