HCX Replication Assisted vMotion (RAV) migrations fail with "Source side relocate failed for the virtual machine" in HCX 9.0.0

Article ID: 438922

Products

VMware HCX

Issue/Introduction

  • HCX Replication Assisted vMotion (RAV) migrations fail during the switchover phase.
  • Error message in HCX UI:
    vMotion failed. System Error. Source side error is : Source side relocate failed for the virtual machine. Could not complete network copy for file /vmfs/volumes/#/#.vmdk : Target side error is : Operation timed out.
    or
    vMotion failed. System Error. Source side error is : Source side relocate failed for the virtual machine. Could not complete network copy for file /vmfs/volumes/#/#.vmdk : Target side error is : An error occurred while communicating with the remote host.
  • The following errors appear in /common/logs/admin/app.log:
    <timestamps> UTC [RAVService_SvcThread-13373, Ent: HybridityAdmin, , TxId: TxId: #######-####-####-####-#######] ERROR c.v.h.s.rav.jobs.RAVSwitchoverJob- [migId=#######-####-####-####-#######] Error while executing RAVSwitchoverJob state 'VERIFY_RESULT'.
    com.vmware.vchs.hybridity.migration.common.MigrationException: vMotion failed. System Error. Source side error is : Source side relocate failed for the virtual machine. Could not complete network copy for file /vmfs/volumes/#/#/<vmname-disk>.vmdk : Target side error is : Operation timed out.
          at com.vmware.vchs.hybridity.migration.common.MigrationJobHelper.verifySubflowJobsSuccessful(MigrationJobHelper.java:188)
    or
    ERROR c.v.h.s.v.j.MonitorSourceSideProgressWorkflow- [migId=#######-####-####-####-#######] Source side relocate 'task-#####' failed for the virtual machine. Error is Could not complete network copy for file /vmfs/volumes/#/#.vmdk :. Total progress % is 'null'.
  • HCX Diagnostics does not display any errors.
  • Network performance between HCX Uplinks shows good results with no detectable errors or packet loss.
  • Small VMs may eventually complete, though they take an unusually long time.
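
To check whether a copy of app.log contains the signatures listed above, a short script can group the matching lines by migration ID. The following is a minimal sketch, not an official Broadcom tool. The function name find_rav_copy_failures is hypothetical, and attributing an exception line that carries no migId to the most recent migId seen on an earlier line is an assumption that can misattribute entries when log lines from several migrations interleave.

```python
import re
from collections import defaultdict

# Signatures taken from the log excerpts in this article.
MIG_ID = re.compile(r"\[migId=([^\]]+)\]")
NETWORK_COPY = "Could not complete network copy for file"


def find_rav_copy_failures(lines):
    """Group log lines that report the network-copy failure by migration ID.

    Heuristic (assumption): a matching line without its own migId, such as
    the MigrationException line, is attributed to the last migId seen.
    """
    hits = defaultdict(list)
    last_id = "unknown"
    for line in lines:
        match = MIG_ID.search(line)
        if match:
            last_id = match.group(1)
        if NETWORK_COPY in line:
            hits[last_id].append(line.rstrip("\n"))
    return dict(hits)
```

The function accepts any iterable of lines, so an open file handle for a copy of /common/logs/admin/app.log can be passed directly. Each key in the result is a migration ID that can be quoted when opening a support case.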

Environment

VCF Operations HCX 9.0.x

Cause

During the RAV switchover phase, HCX attempts to copy the main (base) disk instead of the delta disk. This triggers a full re-copy of the virtual machine's data rather than a delta-only sync. For large virtual machines, the unexpected volume of data cannot be transferred within the available maintenance window or the vMotion switchover timeout, which results in the "Operation timed out" failure.
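
The size of the gap between a full re-copy and a delta-only sync can be illustrated with simple arithmetic. The sketch below uses purely hypothetical figures (a 2 TiB base disk, a 20 GiB delta, and 1 Gbps of sustained throughput); none of these values, nor the helper name transfer_hours, come from HCX, and the actual switchover timeout is not stated in this article.

```python
def transfer_hours(size_gib, throughput_mbps):
    """Hours needed to copy size_gib GiB at throughput_mbps megabits/second."""
    bits = size_gib * 1024**3 * 8
    seconds = bits / (throughput_mbps * 1_000_000)
    return seconds / 3600


# Hypothetical VM: 2 TiB base disk, 20 GiB of changed data, 1 Gbps sustained.
full_copy_hours = transfer_hours(2048, 1000)
delta_sync_minutes = transfer_hours(20, 1000) * 60

print(f"Full base-disk copy: {full_copy_hours:.1f} hours")
print(f"Delta-only sync:     {delta_sync_minutes:.1f} minutes")
```

Under these assumptions the full copy takes close to five hours, while the delta sync takes about three minutes. This is consistent with the symptom that small VMs may eventually complete while large ones time out.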

Resolution

If you experience this issue, please contact Broadcom Support and provide the following log files:

  • HCX log bundles from both the Source and Target environments (including the DB and IX appliances).

  • ESXi host logs from both the Source and Target hosts.

Note: Target ESXi host details can be found in the migration's Events view.