This article provides steps to troubleshoot and resolve a recovery plan failure for a VM caused by an operation timeout error.
Symptoms:
Planned migration for the recovery plan fails at the Configure Storage step for a specific VM with the error "Operation timed out: 900 seconds."
This occurs when the VSR migration is invoked while TRIM/UNMAP is running with large deltas. The TRIM/UNMAP activity triggers a vSAN disk consolidation, and VSR reaches its default timeout period before the consolidation completes.
The following can be seen in the HBR logs:
2023-07-22T15:43:13.857Z esx-xx.vmwarevmc.com Hostd: info hostd[2107594] [Originator@6876 sub=Vimsvc.TaskManager opID=hsl-46f677eb-5580 user=vpxuser] Task Created : haTask--vim.VirtualDiskManager.consolidateDisks-2139137619
2023-07-22T16:55:53.663Z esx-xx.vmwarevmc.com Hostd: info hostd[2103242] [Originator@6876 sub=Vimsvc.TaskManager opID=hsl-46f677eb-5580 user=vpxuser] Task Completed : haTask--vim.VirtualDiskManager.consolidateDisks-2139137619 Status success
Before the recovery plan failover took place, a TRIM/UNMAP disk consolidation task began on the affected VM and ran for more than 1 hour (15:43:13 to 16:55:53 in the log excerpt above). This is well beyond the 900-second timeout, so VSR timed out the recovery plan.
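To confirm this symptom from a log bundle, the consolidation duration can be compared against the 900-second timeout reported in the error. The following is a minimal sketch, not a VMware-provided tool: the function name, the regular expression, and the constant names are hypothetical, and it assumes the log has one entry per line in the same layout as the excerpt above.

```python
import re
from datetime import datetime

# Timeout taken from the error text "Operation timed out: 900 seconds."
TIMEOUT_SECONDS = 900

# Hypothetical pattern; assumes the entry layout shown in the excerpt above.
TASK_RE = re.compile(
    r"^(?P<ts>\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}\.\d{3})Z\s.*?"
    r"Task (?P<event>Created|Completed) : (?P<task>\S+)"
)

def consolidation_durations(lines):
    """Return {task_id: seconds} for each consolidateDisks task that has
    both a 'Task Created' and a 'Task Completed' entry."""
    started = {}
    durations = {}
    for line in lines:
        match = TASK_RE.search(line)
        if not match or "consolidateDisks" not in match.group("task"):
            continue
        ts = datetime.strptime(match.group("ts"), "%Y-%m-%dT%H:%M:%S.%f")
        task = match.group("task")
        if match.group("event") == "Created":
            started[task] = ts
        elif task in started:
            durations[task] = (ts - started[task]).total_seconds()
    return durations

# The two entries from the excerpt above, one per line.
SAMPLE_LINES = [
    "2023-07-22T15:43:13.857Z esx-xx.vmwarevmc.com Hostd: info hostd[2107594] "
    "[Originator@6876 sub=Vimsvc.TaskManager opID=hsl-46f677eb-5580 user=vpxuser] "
    "Task Created : haTask--vim.VirtualDiskManager.consolidateDisks-2139137619",
    "2023-07-22T16:55:53.663Z esx-xx.vmwarevmc.com Hostd: info hostd[2103242] "
    "[Originator@6876 sub=Vimsvc.TaskManager opID=hsl-46f677eb-5580 user=vpxuser] "
    "Task Completed : haTask--vim.VirtualDiskManager.consolidateDisks-2139137619 "
    "Status success",
]

if __name__ == "__main__":
    for task, seconds in consolidation_durations(SAMPLE_LINES).items():
        verdict = "exceeds" if seconds > TIMEOUT_SECONDS else "is within"
        print(f"{task}: {seconds:.0f} s, {verdict} the {TIMEOUT_SECONDS} s timeout")
```

For the two entries above, the consolidation ran for roughly 4,360 seconds (about 1 hour 13 minutes), which is several times longer than the 900-second timeout.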