Issue:
ESXi host was placed into maintenance mode.
VMs failed to complete migration causing the host to fail entering maintenance mode and the VMs had to be restarted
ESXi 8.0 U3
vCenter vpxd.log shows the host entering maintenance mode and VMs failing to migrate:
info vpxd[420038] [Originator@6876 sub=vpxLro opID=186b591f] [VpxLRO] -- BEGIN task-423425 -- host-20 -- vim.HostSystem.enterMaintenanceMode -- 52987cda-1e9e-4dd7-e893-3532728be41d(525b9309-e79e-cdcc-1023-4f8710b8222d)
error vpxd[23719] [Originator@6876 sub=drsExec opID=CdrsLoadBalancer-1f115075-317110a6-01] Failed migrating VM [vim.VirtualMachine:vm-100544,<vm1>] to host vim.HostSystem:host-20error vpxd[23731] [Originator@6876 sub=drsExec opID=CdrsLoadBalancer-1f115075-7cba67b0-01] Failed migrating VM [vim.VirtualMachine:vm-100543,<vm2>] to host vim.HostSystem:host-29error vpxd[419862] [Originator@6876 sub=drsExec opID=2a4a2acf-01-0d] Failed migrating VM [vim.VirtualMachine:vm-116004,<vm2>] to host vim.HostSystem:host-42error vpxd[419820] [Originator@6876 sub=drsExec opID=186b591f-01-01] Failed migrating VM [vim.VirtualMachine:vm-103291,<vm3>] to host vim.HostSystem:host-29
ESXi hostd.log shows Failed migrating VM
error vpxd[23719] [Originator@6876 sub=drsExec opID=CdrsLoadBalancer-1f115075-317110a6-01] Failed migrating VM [vim.VirtualMachine:vm-100544,<vm1>] to host vim.HostSystem:host-20
In(05) vmx - Log for VMware ESX pid=2135295 version=8.0.3 build=build-24674464 option=Release...In(05) vmx - Hostname=<vm1>...In(05) vmx - DICT scsi0:1.fileName = "<vm1>.vmdk"...In(05) vcpu-0 - VTHREAD 65891591936 "vcpu-0" wid 2135717...In(05) vcpu-0 - Migrate: Preparing to suspend.In(05) vcpu-0 - Migrate: VM starting stun, waiting 100 seconds for go/no-go message....In(05) vcpu-0 - Closing disk 'scsi3:0'In(05) vcpu-0 - DISKLIB-CBT : Shutting down change tracking for untracked fid 4143688.In(05) vcpu-0 - DISKLIB-CBT : Successfully disconnected CBT node.In(05) vcpu-0 - DISKLIB-VMFS : "vsan://522c836ff7f38e0a-2d4e6def3b6c0f58/e90fa467-a28a-dd82-227e-b49691db2ddc" : closed.In(05) vcpu-0 - Closing disk 'scsi2:0'In(05) vcpu-0 - DISKLIB-CBT : Shutting down change tracking for untracked fid 5913159.In(05) vcpu-0 - DISKLIB-CBT : Successfully disconnected CBT node.In(05) vcpu-0 - DISKLIB-VMFS : "vsan://522c836ff7f38e0a-2d4e6def3b6c0f58/b30ea467-8e50-074a-23ff-b49691db2ddc" : closed.In(05) vcpu-0 - Closing disk 'scsi1:0'In(05) vcpu-0 - DISKLIB-CBT : Shutting down change tracking for untracked fid 6437446.In(05) vcpu-0 - DISKLIB-CBT : Successfully disconnected CBT node.In(05) vcpu-0 - DISKLIB-VMFS : "vsan://522c836ff7f38e0a-2d4e6def3b6c0f58/c10ca467-6ec3-fd2d-6ddf-b49691db2ddc" : closed.In(05) vcpu-0 - Closing disk 'scsi0:1'In(05) vcpu-0 - DISKLIB-CBT : Shutting down change tracking for untracked fid 10697289.In(05) vcpu-0 - DISKLIB-CBT : Successfully disconnected CBT node.
In(182) vmkernel: cpu10:2135717)CBT: 765: Disconnecting the cbt device 3f3a48-cbt with filehandle 4143688In(182) vmkernel: cpu10:2135717)FiltModS: 379: Aborted 0 IOs and completed 0 IOs after exit of upcall threadIn(182) vmkernel: cpu10:2135717)VDFM: 1301: Destroying VDFM file node 293a40-vdfm with fid 2701888.In(182) vmkernel: cpu10:2135717)CBT: 765: Disconnecting the cbt device 5a3a47-cbt with filehandle 5913159In(182) vmkernel: cpu10:2135717)FiltModS: 379: Aborted 0 IOs and completed 0 IOs after exit of upcall threadIn(182) vmkernel: cpu10:2135717)VDFM: 1301: Destroying VDFM file node 583a3c-vdfm with fid 5782076.In(182) vmkernel: cpu10:2135717)CBT: 765: Disconnecting the cbt device 623a46-cbt with filehandle 6437446In(182) vmkernel: cpu10:2135717)FiltModS: 379: Aborted 0 IOs and completed 0 IOs after exit of upcall threadIn(182) vmkernel: cpu10:2135717)VDFM: 1301: Destroying VDFM file node 263a3f-vdfm with fid 2505279.In(182) vmkernel: cpu10:2135717)CBT: 765: Disconnecting the cbt device a33a49-cbt with filehandle 10697289
...
Wa(180) vmkwarning: cpu10:2097222)WARNING: VMotion: 1134: 8567820442367493545 S: Maximum switchover time (100 seconds) reached. Failing VMotion; VM should resume on source.
2025-05-18T15:52:04.209Z In(14) shell[2240556]: [root]: esxcli vm process kill --type=hard --world-id=2135298
Investigating the disk issue
In ESXi /var/run/log/veecdp.log
In(158) veecdp[2135295]: [0002135717] 468 (I) {501d141e-37a0-3ef8-475e-4bdc339adb9f} [6000c292-3040-1b35-50de-eac293636d49] [DiskFilter] DiskClose finishedIn(158) veecdp[2135295]: [0002135717] 591 (I) {501d141e-37a0-3ef8-475e-4bdc339adb9f} [6000c293-b79c-81ce-b790-2a7842fdc5ec] [DiskFilter] DiskClose startedIn(158) veecdp[2135295]: [0002135717] 591 (I) {501d141e-37a0-3ef8-475e-4bdc339adb9f} [6000c293-b79c-81ce-b790-2a7842fdc5ec] [DiskSync] DisableIn(158) veecdp[2135295]: [0002135717] 591 (I) [WorkGroup] WorkGroup wait started, self: 0xeffc11b80In(158) veecdp[2135295]: [0002135717] 591 (I) [WorkGroup] WorkGroup wait finished, self: 0xeffc11b80...In(158) veecdp[2135295]: [0002135717] 595 (I) {501d141e-37a0-3ef8-475e-4bdc339adb9f} [6000c293-b79c-81ce-b790-2a7842fdc5ec] [DAT] Session destroy started, this: 0xf544c9650In(158) veecdp[2135295]: [0002135717] 595 (I) {501d141e-37a0-3ef8-475e-4bdc339adb9f} [6000c293-b79c-81ce-b790-2a7842fdc5ec] [DAT] Session protocol destroy started, self: 0xf544c9340In(158) veecdp[2135295]: [0002135717] 595 (I) [WorkGroup] WorkGroup wait started, self: 0xeffc12a60In(158) veecdp[2135295]: [0002135717] 595 (I) [WorkGroup] WorkGroup wait finished, self: 0xeffc12a60In(158) veecdp[2135295]: [0002135717] 595 (I) [WorkGroup] WorkGroup destroying, self: 0xeffc12a60In(158) veecdp[2135295]: [0002135717] 595 (I) {501d141e-37a0-3ef8-475e-4bdc339adb9f} [6000c293-b79c-81ce-b790-2a7842fdc5ec] [Transceiver] [DAT] Destroying, self: 0xf544c8650Er(155) veecdp[2135295]: [0002135717] 595 (E) [Timer] Failed to remove timer, this: 0xeffc07668, status 'Object not found'In(158) veecdp[2135295]: [0002135717] 595 (I) [WorkGroup] WorkGroup wait started, self: 0xeffc13200In(158) veecdp[2135295]: [0002135709] 728 (I) {501d141e-37a0-3ef8-475e-4bdc339adb9f} [6000c29f-b403-0d50-eaf7-99c84bdc110a] [OneOfConnector] [DAT] Begin connecting to SHM:33035In(158) veecdp[2135295]: [0002135709] 728 (I) {501d141e-37a0-3ef8-475e-4bdc339adb9f} [6000c29f-b403-0d50-eaf7-99c84bdc110a] [ShmSocketConnector] [:33035] Begin SHM connection to 127.0.0.1:33035In(158) veecdp[2135295]: [0002135709] 728 (I) {501d141e-37a0-3ef8-475e-4bdc339adb9f} [6000c29f-b403-0d50-eaf7-99c84bdc110a] [TcpConnector] [:33035] Begin connection attempt to 127.0.0.1:33035In(158) veecdp[2135295]: [0002135709] 728 (I) {501d141e-37a0-3ef8-475e-4bdc339adb9f} [6000c29f-b403-0d50-eaf7-99c84bdc110a] [TcpConnector] [:33035] Disable Nagle algorithm for socket (socket=135)In(158) veecdp[2135295]: [0002135709] 728 (I) {501d141e-37a0-3ef8-475e-4bdc339adb9f} [6000c29f-b403-0d50-eaf7-99c84bdc110a] [TcpConnector] [:33035] Setting send & receive timeout for socket (socket=135)In(158) veecdp[2135295]: [0002135709] 728 (I) {501d141e-37a0-3ef8-475e-4bdc339adb9f} [6000c29f-b403-0d50-eaf7-99c84bdc110a] [TcpConnector] [:33035] Setting keep alive settings for socket (socket=135). Idle time: 60 seconds, retry interval: 5 secondsIn(158) veecdp[2135295]: [0002135709] 728 (I) {501d141e-37a0-3ef8-475e-4bdc339adb9f} [6000c29f-b403-0d50-eaf7-99c84bdc110a] [TcpConnector] [:33035] Connecting to 127.0.0.1:33035Er(155) veecdp[2135295]: [0002135709] 728 (E) {501d141e-37a0-3ef8-475e-4bdc339adb9f} [6000c29f-b403-0d50-eaf7-99c84bdc110a] [TcpConnector] [:33035] Connection attempt failed with error 111In(158) veecdp[2135295]: [0002135709] 728 (I) {501d141e-37a0-3ef8-475e-4bdc339adb9f} [6000c29f-b403-0d50-eaf7-99c84bdc110a] [OneOfConnector] [DAT] Connection to SHM:33035 failed: error=111In(158) veecdp[2135295]: [0002135709] 728 (I) {501d141e-37a0-3ef8-475e-4bdc339adb9f} [6000c29f-b403-0d50-eaf7-99c84bdc110a] [OneOfConnector] [DAT] Begin connecting to TCP :33033In(158) veecdp[2135295]: [0002135709] 728 (I) {501d141e-37a0-3ef8-475e-4bdc339adb9f} [6000c29f-b403-0d50-eaf7-99c84bdc110a] [TcpConnector] [:33033] Begin connection attempt to 127.0.0.1:33033In(158) veecdp[2135295]: [0002135709] 728 (I) {501d141e-37a0-3ef8-475e-4bdc339adb9f} [6000c29f-b403-0d50-eaf7-99c84bdc110a] [TcpConnector] [:33033] Setting buffer size for TCP socket (socket=135, size=65536)In(158) veecdp[2135295]: [0002135709] 728 (I) {501d141e-37a0-3ef8-475e-4bdc339adb9f} [6000c29f-b403-0d50-eaf7-99c84bdc110a] [TcpConnector] [:33033] Disable Nagle algorithm for socket (socket=135)
IOFilter: Name: veecdp Vendor: VEE Version: 12.3.20-1OEM.800.1.0.20613240 Description: Veeam CDP IOFilter ID: VEE_bootbank_veecdp_12.3.20-1OEM.800.1.0.20613240 LocalID: veecdp Class: replication Release Date: 2025-03-07T16:39:47.883348+00:00 Enabled: YesSolution Recommendation:
Vendor support (Veeam) should be contacted to look into why the close operations in the iofilter did not finish.